• Title/Summary/Keyword: Visual Image Content

Investigating the End-User Tagging Behavior and its Implications in Flickr (플리커 이미지 자료에 대한 이용자 태깅 행태 분석과 활용 방안)

  • Kim, Hyun-Hee;Kim, Min-Kyung
    • Journal of Information Management / v.40 no.2 / pp.71-94 / 2009
  • Indexing images with traditional methods such as taxonomies is not always efficient because of their visual content. This study examined how to apply folksonomies to image retrieval. First, we developed a category model for image tags found in Flickr; the model includes five categories and seventeen subcategories. Second, to evaluate how well the model represents the various image tags and to investigate end-user tagging behavior, three researchers classified the sampled image tags (the 141 most popular tags, 105 tags from three individual tag clouds, and 3,848 tags assigned to 156 images) according to the model. Finally, based on the results, we proposed three methods for efficient image retrieval: extending folksonomies by combining them with ontologies; improving image retrieval efficiency using visual content and folksonomies; and updating taxonomies using folksonomies.

Analysis and description of the Visual Image Structure of Lemon Juice Squeezer, designed for Italy ALESSI company by Philippe Starck (필립 스탁의 디자인작 '레몬즙 짜개(Lemon Juice Squeezer)'에 대한 시각형상 구조 분석과 기술)

  • 조성근
    • Archives of design research / v.16 no.2 / pp.405-414 / 2003
  • The form of objects placed in a given space can be described objectively once their visual image structure is grasped. This cannot be done without first analyzing the basic program, that is, the visual expression. When the whole visual image of the interior utensil in question is presented, the mindset of its designer can be deduced from it. The study therefore examined the lemon juice squeezer, one of the kitchen utensils that Philippe Starck designed for the Italian company ALESSI. As the study method, 'The Elements of Dynamic Symmetry' by Prof. Jay Hambidge was put into practice, and a 'paradigm' analysis covering the whole 'lemon juice squeezer' image was attempted. To describe it, the visual mark description method by Prof. Bok-Young Kim was used. In conclusion, the relationship between interior space and articles, the relationship between object and user, and the modeling critique or analysis of the product itself should not be approached emotionally. Instead, the study presented an art-analytic methodology that can analyze and describe the visual image structure numerically and confirm the relationship between form and content.

Content based Image Retrieval System by Shape Global Feature and Histogram (형태 전역특징과 히스토그램을 이용한 내용 기반 영상 검색 시스템)

  • 황병곤;정성호;이상열
    • Journal of Korea Society of Industrial Information Systems / v.7 no.4 / pp.9-16 / 2002
  • Content-based image retrieval methods in multimedia information retrieval use primary visual features such as color, texture, and shape. Color and texture are generally used as the features of image retrieval systems. However, such systems may make errors when retrieving similar images, because two images with different shapes can represent very different content. The use of shape-describing features is therefore essential in an efficient content-based image retrieval system. In this paper, after a filtering process based on global features of object boundaries, we built an improved shape-similarity image retrieval system that uses a histogram of shape information.
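
The abstract does not specify the shape descriptor or the histogram comparison, so the following is only a minimal sketch of the general idea of comparing shapes through histograms. The centroid-distance descriptor, the bin count, and the names `shape_histogram` and `histogram_intersection` are assumptions made for illustration, not the authors' method.

```python
import math

def shape_histogram(boundary, bins=8):
    """Normalized histogram of centroid-to-boundary distances (assumed descriptor)."""
    cx = sum(x for x, _ in boundary) / len(boundary)
    cy = sum(y for _, y in boundary) / len(boundary)
    dists = [math.hypot(x - cx, y - cy) for x, y in boundary]
    max_d = max(dists) or 1.0  # avoid division by zero for a degenerate shape
    hist = [0] * bins
    for d in dists:
        # Scale by the largest distance so the histogram ignores object size.
        idx = min(int(d / max_d * bins), bins - 1)
        hist[idx] += 1
    return [h / len(dists) for h in hist]

def histogram_intersection(h1, h2):
    """Similarity in [0, 1]; 1 means identical normalized histograms."""
    return sum(min(a, b) for a, b in zip(h1, h2))

# Hypothetical boundary point sets used only to exercise the functions.
square = [(0, 0), (2, 0), (2, 2), (0, 2)]
big_square = [(0, 0), (4, 0), (4, 4), (0, 4)]
elongated = [(0, 0), (10, 0), (10, 1), (0, 1), (5, 0), (5, 1)]

same_shape_score = histogram_intersection(
    shape_histogram(square), shape_histogram(big_square))
different_shape_score = histogram_intersection(
    shape_histogram(square), shape_histogram(elongated))
```

In this sketch the two squares score higher than the square against the elongated outline, because the distances are scaled by the largest one before binning, which makes the descriptor insensitive to object size.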

COLORNET: Importance of Color Spaces in Content based Image Retrieval

  • Judy Gateri;Richard Rimiru;Micheal Kimwele
    • International Journal of Computer Science & Network Security / v.23 no.5 / pp.33-40 / 2023
  • Content-Based Image Retrieval (CBIR) is the mainstay of current image retrieval frameworks. In the most typical retrieval method, the user submits a query image and the system extracts visual characteristics such as shape, color, and texture from the images. Most techniques extract features and classify images in the RGB color space, because it is the images' default color space and those techniques do not convert the images to another one. To determine the most effective color space for retrieving images, this research discusses the transformation of RGB to different color spaces, feature extraction, and the use of Convolutional Neural Networks for retrieval.
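
As a rough illustration of the color-space transformation step only (not the feature extraction or the neural network), the sketch below converts RGB pixels with Python's standard `colorsys` module. The helper name `convert_pixels`, the 0-255 input range, and the choice of HSV, HLS, and YIQ as target spaces are assumptions; the abstract does not say which color spaces the paper evaluates.

```python
import colorsys

# Converters from the standard library; each expects r, g, b in [0.0, 1.0].
CONVERTERS = {
    "hsv": colorsys.rgb_to_hsv,
    "hls": colorsys.rgb_to_hls,
    "yiq": colorsys.rgb_to_yiq,
}

def convert_pixels(pixels, space="hsv"):
    """Convert a list of 8-bit (r, g, b) tuples to the named color space."""
    convert = CONVERTERS[space]
    return [convert(r / 255.0, g / 255.0, b / 255.0) for r, g, b in pixels]

# Hypothetical three-pixel "image": pure red, pure green, white.
image = [(255, 0, 0), (0, 255, 0), (255, 255, 255)]
hsv_image = convert_pixels(image, "hsv")
yiq_image = convert_pixels(image, "yiq")
```

A retrieval pipeline could then compute its features on `hsv_image` or `yiq_image` instead of the raw RGB values and compare retrieval quality across spaces.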

MPEG-7 Visual Identifier

  • Oh, Weon-Geun
    • Proceedings of the IEEK Conference / 2008.06a / pp.41-44 / 2008
  • Digital visual information (image and video) plays an important role in our society, and every day more visual information becomes available from many sources around the world. In a growing number of cases, this visual information is created, stored, retrieved, and re-used by computational systems. However, modification, distortion, and compilation of the information through transcoding or legal or illegal processing often make these operations difficult. Consequently, advanced technology or commercial software for retrieving and identifying the desired information in this massive and diverse body of content is strongly required. This paper describes some recent activities and technical content of the MPEG-7 Visual Group, especially with regard to the Visual Identifier.

Video Content Manipulation Using 3D Analysis for MPEG-4

  • Sull, Sanghoon
    • Journal of Broadcast Engineering / v.2 no.2 / pp.125-135 / 1997
  • This paper is concerned with realistic manipulation of content in video sequences, which is one of the content-based functionalities of the MPEG-4 Visual standard. We present an approach to synthesizing video sequences by using the intermediate outputs of three-dimensional (3D) motion and depth analysis. For concreteness, we focus on video showing the 3D motion of an observer relative to a scene containing planar runways (or roads). We first present a simple runway (or road) model. We then describe a method of identifying the runway (or road) boundary in the image using the Point of Heading Direction (PHD), which is defined as the image of the ray along which the camera moves. The 3D motion of the camera is obtained from one of the existing 3D analysis methods. A video sequence containing a runway is then manipulated by (i) coloring the part of the scene above a vanishing line, say blue, to show sky, (ii) filling in the occluded parts of the scene, and (iii) overlaying the identified runway edges and placing yellow disks on them to simulate lights. Experimental results for a real video sequence are presented.

Web Service Workflows for Distributed Visual Media Retrieval Framework

  • Nah, Yun-Mook;Lee, Bog-Ju;Kim, Jung-Sun;Kwon, O-Byoung;Suh, Bo-Won;Ahn, Chul-Bum;Shin, Dong-Hoon
    • Journal of Korea Multimedia Society / v.10 no.6 / pp.707-715 / 2007
  • The need for content-based retrieval from visual media, such as image and video data, is increasing rapidly in many applications, such as electronic art museums, internet shopping malls, internet search engines, and medical information systems. In our previous research, we proposed an architecture called HERMES, a Web Service-enabled visual media retrieval framework. In this paper, we propose the Web Service workflows that are employed in HERMES. We describe how we designed the workflows for service registration and query processing in the framework. In particular, we explain how metadata and ontology can be used to realize more intelligent content-based retrieval on visual media data.

Dual Autostereoscopic Display Platform for Multi-user Collaboration with Natural Interaction

  • Kim, Hye-Mi;Lee, Gun-A.;Yang, Ung-Yeon;Kwak, Tae-Jin;Kim, Ki-Hong
    • ETRI Journal / v.34 no.3 / pp.466-469 / 2012
  • In this letter, we propose a dual autostereoscopic display platform employing a natural interaction method, which will be useful for sharing visual data with users. To provide 3D visualization of a model to users who collaborate with each other, a beamsplitter is used with a pair of autostereoscopic displays, providing a visual illusion of a floating 3D image. To interact with the virtual object, we track the user's hands with a depth camera. The gesture recognition technique we use requires no initialization process, such as specific poses or gestures, and supports several commands for controlling virtual objects. Experimental results show that our system performs well in visualizing 3D models in real time and handling them under unconstrained conditions, such as complicated backgrounds or a user wearing short sleeves.

Visual Feature Extraction for Image Retrieval using Wavelet Coefficient’s Fuzzy Homogeneity and High Frequency Energy (웨이브릿 계수의 퍼지 동질성과 고주파 에너지를 이용한 영상 검색용 특징벡터 추출)

  • 박원배;류은주;송영준
    • The Journal of the Korea Contents Association / v.4 no.1 / pp.18-23 / 2004
  • In this paper, we propose a new visual feature extraction method for content-based image retrieval (CBIR) based on the wavelet transform, which has both spatial-frequency and multi-resolution characteristics. We extract visual features for each frequency band of the wavelet transform and use them for CBIR. The lowest frequency band carries the spatial information of the original image. We extract L feature vectors using fuzzy homogeneity in the wavelet domain, which considers both the wavelet coefficients and the spatial information of each coefficient. We also extract 3 feature vectors using the energy values of the high-frequency bands and store them in the image database. For a query, we retrieve the most similar image from the image database according to the 10 largest homograms (normalized fuzzy homogeneity vectors) and 3 energy values. Simulation results on 90 texture images show that the proposed method achieves good accuracy in image retrieval.
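
To make the high-frequency energy features more concrete, here is a minimal sketch that computes one energy value for each of the three detail bands of a wavelet decomposition. The abstract does not name the wavelet, the decomposition depth, or the energy definition, so the one-level Haar transform with averaging, the mean-squared energy, and all function names are assumptions; the fuzzy homogeneity features are not covered.

```python
def haar_2d(image):
    """One-level 2D Haar transform of an even-sized grayscale image (list of rows).

    Returns four half-size bands: approximation, column detail, row detail,
    and diagonal detail.
    """
    approx, col_detail, row_detail, diag_detail = [], [], [], []
    for r in range(0, len(image), 2):
        a_row, c_row, r_row, d_row = [], [], [], []
        for c in range(0, len(image[0]), 2):
            a, b = image[r][c], image[r][c + 1]
            d, e = image[r + 1][c], image[r + 1][c + 1]
            a_row.append((a + b + d + e) / 4.0)  # local average (low frequency)
            c_row.append((a - b + d - e) / 4.0)  # left column minus right column
            r_row.append((a + b - d - e) / 4.0)  # top row minus bottom row
            d_row.append((a - b - d + e) / 4.0)  # diagonal difference
        approx.append(a_row)
        col_detail.append(c_row)
        row_detail.append(r_row)
        diag_detail.append(d_row)
    return approx, col_detail, row_detail, diag_detail

def band_energy(band):
    """Mean of squared coefficients in one band (assumed energy definition)."""
    values = [v for row in band for v in row]
    return sum(v * v for v in values) / len(values)

def high_frequency_energies(image):
    """Three energy features, one per high-frequency band."""
    _, col_detail, row_detail, diag_detail = haar_2d(image)
    return [band_energy(col_detail), band_energy(row_detail), band_energy(diag_detail)]

# Hypothetical 4x4 test textures.
flat = [[5, 5, 5, 5] for _ in range(4)]
vertical_stripes = [[0, 1, 0, 1] for _ in range(4)]

flat_features = high_frequency_energies(flat)
stripe_features = high_frequency_energies(vertical_stripes)
```

A flat texture yields zero energy in all three bands, while vertical stripes put all their energy in the column-detail band, which is why such values can help distinguish textures.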

Reaction Times to Predictable Visual Patterns Reflect Neural Responses in Early Visual Cortex

  • Joo, Sung Jun
    • Science of Emotion and Sensibility / v.24 no.2 / pp.57-64 / 2021
  • It has long been speculated that the visual system should use a coding strategy that takes advantage of statistical redundancies in images. But how such a coding strategy should manifest in neural responses has been less clear. Low-level image structure related to the power spectrum of natural images appears to be captured by a hard-wired efficient code in the retina of the fly and precortical structures like the LGN of cats that maximizes information content through the limited capacity channel of the optic nerve. But visual images are typically filled with higher-order structure beyond that captured by the power spectrum and visual cortex is not constrained by the same capacity limits as the optic nerve. Whether and how visual cortex can flexibly code for higher order redundancies is unknown. Here we show using psychophysical techniques that the neural response in early human visual cortex may be modulated by orientation redundancies in images such that a visual feature that is contained within a predictive pattern results in slower reaction times than a feature that deviates from a pattern, suggesting lower neural responses to predictable stimuli in the visual cortex. Our results point to a neural response in early visual cortex that is sensitive to global patterns and redundancies in visual images and is in marked contrast to standard models of cortical visual processing.