• Title/Summary/Keyword: Visual Information processing

Fast Extraction of Objects of Interest from Images with Low Depth of Field

  • Kim, Chang-Ick;Park, Jung-Woo;Lee, Jae-Ho;Hwang, Jenq-Neng
    • ETRI Journal / v.29 no.3 / pp.353-362 / 2007
  • In this paper, we propose a novel unsupervised video object extraction algorithm for individual images or image sequences with low depth of field (DOF). Low DOF is a popular photographic technique which enables the representation of the photographer's intention by giving a clear focus only on an object of interest (OOI). We first describe a fast and efficient scheme for extracting OOIs from individual low-DOF images and then extend it to deal with image sequences with low DOF in the next part. The basic algorithm unfolds into three modules. In the first module, a higher-order statistics map, which represents the spatial distribution of the high-frequency components, is obtained from an input low-DOF image. The second module locates the block-based OOI for further processing. Using the block-based OOI, the final OOI is obtained with pixel-level accuracy. We also present an algorithm to extend the extraction scheme to image sequences with low DOF. The proposed system does not require any user assistance to determine the initial OOI. This is possible due to the use of low-DOF images. The experimental results indicate that the proposed algorithm can serve as an effective tool for applications, such as 2D to 3D and photo-realistic video scene generation.
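The first module described above, a higher-order statistics map that highlights high-frequency regions, can be sketched as follows. This is a minimal illustration and not the authors' implementation: it assumes the statistic is the fourth-order central moment over a small square window, and the function name `hos_map` and the `radius` parameter are hypothetical.

```python
def hos_map(image, radius=1):
    """Fourth-order central moment of each pixel's local window.

    Assumption: the window is a (2*radius+1) x (2*radius+1) square clipped
    at the image border. The abstract does not specify these details.
    """
    height, width = len(image), len(image[0])
    result = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            # Collect the neighbourhood, clipping at the image border.
            window = [
                image[j][i]
                for j in range(max(0, y - radius), min(height, y + radius + 1))
                for i in range(max(0, x - radius), min(width, x + radius + 1))
            ]
            mean = sum(window) / len(window)
            result[y][x] = sum((v - mean) ** 4 for v in window) / len(window)
    return result


# Toy 6x6 image: a flat (defocused-looking) left half and a textured right half.
toy_image = [
    [100 if x < 3 else (200 if (x + y) % 2 == 0 else 0) for x in range(6)]
    for y in range(6)
]
toy_hos = hos_map(toy_image)
```

In the toy image, the flat region yields zero and the textured region yields large values, which is the property a later block-level step could threshold to locate the in-focus object.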

Visual User Defined Schema Integration at Multimedia Database Environment (멀티미디어 데이터베이스 환경에서 시각화된 사용자 정의 스키마 통합)

  • 이현창
    • Journal of the Korea Society of Computer and Information / v.9 no.2 / pp.57-62 / 2004
  • The number of application systems that process information using databases is increasing. Enterprises that hold large amounts of data often do not possess the data they need; instead they hold unrelated, independent, and individual data. As a result, their holdings consist only of disparate data. Disparate data is ambiguous and does not support current integrated information. A data warehouse may provide a solution to these problems. Because of its complexity, building a data warehouse requires systematic design. This paper describes an efficient design methodology that uses a visual environment for data warehouses to meet the requirements of end users. The system is also able to process existential SQL queries.

Depth Perception using A Parallel-Axis Stereoscopic Camera Rig

  • Ramesh, Rohit;Shin, Heung-Sub;Jeong, Shin-Il;Chung, Wan-Young
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2010.10a / pp.147-148 / 2010
  • Recent advances in visual technology have led to the further development of three-dimensional (3D) imaging systems. The ability to perceive a pair of images simultaneously is a crucial factor in building a stereoscopic 3D image. In this paper, we present the depth cues between the intensities of the two images when viewing with both eyes. Due to this stereoscopic effect, objects at different distances from the eyes differ in their horizontal positions, giving the depth cue of horizontal disparity. Using a simple image processing technique, we also present the binocular disparity map between the two images. A median filter is used to remove the noise that occurs in the disparity map image.
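A horizontal disparity estimate followed by median filtering can be sketched as below. This is an illustration under stated assumptions, not the authors' method: it works on a single scanline pair, uses sum-of-absolute-differences block matching (the abstract only says "simple image processing technique"), and the names `row_disparity` and `median_filter` are hypothetical.

```python
def row_disparity(left_row, right_row, max_disparity=4, radius=1):
    """Per-pixel horizontal disparity for one scanline of a parallel-axis pair.

    For each left pixel x, try shifts d = 0..max_disparity and keep the one
    whose window in the right row (centred at x - d) has the smallest sum of
    absolute differences. Indices are clamped at the row borders.
    """
    width = len(left_row)
    disparities = []
    for x in range(width):
        best_d, best_cost = 0, float("inf")
        for d in range(max_disparity + 1):
            cost = 0
            for k in range(-radius, radius + 1):
                xl = min(max(x + k, 0), width - 1)
                xr = min(max(x + k - d, 0), width - 1)
                cost += abs(left_row[xl] - right_row[xr])
            if cost < best_cost:
                best_d, best_cost = d, cost
        disparities.append(best_d)
    return disparities


def median_filter(values, radius=1):
    """1-D median filter; border windows are simply shorter."""
    filtered = []
    for i in range(len(values)):
        window = sorted(values[max(0, i - radius): i + radius + 1])
        filtered.append(window[len(window) // 2])
    return filtered


# Toy scanlines: the left view equals the right view shifted by two pixels.
right = [5, 40, 12, 90, 33, 70, 8, 55, 21, 64]
left = [0, 0] + right[:-2]
smoothed = median_filter(row_disparity(left, right))
```

A full disparity map would apply the same matching to every row of a rectified image pair and a 2-D median window; the 1-D version keeps the example short.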

WAVELET-BASED DIGITAL WATERMARKING USING HUMAN VISUAL SYSTEM FOR COPYRIGHT PROTECTION

  • Sombun, Anuwat;Pinngern, Quen;Kimpan, Chom
    • Institute of Control, Robotics and Systems: Conference Proceedings (제어로봇시스템학회:학술대회논문집) / 2004.08a / pp.800-803 / 2004
  • This paper presents a wavelet-based digital watermarking technique for still images. The technique takes the human visual system (HVS) into account to increase the robustness and perceptual invisibility of the digital watermark. The watermark is embedded by modifying the discrete wavelet transform (DWT) coefficients of the image subbands. The HVS model covers a number of factors that affect the noise sensitivity of the human eye. Watermark detection is blind (the original image is not required). Experimental results show that the watermark survives image processing attacks such as added noise, cropping, filtering, and JPEG and JPEG2000 compression.
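The idea of embedding bits in DWT coefficients with blind detection can be illustrated with a small sketch. Everything specific here is an assumption rather than the paper's scheme: a one-level Haar transform on a 1-D signal stands in for the image DWT, quantization-based embedding stands in for the authors' embedding rule, and the fixed `step` stands in for an HVS-derived strength.

```python
def haar_forward(signal):
    """One-level Haar DWT of an even-length sequence: (approximation, detail)."""
    approx = [(signal[i] + signal[i + 1]) / 2 for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / 2 for i in range(0, len(signal), 2)]
    return approx, detail


def haar_inverse(approx, detail):
    """Inverse of haar_forward."""
    signal = []
    for a, d in zip(approx, detail):
        signal.extend([a + d, a - d])
    return signal


def embed(signal, bits, step=4.0):
    """Quantize each detail coefficient so its quantization index parity equals the bit."""
    approx, detail = haar_forward(signal)
    for i, bit in enumerate(bits):
        q = round(detail[i] / step)
        if q % 2 != bit:
            q += 1
        detail[i] = q * step
    return haar_inverse(approx, detail)


def extract(signal, count, step=4.0):
    """Blind detection: only the marked signal and the step size are needed."""
    _, detail = haar_forward(signal)
    return [round(detail[i] / step) % 2 for i in range(count)]


pixels = [52, 55, 61, 59, 79, 61, 76, 61]
watermark_bits = [1, 0, 1, 1]
marked = embed(pixels, watermark_bits)
recovered = extract(marked, len(watermark_bits))
```

A larger `step` makes the mark more robust to noise but more visible, which is the trade-off an HVS model is meant to tune per coefficient. A practical version would also round and clip the marked values to the valid pixel range.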

Comparison of Database Processing Time according to Programming Languages (프로그래밍 언어에 따른 데이터베이스 처리시간 비교)

  • Seo, Sang-Uk;Kim, Kyeoung-Jin;Jang, Si-Woong
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2010.05a / pp.909-912 / 2010
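  • Database systems are now so widespread that almost all computer-related business processing depends on them, and their importance continues to grow. Many benchmark results for large databases are available for currently available commercial and public DBMSs, but few studies are known that compare and analyze processing speed across programming languages. In this paper, we use Oracle to compare and analyze database processing time for Visual Basic, Visual C++, and ASP.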
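A processing-time measurement of the kind compared in this study can be sketched as follows. This is only an illustration of the measurement pattern: it uses Python with an in-memory SQLite database as a stand-in, not Oracle or any of the three languages studied, and the table name, row count, and function name are hypothetical.

```python
import sqlite3
import time


def time_database_operations(row_count=1000):
    """Time a bulk insert and a full-table select; returns seconds per step."""
    connection = sqlite3.connect(":memory:")
    connection.execute("CREATE TABLE items (id INTEGER PRIMARY KEY, label TEXT)")
    rows = [(i, f"item-{i}") for i in range(row_count)]

    # Time the inserts, including the commit.
    start = time.perf_counter()
    connection.executemany("INSERT INTO items VALUES (?, ?)", rows)
    connection.commit()
    insert_seconds = time.perf_counter() - start

    # Time reading every row back.
    start = time.perf_counter()
    fetched = connection.execute("SELECT id, label FROM items").fetchall()
    select_seconds = time.perf_counter() - start

    connection.close()
    return {"insert": insert_seconds, "select": select_seconds, "rows": len(fetched)}


timings = time_database_operations()
```

Running the same workload from each language against the same database server, and repeating it several times, is what makes the resulting times comparable.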

Reaction Times to Predictable Visual Patterns Reflect Neural Responses in Early Visual Cortex

  • Joo, Sung Jun
    • Science of Emotion and Sensibility / v.24 no.2 / pp.57-64 / 2021
  • It has long been speculated that the visual system should use a coding strategy that takes advantage of statistical redundancies in images. But how such a coding strategy should manifest in neural responses has been less clear. Low-level image structure related to the power spectrum of natural images appears to be captured by a hard-wired efficient code in the retina of the fly and precortical structures like the LGN of cats that maximizes information content through the limited capacity channel of the optic nerve. But visual images are typically filled with higher-order structure beyond that captured by the power spectrum and visual cortex is not constrained by the same capacity limits as the optic nerve. Whether and how visual cortex can flexibly code for higher order redundancies is unknown. Here we show using psychophysical techniques that the neural response in early human visual cortex may be modulated by orientation redundancies in images such that a visual feature that is contained within a predictive pattern results in slower reaction times than a feature that deviates from a pattern, suggesting lower neural responses to predictable stimuli in the visual cortex. Our results point to a neural response in early visual cortex that is sensitive to global patterns and redundancies in visual images and is in marked contrast to standard models of cortical visual processing.

Development of Visual Odometry Estimation for an Underwater Robot Navigation System

  • Wongsuwan, Kandith;Sukvichai, Kanjanapan
    • IEIE Transactions on Smart Processing and Computing / v.4 no.4 / pp.216-223 / 2015
  • The autonomous underwater vehicle (AUV) is being widely researched in order to achieve superior performance when working in hazardous environments. This research focuses on using image processing techniques to estimate the AUV's egomotion and changes in orientation from image frames captured at different times by a single high-definition web camera attached to the bottom of the AUV. A visual odometry application is integrated with other sensors. An inertial measurement unit (IMU) sensor is used to determine the correct set of solutions to a homography motion equation. A pressure sensor is used to resolve image scale ambiguity. Uncertainty estimation is computed to correct drift that occurs in the system by using a Jacobian method, singular value decomposition, and backward and forward error propagation.
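How a pressure reading can resolve scale ambiguity is sketched below. The motion recovered from a plane-induced homography gives translation only relative to the camera-to-plane distance, so a metric distance is needed to convert it to metres. The sketch makes several assumptions that the abstract does not state: a downward-looking camera over a flat floor of known depth, fresh water, and the hypothetical names `depth_from_pressure` and `metric_translation`.

```python
ATMOSPHERIC_PRESSURE_PA = 101_325.0  # standard atmosphere at the surface
WATER_DENSITY_KG_M3 = 1000.0         # assumption: fresh water
GRAVITY_M_S2 = 9.80665


def depth_from_pressure(pressure_pa):
    """Depth below the surface from absolute pressure (hydrostatic relation)."""
    return (pressure_pa - ATMOSPHERIC_PRESSURE_PA) / (WATER_DENSITY_KG_M3 * GRAVITY_M_S2)


def metric_translation(scaled_translation, pressure_pa, floor_depth_m):
    """Convert a translation in units of camera-to-floor distance into metres.

    Assumption: the floor is flat and its depth below the surface is known,
    so the camera-to-floor distance is floor depth minus vehicle depth.
    """
    distance_to_floor = floor_depth_m - depth_from_pressure(pressure_pa)
    return [component * distance_to_floor for component in scaled_translation]


# Vehicle 2 m deep in a 5 m deep tank: the camera is 3 m above the floor.
reading_pa = ATMOSPHERIC_PRESSURE_PA + WATER_DENSITY_KG_M3 * GRAVITY_M_S2 * 2.0
step_m = metric_translation([0.1, 0.0, 0.0], reading_pa, floor_depth_m=5.0)
```

In open water the floor depth is generally not known in advance, so a real system would need another way to obtain the camera-to-floor distance.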

Investigating the Effects of Hearing Loss and Hearing Aid Digital Delay on Sound-Induced Flash Illusion

  • Moradi, Vahid;Kheirkhah, Kiana;Farahani, Saeid;Kavianpour, Iman
    • Korean Journal of Audiology / v.24 no.4 / pp.174-179 / 2020
  • Background and Objectives: The integration of auditory-visual speech information improves speech perception; however, if the auditory system input is disrupted due to hearing loss, auditory and visual inputs cannot be fully integrated. Additionally, temporal coincidence of auditory and visual input is an important factor in integrating the input of these two senses. Passing the signal through digital signal processing delays the acoustic pathway. Therefore, this study aimed to investigate the effects of hearing loss and the hearing aid digital delay circuit on sound-induced flash illusion. Subjects and Methods: A total of 13 adults with normal hearing, 13 with mild to moderate hearing loss, and 13 with moderate to severe hearing loss were enrolled in this study. Subsequently, the sound-induced flash illusion test was conducted, and the results were analyzed. Results: The results showed that hearing aid digital delay and hearing loss had no detrimental effect on sound-induced flash illusion. Conclusions: Transmission velocity and neural transduction rate of the auditory inputs decreased in patients with hearing loss. Hence, auditory and visual sensory inputs cannot be fully integrated. However, the transmission rate of the auditory input was approximately normal when a hearing aid was prescribed. Thus, it can be concluded that the processing delay in the hearing aid circuit is insufficient to disrupt the integration of auditory and visual information.

Interactive information process image with minute hand gestures

  • Lim, Chan
    • Proceedings of the Korea Information Processing Society Conference / 2016.04a / pp.799-802 / 2016
  • Working with V4 to create varied content that emphasizes different interfaces, such as 3D graphics and multimedia including video, audio, and camera input, is an interesting task. Moreover, because it can be applied to many kinds of sensory effects, such as visual, auditory, and tactile effects, it makes it easy to build a more developed model. We intended users to experience pleasure and interaction rather than simply to use the work as media art.

An Ontology Design for converging augmented Information in visual media (영상 미디어의 증강정보 융합 온톨로지 설계)

  • Moon, Hee-Kyung;Xin, Li;Han, Sung-Kook
    • Proceedings of the Korea Information Processing Society Conference / 2015.10a / pp.1324-1325 / 2015
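  • This paper presents a method for realizing smart media by converging augmented information with visual media. We design an ontology model that represents the various types of augmented information and present a conceptual architecture for implementing a smart media system that applies ontology-based augmented information.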