• Title/Summary/Keyword: Scene-specific System

Search Result 47, Processing Time 0.026 seconds

A Study on Voice Activity Detection Using Auditory Scene and Periodic to Aperiodic Component Ratio in CASA System (CASA 시스템의 청각장면과 PAR를 이용한 음성 영역 검출에 관한 연구)

  • Kim, Jung-Ho;Ko, Hyung-Hwa;Kang, Chul-Ho
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.50 no.10
    • /
    • pp.181-187
    • /
    • 2013
  • When there are background noises or some people speaking at the same time, a human's auditory sense has the ability to listen the target speech signal with a specific purpose through Auditory Scene Analysis. The CASA system with human's auditory faculty system is able to segregate the speech. However, the performance of CASA system is reduced when the CASA system fails to determine the correct position of the speech. In order to correct the error in locating the speech on the CASA system, voice activity detection algorithm is proposed in this paper, which is a combined auditory scene analysis with PAR(Periodic to Aperiodic component Ratio). The experiments have been conducted to evaluate the performance of voice activity detection in environments of white noise and car noise with the change of SNR 15~0dB. In this paper, by comparing the existing algorithms (Pitch and Guoning Hu) with the proposed algorithm, the accuracy of the voice activity detection performance has been improved as the following: improvement of maximum 4% at SNR 15dB and maximum 34% at SNR 0dB for white noise and car noise, respectively.

Retinex-based Logarithm Transformation Method for Color Image Enhancement (컬러 이미지 화질 개선을 위한 Retinex 기반의 로그변환 기법)

  • Kim, Donghyung
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.19 no.5
    • /
    • pp.9-16
    • /
    • 2018
  • Images with lower illumination from the light source or with dark regions due to shadows, etc., can improve subjective image quality by using retinex-based image enhancement schemes. The retinex theory is a method that recognizes the relative lightness of a scene, rather than recognizing the brightness of the scene. The way the human visual system recognizes a scene in a specific position can be in one of several methods: single-scale retinex, multi-scale retinex, and multi-scale retinex with color restoration (MSRCR). The proposed method is based on the MSRCR method, which includes a color restoration step, which consists of three phases. In the first phase, the existing MSRCR method is applied. In the second phase, the dynamic range of the MSRCR output is adjusted according to its histogram. In the last phase, the proposed method transforms the retinex output value into the display dynamic range using a logarithm transformation function considering human visual system characteristics. Experimental results show that the proposed algorithm effectively increases the subjective image quality, not only in dark images but also in images including both bright and dark areas. Especially in a low lightness image, the proposed algorithm showed higher performance improvement than the conventional approaches.

VMS Emulator System with Real-Time Scheduling

  • Kim, Jung-Sook
    • Journal of Multimedia Information System
    • /
    • v.1 no.2
    • /
    • pp.95-100
    • /
    • 2014
  • Variable message signs (VMS) have the different sizes and a specific type according to the city scene and it has to be displayed by different message on the display panel in real-time. And VMS manufacturers must produce the different products in order to give a customized product to each order. In addition that, they should test and check the correct operation to each VMS product using the different message frame. That is very time and workers consuming and VMS emulator with an automatic variable message generator system is necessary. Also, the automatic message generator system is needed to real-time scheduling in order to display the message on the VMS panel like real world. In this paper, we design and implement the VMS emulator embedded the automatic message frame generator system with real-time scheduling which can set several parameters easily on the windows dialog.

  • PDF

A study on Metadata Modeling using Structure Information of Video Document (비디오 문서의 구조 정보를 이용한 메타데이터 모델링에 관한 연구)

  • 권재길
    • Journal of the Korea Society of Computer and Information
    • /
    • v.3 no.4
    • /
    • pp.10-18
    • /
    • 1998
  • Video information is an important component of multimedia system such as Digital Library. World-Wide Web(WWW) and Video-On-Demand(VOD) service system. It can support various types of information because of including audio-visual, spatial-temporal and semantics information. In addition, it requires the ability of retrieving the specific scene of video instead of entire retrieval of video document. Therefore, so as to support a variety of retrieval, this paper models metadata using video document structure information that consists of hierarchical structure, and designs database schema that can manipulate video document.

  • PDF

Autostereoscopic Display System Using a Variable Parallax Barrier (가변형 패럴랙스배리어를 이용한 무안경 디스플레이 시스템)

  • Wi, Sung-Min;Lee, Seung-Hyun
    • Korean Journal of Optics and Photonics
    • /
    • v.19 no.2
    • /
    • pp.95-102
    • /
    • 2008
  • An advantage of parallax barrier displays is that they can also display 2D and 3D contents and can be automatically switched between the two types. But, as the viewer changes position, different views of the scene will be directed by the barrier to the visual system. Moving horizontally beyond a certain point will produce "image flipping" of the different views of the scene. These limitations make unavoidable the use of another autostereoscopic display solutions like eye tracking or increasing the number of views. In this paper, a method of the moving parallax barrier design is introduced to supplement a disadvantage of the fixed parallax barrier that provides observation at specific locations. For making the moving parallax barrier, the cross connector with 640 lines FPC is designed. A commercially available web camera is utilized to implement eye-tracking system and shows the experimental result.

CARA: Character Appearance Retrieval and Analysis for TV Programs

  • Jung Byunghee;Park Sungchoon;Kim Kyeongsoo
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2004.11a
    • /
    • pp.237-240
    • /
    • 2004
  • This paper describes a character retrieval system for TV programs and a set of novel algorithms for detecting and recognizing faces for the system. Our character retrieval system consists of two main components: Face Register and Face Recognizer. The Face Register detects faces in video frames and then guides users to register the detected faces of interest into the database. The Face Recognizer displays the appearance interval of each character on the timeline interface and the list of scenes with the names of characters that appear on each scene. These two components also provide a function to modify incorrect results. which is helpful to provide accurate character retrieval services. In the proposed face detection and recognition algorithms. we reduce the computation time without sacrificing the recognition accuracy by using the DCT/LDA method for face feature extraction. We also develop the character retrieval system in the form of plug-in. By plugging in our system to a cataloguing system. the metadata about the characters in a video can be automatically generated. Through this system, we can easily realize sophisticated on-demand video services which provide the search of scenes of a specific TV star.

  • PDF

Traveler Guidance System based on 3D Street Modeling

  • Kim, Seung-Jun;Eom, Seong-Eun;Byun, Sung-Cheal;Yang, See-Moon;Ahn, Byung-Ha
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2004.08a
    • /
    • pp.1187-1190
    • /
    • 2004
  • This paper presents a traveler guidance system that offers 3D street information such as road types, signal light systems, street trees, buildings, etc. We consider 5x4 road system of Gangnam(in Seoul, Korea) as a test area and reflect the traveler's car-driving situation. A web server is constructed to serve traveler's driving path by switching 3D animation scenes automatically. To do batch processing of geometric data for the 3D graphical streets construction, we have extracted major street information from present GIS database and created new GIS file formats (SMF files), which contain data sessions for links, nodes, and facilities. With these files, we can render 3D navigation scenes. A number of vector calculations were performed for the geometrical consistence and texture-mapping method was used for the realistic scene generation. Finally, we have verified the effectiveness of the service by operating a test scenario. We have checked whether traveler's 2D path and 3D navigation are exactly reported after setting specific departure and destination. This system offers us well awareness of streets and takes useful role of traveler guidance.

  • PDF

A Knowledge-Based Machine Vision System for Automated Industrial Web Inspection

  • Cho, Tai-Hoon;Jung, Young-Kee;Cho, Hyun-Chan
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.1 no.1
    • /
    • pp.13-23
    • /
    • 2001
  • Most current machine vision systems for industrial inspection were developed with one specific task in mind. Hence, these systems are inflexible in the sense that they cannot easily be adapted to other applications. In this paper, a general vision system framework has been developed that can be easily adapted to a variety of industrial web inspection problems. The objective of this system is to automatically locate and identify \\\"defects\\\" on the surface of the material being inspected. This framework is designed to be robust, to be flexible, and to be as computationally simple as possible. To assure robustness this framework employs a combined strategy of top-down and bottom-up control, hierarchical defect models, and uncertain reasoning methods. To make this framework flexible, a modular Blackboard framework is employed. To minimize computational complexity the system incorporates a simple multi-thresholding segmentation scheme, a fuzzy logic focus of attention mechanism for scene analysis operations, and a partitioning if knowledge that allows concurrent parallel processing during recognition.cognition.

  • PDF

Discriminant Analysis of Human's Implicit Intent based on Eyeball Movement (안구운동 기반의 사용자 묵시적 의도 판별 분석 모델)

  • Jang, Young-Min;Mallipeddi, Rammohan;Kim, Cheol-Su;Lee, Minho
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.50 no.6
    • /
    • pp.212-220
    • /
    • 2013
  • Recently, there has been tremendous increase in human-computer/machine interaction system, where the goal is to provide with an appropriate service to the user at the right time with minimal human inputs for human augmented cognition system. To develop an efficient human augmented cognition system based on human computer/machine interaction, it is important to interpret the user's implicit intention, which is vague, in addition to the explicit intention. According to cognitive visual-motor theory, human eye movements and pupillary responses are rich sources of information about human intention and behavior. In this paper, we propose a novel approach for the identification of human implicit visual search intention based on eye movement pattern and pupillary analysis such as pupil size, gradient of pupil size variation, fixation length/count for the area of interest. The proposed model identifies the human's implicit intention into three types such as navigational intent generation, informational intent generation, and informational intent disappearance. Navigational intent refers to the search to find something interesting in an input scene with no specific instructions, while informational intent refers to the search to find a particular target object at a specific location in the input scene. In the present study, based on the human eye movement pattern and pupillary analysis, we used a hierarchical support vector machine which can detect the transitions between the different implicit intents - navigational intent generation to informational intent generation and informational intent disappearance.

Development of Vibroacoustic Stimulation Seat for a Movie Theater Chair (영화관 의자용 음향진동자극 시트의 개발)

  • Moon, Deok-Hong
    • Journal of Power System Engineering
    • /
    • v.17 no.1
    • /
    • pp.42-49
    • /
    • 2013
  • The global movie industry is continuing rapid growth through application of the latest technology. 3D movies are being produced and shown for a more effective viewing experience. Special chairs for audiences are being experimentally manufactured and installed for the greatest viewing effect. This special chair has a structure that applies vibrating stimuli to specific parts of the body by attaching vibration transducers to theater chairs and synchronizing it with each scene of the movie. In a previous study, it has been confirmed that we can analyze the vibration transfer characteristics of sponge seats through the application of an experimental modal analysis method and obtain design variables easily. In this paper, we examine the major design parameters needed in the development of a foaming sponge seat in which auxiliary springs are inserted to improve the vibration transfer effect of a chair seat. Through analyzing several prototypes by applying experimentation as well as the experimental modal analysis method, it was confirmed that the effect of vibration transfer can be improved through the use of an auxiliary member.