• 제목/요약/키워드: Spatial Audio

Search Result 90, Processing Time 0.028 seconds

Verification of the Multi-channel Audio Service over T-DMB (지상파 DMB를 통한 멀티채널 오디오 서비스 검증에 관한 연구)

  • Jang, Dae-Young;Lee, Yong-Ju
    • Journal of Broadcast Engineering
    • /
    • v.12 no.3
    • /
    • pp.222-229
    • /
    • 2007
  • According to the advancement of multimedia compression technologies, high quality multi-media services are easily found in common life. Along with this situation, 5.1-channel audio service also has expanded the application area to home theater system and car theater system and consumer can easily take a chance to experience the feeling of 5.1-channel audio. On the other hand, terrestrial DMB service has been launched in Korea from Dec. 2005 as a handhold multi-media broadcasting service. However, multi-channel audio was not considered due to the insufficiency of bandwidth and the handhold usage. Lately, MPEG is standardizing high efficiency multi-channel audio compression technology for handheld broadcasting service, and several trial for application is introduced in Europe. In this paper, we would like to explain multi-channel audio compression technology, describe the implementation of the verification system for the multi-channel audio service over T-DMB and investigate the possibility of further realization of the service.

Analysis of learning effects using audio-visual manual of SWAT (SWAT의 시청각 매뉴얼을 통한 학습 효과 분석)

  • Lee, Ju-Yeong;Kim, Tea-Ho;Ryu, Ji-Chul;Kang, Hyun-Woo;Kum, Dong-Hyuk;Woo, Won-Hee;Jang, Chun-Hwa;Choi, Jong-Dae;Lim, Kyoung-Jae
    • Korean Journal of Agricultural Science
    • /
    • v.38 no.4
    • /
    • pp.731-737
    • /
    • 2011
  • In the modern society, GIS-based decision support system has been used in evaluating environmental issues and changes due to spatial and temporal analysis capabilities of the GIS. However without proper manual of these systems, its desired goals could not be achieved. In this study, audio-visual SWAT tutorial system was developed to evaluate its effectives in learning the SWAT model. Learning effects was analyzed after in-class demonstration and survey. The survey was conducted for $3^{rd}$ grade students with/without audio-visual materials using 30 questionnaires, composed of 3 items for trend of respondent, 5 items for effects of audio-visual materials, and 12 items for effects of with/without manual in learning the model. For group without audio-visual manual, 2.98 out of 5 was obtained and 4.05 out of 5 was obtained for group with audio-visual manual, indicating higher content delivery with audio-visual learning effects. As shown in this study, the audio-visual learning material should be developed and used in various computer-based modeling system.

A study on Metadata Modeling using Structure Information of Video Document (비디오 문서의 구조 정보를 이용한 메타데이터 모델링에 관한 연구)

  • 권재길
    • Journal of the Korea Society of Computer and Information
    • /
    • v.3 no.4
    • /
    • pp.10-18
    • /
    • 1998
  • Video information is an important component of multimedia system such as Digital Library. World-Wide Web(WWW) and Video-On-Demand(VOD) service system. It can support various types of information because of including audio-visual, spatial-temporal and semantics information. In addition, it requires the ability of retrieving the specific scene of video instead of entire retrieval of video document. Therefore, so as to support a variety of retrieval, this paper models metadata using video document structure information that consists of hierarchical structure, and designs database schema that can manipulate video document.

  • PDF

Audio-visual Spatial Coherence Judgments in the Peripheral Visual Fields

  • Lee, Chai-Bong;Kang, Dae-Gee
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.16 no.2
    • /
    • pp.35-39
    • /
    • 2015
  • Auditory and visual stimuli presented in the peripheral visual field were perceived as spatially coincident when the auditory stimulus was presented five to seven degrees outwards from the direction of the visual stimulus. Furthermore, judgments of the perceived distance between auditory and visual stimuli presented in the periphery did not increase when an auditory stimulus was presented in the peripheral side of the visual stimulus. As to the origin of this phenomenon, there would seem to be two possibilities. One is that the participants could not perceptually distinguish the distance on the peripheral side because of the limitation of accuracy perception. The other is that the participants could distinguish the distances, but could not evaluate them because of the insufficient experimental setup of auditory stimuli. In order to confirm which of these two alternative explanations is valid, we conducted an experiment similar to that of our previous study using a sufficient number of loudspeakers for the presentation of auditory stimuli. Results revealed that judgments of perceived distance increased on the peripheral side. This indicates that we can perceive discrimination between audio and visual stimuli on the peripheral side.

Spatial Audio Signal Processing Technology Using Multi-Channel 3D Microphone (멀티채널 3차원 마이크를 이용한 입체음향 처리 기술)

  • Kang Kyeongok;Lee Taejin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.24 no.2
    • /
    • pp.68-77
    • /
    • 2005
  • The purpose of a spatial audio system is to give a listener an impression as if he were present in a recorded environment when its sound is reproduced. For this purpose a dummy head microphone is generally used. Because of its human-like shape, dummy head microphone can reproduce spatial images through headphone reproduction. However, its shape and size are restriction to public use and it is difficult to convert the output signal of dummy head microphone into a multi-channel signal for multi-channel environment. So, in this paper, we propose a multi-channel 3D microphone technology. The multi-channel 3D microphone acquire a spatial audio using five microphones around a horizontal plane of a rigid sphere and through post processing, it can reproduce various reproduction signals for headphone, stereo, stereo dipole, 4ch and 5ch reproduction environments. Because of complex computation, we implemented H/W based post processing system. To verily the Performance of the multi-channel 3D microphone, localization experiments were Performed. The result shows that a front/back confusion, which is the one of common limitations of conventional dummy head technology, can be reduced dramatically.

Overview of MPEG Surround (MPEG Surround 표준화 동향 및 기술 분석)

  • Jang In-Seon;Beack Seung-Kwon;Seo Jeong-Il;Jang Dae-Young
    • Journal of Broadcast Engineering
    • /
    • v.11 no.2 s.31
    • /
    • pp.181-190
    • /
    • 2006
  • Technology for compressing low-bitrate multichannel audio coding should be developed owing to the increasing need of consumer for multichannel audio contents and services. To meet this requirement, MPEG has standardized MPEG Surround. In this paper, we introduce status on MPEG Surround standardization and analyze techniques adopted in the current MPEG Surround.

An Audio Coding Technique Employing the Inter-channel Phase Difference Skip (채널 간 위상차 파라미터 생략 기법을 이용한 오디오 부호화)

  • Kim, Hyun-Hwi;Kim, Rin-Chul
    • Journal of Broadcast Engineering
    • /
    • v.21 no.3
    • /
    • pp.369-379
    • /
    • 2016
  • This paper deals with an efficient method for skipping inter-channel phase differences (IPD) in the MPEG surround of the unified speech and audio coding (USAC). Based on the psycho-acoustic sensitivity on the IPD, we estimate a threshold on IPD, below which we can not notice degradation in spatial cue. We propose an IPD skip method, in which any IPDs within the threshold are set to zero and are not transmitted. The proposed IPD skip method gives about 38% savings in terms of bit amount for IPD. Nevertheless, in the MUSHRA test, the proposed method does not show any noticeable degradation in the decoded audio quality.

Sound Quality Enhancement in MPEG Surround by Using ILD Distortion (ILD DISTORTION을 이용한 MPEG SURROUND의 음질 개선)

  • Chon, Sang-Bae;Choi, In-Yong;Sung, Koeng-Mo
    • Proceedings of the IEEK Conference
    • /
    • 2006.06a
    • /
    • pp.241-242
    • /
    • 2006
  • MPEG Surround is an audio coding technology that represents multi-channel audio signal with downmixed audio signal(s) and very low bitrate side information based on Binaural Cue Coding. The side information consists of Inter-Channel Level Difference, Inter-Channel Correlation, and payloads. These two parameters are correspondent to the well-known spatial parameters in psycho-acoustics, Inter-aural Level Difference (ILD) and Inter-Aural Cross Correlation (IACC). Though ICLD is to provide perceptually equivalent ILD to the listener, however, the ILD of the original multi-channel audio signal and that of the MPEG Surround encoded signal was different. The difference between two ILD values is defined as ILD Distortion (ILDD). This paper provides how ILDD can be applied to enhance sound quality in MPEG Surround and how much ILDD is decreased.

  • PDF

Audio Source Separation Method based on Beamspace-domain Multichannel Non-negative Matrix Factorization, Part II: A Study on the Beamspace Transform Algorithms (빔공간-영역 다채널 비음수 행렬 분해 알고리즘을 이용한 음원 분리 기법 Part II: 빔공간-변환 기법에 대한 고찰)

  • Lee, Seok-Jin;Park, Sang-Ha;Sung, Koeng-Mo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.31 no.5
    • /
    • pp.332-339
    • /
    • 2012
  • Beamspace transform algorithm transforms spatial-domain data - such as x, y, z dimension - into incidence-angle-domain data, which is called beamspace-domain data. The beamspace transform method is generally used in source localization and tracking, and adaptive beamforming problem. When the beamspace transform method is used in multichannel audio source separation, the inverse beamspace transform is also important because the source image have to be reconstructed. This paper studies the beamspace transform and inverse transform algorithms for multichannel audio source separation system, especially for the beamspace-domain multichannel NMF algorithm.

Implementation of a Person Tracking Based Multi-channel Audio Panning System for Multi-view Broadcasting Services (다시점 방송 서비스를 위한 사용자 위치추적 기반 다채널 오디오 패닝 시스템 구현)

  • Kim, Yong-Guk;Yang, Jong-Yeol;Lee, Young-Han;Kim, Hong-Kook
    • 한국HCI학회:학술대회논문집
    • /
    • 2009.02a
    • /
    • pp.150-157
    • /
    • 2009
  • In this paper, we propose a person tracking based multi-channel audio panning system for multi-view broadcasting services. Multi-view broadcasting is to render the video sequences that are captured from a set of cameras based on different viewpoints, and multi-channel audio panning techniques are necessary for audio rendering in these services. In order to apply such a realistic audio technique to this multi-view broadcasting service, person tracking techniques which are to estimate the position of users are also necessary. For these reasons, proposed methods are composed of two parts. The first part is a person tracking method by using ultrasonic satellites and receiver. We could obtain user's coordinates of high resolution and short duration about 10 mm and 150 ms. The second part is MPEG Surround parameter-based multi-channel audio panning method. It is a method to obtain panned multi-channel audio by controlling the MPEG Surround spatial parameters. A MUSHRA test is conducted to objectively evaluate the perceptual quality and measure localization performance using a dummy head. From the experiments, it is shown that the proposed method provides better perceptual quality and localization performance than the conventional parameter-based audio panning method. In addition, we implement the prototype of person tracking based multi-view broadcasting system by integrating proposed methods with multi-view display system.

  • PDF