• Title/Summary/Keyword: Binaural Synthesis

Search Result 6, Processing Time 0.02 seconds

Listener Auditory Perception Enhancement using Virtual Sound Source Design for 3D Auditory System

  • Kang, Cheol Yong;Mariappan, Vinayagam;Cho, Juphil;Lee, Seon Hee
    • International journal of advanced smart convergence
    • /
    • v.5 no.4
    • /
    • pp.15-20
    • /
    • 2016
  • When a virtual sound source for 3D auditory system is reproduced by a linear loudspeaker array, listeners can perceive not only the direction of the source, but also its distance. Control over perceived distance has often been implemented via the adjustment of various acoustic parameters, such as loudness, spectrum change, and the direct-to-reverberant energy ratio; however, there is a neglected yet powerful cue to the distance of a nearby virtual sound source that can be manipulated for sources that are positioned away from the listener's median plane. This paper address the problem of generating binaural signals for moving sources in closed or in open environments. The proposed perceptual enhancement algorithm composed of three main parts is developed: propagation, reverberation and the effect of the head, torso and pinna. For propagation the effect of attenuation due to distance and molecular air-absorption is considered. Related to the interaction of sounds with the environment, especially in closed environments is reverberation. The effects of the head, torso and pinna on signals that arrive at the listener are also objectives of the consideration. The set of HRTF that have been used to simulate the virtual sound source environment for 3D auditory system. Special attention has been given to the modelling and interpolation of HRTFs for the generation of new transfer functions and definition of trajectories, definition of closed environment, etc. also be considered for their inclusion in the program to achieve realistic binaural renderings. The evaluation is implemented in MATLAB.

MPEG-H 3D Audio Decoder Structure and Complexity Analysis (MPEG-H 3D 오디오 표준 복호화기 구조 및 연산량 분석)

  • Moon, Hyeongi;Park, Young-cheol;Lee, Yong Ju;Whang, Young-soo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.42 no.2
    • /
    • pp.432-443
    • /
    • 2017
  • The primary goal of the MPEG-H 3D Audio standard is to provide immersive audio environments for high-resolution broadcasting services such as UHDTV. This standard incorporates a wide range of technologies such as encoding/decoding technology for multi-channel/object/scene-based signal, rendering technology for providing 3D audio in various playback environments, and post-processing technology. The reference software decoder of this standard is a structure combining several modules and can operate in various modes. Each module is composed of independent executable files and executed sequentially, real time decoding is impossible. In this paper, we make DLL library of the core decoder, format converter, object renderer, and binaural renderer of the standard and integrate them to enable frame-based decoding. In addition, by measuring the computation complexity of each mode of the MPEG-H 3D-Audio decoder, this paper also provides a reference for selecting the appropriate decoding mode for various hardware platforms. As a result of the computational complexity measurement, the low complexity profiles included in Korean broadcasting standard has a computation complexity of 2.8 times to 12.4 times that of the QMF synthesis operation in case of rendering as a channel signals, and it has a computation complexity of 4.1 times to 15.3 times of the QMF synthesis operation in case of rendering as a binaural signals.

3-channel HRTF measurement for binaural synthesis. (바이노럴 합성을 위한 3채널 HRTF 측정)

  • Lee Sin-lyul;Kim Lae-hoon;Pang Hee-suk;Sung Koeng-Mo
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.337-340
    • /
    • 2000
  • 입체음향 생성을 위한 기존의 방법은 크게 바이노럴 녹음기법과 머리전달함수(HRTF)를 이용한 바이노럴 합성 기법으로 나눌 수 있다. 기존 2채널 더미헤드를 이용한 바이노럴 녹음기법과 바이노럴 합성기법은 표준 더미헤드를 사용함으로써 청취자 머리와의 오차로 정면 음상 정위의 어려움, "Front-back confusion", 이동 음 음상 정위 어려움 등의 문제로 실제 녹음 현장에서는 거의 사용되지 않고 있다. 본 논문에서 제안한 3채널 더미헤드 기법은 이러한 문제점을 극복할 수 있고, 특히, HRTF 합성 시 기존의 HRTF의 문제점을 극복할 수 있는 새로운 HRTF를 구축할 수 있다. 따라서 바이노럴 합성 기법이 필요한 오락, 시뮬레이터, 음장 가청화 기술(Auralization) 프로그램 등 다양한 분야에서의 적용이 가능하다.

  • PDF

Low Dimensional Modeling and Synthesis of Head-Related Transfer Function (HRTF) Using Nonlinear Feature Extraction Methods (비선형 특징추출 기법에 의한 머리전달함수(HRTF)의 저차원 모델링 및 합성)

  • Seo, Sang-Won;Kim, Gi-Hong;Kim, Hyeon-Seok;Kim, Hyeon-Bin;Lee, Ui-Taek
    • The Transactions of the Korea Information Processing Society
    • /
    • v.7 no.5
    • /
    • pp.1361-1369
    • /
    • 2000
  • For the implementation of 3D Sound Localization system, the binaural filtering by HRTFs is generally employed. But the HRTF filter is of high order and its coefficients for all directions have to be stored, which imposes a rather large memory requirement. To cope with this, research works have centered on obtaining low dimensional HRTF representations without significant loss of information and synthesizing the original HRTF efficiently, by means of feature extraction methods for multivariate dat including PCA. In these researches, conventional linear PCA was applied to the frequency domain HRTF data and using relatively small number of principal components the original HRTFs could be synthesized in approximation. In this paper we applied neural network based nonlinear PCA model (NLPCA) and the nonlinear PLS repression model (NLPLS) for this low dimensional HRTF modeling and analyze the results in comparison with the PCA. The NLPCA that performs projection of data onto the nonlinear surfaces showed the capability of more efficient HRTF feature extraction than linear PCA and the NLPLS regression model that incorporates the direction information in feature extraction yielded more stable results in synthesizing general HRTFs not included in the model training.

  • PDF

A Study on Auditory Perception Characteristics of Directional Tonal Noise (방향성을 가진 회전체 소음의 청각계 인지 특성에 관한 연구)

  • Seo, Kang-Won;Kim, Eui-Youl;Kim, Sung-Ki
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2012.04a
    • /
    • pp.348-353
    • /
    • 2012
  • This paper presents the HRTF based experimental approach to figure out why the human auditory perception on the interior noise source including the directional tonal components does not well match with the dominant features extracted from recorded acoustic signals in terms of psycho-acoustics. Since the general objective evaluation models for tonalness among various sound attributes are a function of width, frequency, excessive level of tonal components respectively, the directional tonal components cannot be properly evaluated without considering the effects of head-related transfer function on the binaural auditory perception. Thus, the directivity of source is additionally considered to prevent the erroneous conclusions from the same sound source in the process of source identification. The signal synthesis technique is used to solve a little difficulty in measuring all of desired acoustic signals for jury evaluation. The sound attributes of synthetic acoustics signals are analyzed to roughly predict the results of jury evaluation in advance by using sound quality factors such as loudness, sharpness, roughness, fluctuation strength and tonality. The jury evaluation is carefully conducted based on the recommended guideline suggested by N. Ottoet al. Each sound is respectively evaluated by selecting a value between -2 and 2 in intervals of 0.2 point. Through above procedure, based on the results of jury evaluation, it is confirmed that serious problems can be caused in the process of analyzing the dominant sound attributes in terms of psycho-acoustics according to the type of a microphone and a playback system.

  • PDF

A Relevant Distortion Criterion for Interpolation of the Head-Related Transfer Functions (머리 전달 함수의 보간에 적합한 왜곡 척도)

  • Lee, Ki-Seung;Lee, Seok-Pil
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.2
    • /
    • pp.85-95
    • /
    • 2009
  • In the binaural synthesis environments, wide varieties of the head-related transfer functions (HRTFs) that have measured with a various direction would be desirable to obtain the accurate and various spatial sound images. To reduce the size' of HRTFs, interpolation has been often employed, where the HRTF for any direction is obtained by a limited number of the representative HRTFs. In this paper, we study on the distortion measures for interpolation, which has an important role in interpolation. With lhe various objective distortion metrics, the differences between the interpolated and the measured HRTFs were computed. These were then compared and analyzed with the results from the listening tests. From the results, the objective distortion measures were selected, that reflected the perceptual differences in spatial sound image. This measure was employed in a practical interpolation technique. We applied the proposed method to four kinds of an HRTF set, measured from three human heads and one mannequin. As a result, the Mel-frequency cepstral distortion was shown to be a good predictor for the differences in spatial sound location, when three HRTF measured from human, and the time-domain signal to distortion ratio revealed good prediction results for the entire four HRTF sets.