• Title/Summary/Keyword: objective sound quality assessment model

A Study on the Implementation of Realistic Sound Through Cross-Talk Cancellation

  • 김학진
    • Journal of the Institute of Electronics Engineers of Korea SP, v.41 no.2, pp.99-108, 2004
  • This thesis deals with a method for delivering more realistic sound by cancelling the cross-talk inherent in 5.1-channel speaker systems. The acoustical model used for cross-talk cancellation is the free-field model, which minimizes distortion of the sound. I used Bark-scale sound quality compensation, which is based on psychoacoustics. For the surround channels, band-limited sound quality compensation is performed in the frequency domain. I also conducted a sound quality assessment test on a traditional 2-channel stereo system and a 5.1-channel system, in a test chamber that satisfies the ITU-R specifications. I used the IACC (Inter-Aural Cross-Correlation) to determine the preferences of amateur listeners and golden-ear experts in order to assess the transaural filter. With the proposed method, I obtained a channel separation of more than 38 dB with the Dolby standard speaker array. In the subjective test with the experts, the diffusion score increased by 0.4 points compared with before.
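The abstract does not say how the IACC was computed. As a minimal sketch, assuming the common definition (the maximum absolute value of the normalized interaural cross-correlation over lags within about ±1 ms), it could be estimated from two ear signals as follows. The function name `iacc` and its parameters are hypothetical and are not taken from the thesis.

```python
import math


def iacc(left, right, sample_rate, max_lag_ms=1.0):
    """Return (IACC, lag) for two equal-length ear signals.

    Assumed definition: the maximum absolute value of the normalized
    cross-correlation over lags within +/- max_lag_ms. A positive lag
    (in samples) means the right signal is delayed relative to the left.
    """
    if len(left) != len(right):
        raise ValueError("channels must have equal length")
    n = len(left)
    max_lag = int(round(sample_rate * max_lag_ms / 1000.0))
    # Normalize by the geometric mean of the two channel energies.
    energy = math.sqrt(sum(x * x for x in left) * sum(y * y for y in right))
    if energy == 0.0:
        return 0.0, 0  # at least one channel is silent
    best_value, best_lag = 0.0, 0
    for lag in range(-max_lag, max_lag + 1):
        # Correlate left[i] with right[i + lag], staying inside both signals.
        start, stop = max(0, -lag), min(n, n - lag)
        acc = sum(left[i] * right[i + lag] for i in range(start, stop))
        value = abs(acc) / energy
        if value > best_value:
            best_value, best_lag = value, lag
    return best_value, best_lag
```

Values near 1 indicate nearly identical ear signals, and lower values indicate a more diffuse or spacious sound image. A practical measurement would typically use binaural recordings and often compute the value per octave band, which this sketch omits.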

Speech Enhancement using RNN Phoneme based VAD

  • Lee, Kang;Kang, Sang-Ick;Kwon, Jang-woo;Lee, Sangmin
    • Journal of the Institute of Electronics and Information Engineers, v.54 no.5, pp.85-89, 2017
  • In this paper, we apply high-performance hardware and a machine learning algorithm to build an advanced voice activity detection (VAD) algorithm for speech enhancement. Because speech is a sequence of phonemes, a recurrent neural network (RNN), which takes previous data into account, is a suitable way to build a speech model. Since it is impossible to learn every noise found in the real world, our algorithm is instead trained on phonemes. We detect the frames in which voice is present in a noisy speech signal and enhance the speech signal accordingly. The phoneme-based RNN model shows improved performance on speech signals, which are highly correlated from frame to frame. To verify the performance of the proposed algorithm, we compare its VAD results with labeled data, and its speech enhancement results in various noise environments with those of a previous speech enhancement algorithm.

IoT Based Performance Measurement of Car Audio Systems in Korean Recreation Vehicles

  • Park, Hyung Woo;Lee, Sangmin
    • Journal of Internet Computing and Services, v.18 no.1, pp.57-64, 2017
  • Recent automobile manufacturing technology has improved not only the function and performance of cars but also their audio systems, in order to increase marketability. Automobile manufacturers always have the option of simply installing an expensive acoustic system so that customers can enjoy high sound quality. However, this tends to increase the car's MSRP (Manufacturer's Suggested Retail Price). It is therefore desirable, where possible, to enhance the sound quality of plainer, less expensive audio devices so that customers feel as if they have a high-quality, expensive audio device in their car. To make this happen, the manufacturer must develop an optimal interior environment and audio system at relatively low cost. To this end, the car audio system can be improved by analyzing its audio frequency response and using performance metrics that account for the characteristics of the human auditory system. This study analyzed the sound field of Korean recreational vehicles (RVs), using Internet of Things (IoT) sensors to measure the car audio systems. The results show that high energy in the frequency band to which human hearing is most sensitive often produces an annoying sound. The study also found that future car audio systems should be designed with a flatter frequency response that takes human hearing into account.

Salience of Envelope Interaural Time Difference of High Frequency as Spatial Feature

  • Seo, Jeong-Hun;Chon, Sang-Bae;Sung, Koeng-Mo
    • The Journal of the Acoustical Society of Korea, v.29 no.6, pp.381-387, 2010
  • Both timbral features and spatial features are important in the assessment of multichannel audio coding systems. Choi et al. proposed a prediction model that extends ITU-R Rec. BS.1387-1 to multichannel audio coding systems by using spatial features such as ITDDist (Interaural Time Difference Distortion), ILDDist (Interaural Level Difference Distortion), and IACCDist (InterAural Cross-correlation Coefficient Distortion). In that model, ITDDists were computed only for low frequency bands (below 1500 Hz), and ILDDists were computed only for high frequency bands (over 2500 Hz), in accordance with classical duplex theory. In the high frequency range, however, information in the temporal envelope is also important for spatial perception, especially for sound localization. This paper introduces a new model that computes the ITD distortions of the temporal envelopes of high frequency components, in order to investigate quantitatively the role of such ITDs in spatial perception. The computed ITD distortions of the temporal envelopes in high frequency components were highly correlated with the perceived sound quality of multichannel audio.
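The abstract does not give the details of the envelope ITD model. The following is only a rough sketch of the general idea, under stated assumptions: the input is already band-limited to a high frequency band, the envelope is obtained by half-wave rectification and moving-average smoothing, the ITD is the lag that maximizes the envelope cross-correlation within ±1 ms, and the distortion is the absolute ITD difference between a reference and a test signal. The names `envelope`, `envelope_itd`, and `itd_distortion` are hypothetical and do not represent the authors' implementation.

```python
def envelope(signal, window=32):
    """Half-wave rectify, then smooth with a trailing moving average."""
    rectified = [max(x, 0.0) for x in signal]
    smoothed, running = [], 0.0
    for i, x in enumerate(rectified):
        running += x
        if i >= window:
            running -= rectified[i - window]  # drop the sample leaving the window
        smoothed.append(running / window)
    return smoothed


def envelope_itd(left, right, sample_rate, max_lag_ms=1.0, window=32):
    """Estimate the envelope ITD in seconds; positive means right lags left."""
    env_l, env_r = envelope(left, window), envelope(right, window)
    n = min(len(env_l), len(env_r))
    max_lag = int(round(sample_rate * max_lag_ms / 1000.0))
    best_score, best_lag = float("-inf"), 0
    for lag in range(-max_lag, max_lag + 1):
        # Correlate env_l[i] with env_r[i + lag] over the overlapping region.
        start, stop = max(0, -lag), min(n, n - lag)
        score = sum(env_l[i] * env_r[i + lag] for i in range(start, stop))
        if score > best_score:
            best_score, best_lag = score, lag
    return best_lag / sample_rate


def itd_distortion(reference, test, sample_rate):
    """Absolute envelope-ITD difference between two (left, right) pairs, in seconds."""
    ref_itd = envelope_itd(reference[0], reference[1], sample_rate)
    test_itd = envelope_itd(test[0], test[1], sample_rate)
    return abs(test_itd - ref_itd)
```

A perceptual model of this kind would normally apply an auditory filterbank first and compute the distortion per band and per time frame. The sketch treats a single band and the whole signal at once to keep the idea visible.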