• Title/Summary/Keyword: perceptual weighting

Search Result 27, Processing Time 0.027 seconds

Perceptual weighting on English lexical stress by Korean learners of English

  • Goun Lee
    • Phonetics and Speech Sciences
    • /
    • v.14 no.4
    • /
    • pp.19-24
    • /
    • 2022
  • This study examined which acoustic cue(s) that Korean learners of English give weight to in perceiving English lexical stress. We manipulated segmental and suprasegmental cues in 5 steps in the first and second syllables of an English stress minimal pair "object". A total of 27 subjects (14 native speakers of English and 13 Korean L2 learners) participated in the English stress judgment task. The results revealed that native Korean listeners used the F0 and intensity cues in identifying English stress and weighted vowel quality most strongly, as native English listeners did. These results indicate that Korean learners' experience with these cues in L1 prosody can help them attend to these cues in their L2 perception. However, L2 learners' perceptual attention is not entirely predicted by their linguistic experience with specific acoustic cues in their native language.

A new Implementation of Perceptual LPC Cepstrum and its Application to Speech Recognition (인지 LPC cepstrum의 새로운 구현 및 음성인식에의 적용)

  • Kim, Jin-Young;Choi, Seong-Ho
    • The Journal of the Acoustical Society of Korea
    • /
    • v.15 no.5
    • /
    • pp.61-64
    • /
    • 1996
  • To improve the performance of a recognition system, namely the recognition rate, we propose a hew implementation of perceptual distance using LPC cepstrum(perceptual cepstrum, PLC). The PLC is caculated by convolution of a usual LPC cepstrum and a perceptual lifter(PL). To caculate PL, we define a new weighting function in the linear frequency domain considering the frequency scale(Bark-scale) characteristics. The PL is the inverse Fourier transform of the exponents of the weighting function. We verified our method through the speech recognition experiments. The performance of PLC was compared with that of the rasied sine liftering method.

  • PDF

Sinusoidal Modeling of Audio Signals Using Perceptually Weighted Matching Pursuit (지각적으로 가중된 매칭 퍼슈잇을 이용한 오디오 신호의 정현파 모델링)

  • 김연지;이인성
    • The Journal of the Acoustical Society of Korea
    • /
    • v.22 no.2
    • /
    • pp.96-103
    • /
    • 2003
  • This paper describes a method for sinusoidal modeling of audio signals using perceptually weighted matching pursuit. Matching pursuits extracts iteratively the greatest energy signals from the input signals until the residual between the original and the reconstructed signal is zero. In this paper, perceptual matching pursuits using psychoacoustic model to matching pursuit extracts greatest perceived energy iteratively. To evaluate the performance of the perceptual matching pursuits it is compared with the sinusoidal matching pursuits which is not included perceptual weighting. For various audio signals the result of simulation shows that the perceptual matching pursuit is superior to the sinusoidal matching pursuits, especially for a high change rate in time domain it can synthesized original signal.

Enhanced Adjustment Strategy of Masking Threshold for Speech Signals in Low Bit-Rate Audio Coding (저전송률 오디오 부호화에서 음성 신호의 성능 개선을 위한 마스킹 임계값 적응기법 향상)

  • Lee, Chang-Heon;Kang, Hong-Goo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.1
    • /
    • pp.62-68
    • /
    • 2010
  • This paper proposes a new masking threshold adjustment strategy to improve the performance for speech signals in low bit-rate audio coding. After determining formant regions, the masking threshold is adjusted by using the energy ratio of each sub-band to the average energy of each formant. More quantization noises are added to the bands that have relatively large energy, but less distortion is allowed in spectral valley regions by allocating more bits, which reflects the concept of perceptual weighting widely used in speech coding. From the results of objective speech quality measure, we verified that the proposed method improves quality for the speech input signals compared to the conventional one.

New filter design to replace the post and perceptual weighting filter of transcoder and performance evaluation (상호부호화기의 후처리 필터와 인지가중 필터를 대신하는 새로운 필터 설계 및 성능 평가)

  • 최진규;윤성완;강홍구;윤대희
    • Proceedings of the IEEK Conference
    • /
    • 2003.07e
    • /
    • pp.2232-2235
    • /
    • 2003
  • In speech communication systems where two different speech codecs are interoperated, transcoding algorithm is a good approach because of its low complexity and improved synthesized speech quality. This paper proposes an efficient method to further improve the performance of transcoding algorithms as well as to reduce the complexity. In the conventional transcoding algorithms. a post-filter and a perceptual weighting filter should be operated sequentially because both decoding and encoding processes are needed. This results in the redundancy of the processing in terms of complexity and perceptual quality. Using the fact that their filter structures are similar, we replaced the two filters with one. The proposed algorithm requires 72.8% lower complexity than the conventional transcoding algorithm when we compare only the complexity of the filtering processes. The results of both objective and subjective tests verify that the proposed algorithm has slightly better quality than the conventional one.

  • PDF

Exploring stress encoding cues in English by Korean L2 speakers

  • Goun Lee
    • Phonetics and Speech Sciences
    • /
    • v.16 no.3
    • /
    • pp.33-38
    • /
    • 2024
  • The present study investigated the perceptual cues utilized by Korean L2 learners of English in recognizing lexical stress in English nonwords, with a focus on the roles of fundamental frequency (F0) and duration. Twenty-three Korean learners of English participated in a sequence recall task involving nonword stimuli under five different conditions: (1) the naturally-produced stimuli, (2) the duration-only condition, (3) the F0-only condition, (4) the duration-F0 matching condition, and (5) the duration-F0 conflicting condition. The results demonstrate that F0 is the primary cue for stress perception among Korean L2 learners, whereas duration acts as a secondary cue, particularly when F0 is unreliable or absent. These findings highlight the influence of L1 prosodic structures on L2 perception and suggest that Korean L2 learners adapt their perceptual weighting of stress based on cue availability. This study contributes to the understanding of the role of cue weighting in L2 prosodic acquisition.

A Perceptually Motivated Active Noise Control Design and Its Psychoacoustic Analysis

  • Bao, Hua;Panahi, Issa M.S.
    • ETRI Journal
    • /
    • v.35 no.5
    • /
    • pp.859-868
    • /
    • 2013
  • The active noise control (ANC) technique attenuates acoustic noise in a flexible and effective way. Traditional ANC design aims to minimize the residual noise energy, which is indiscriminative in the frequency domain. However, human hearing perception exhibits selective sensitivity for different frequency ranges. In this paper, we aim to improve the noise attenuation performance in perceptual perspective by incorporating noise weighting into ANC design. We also introduce psychoacoustic analysis to evaluate the sound quality of the residual noise by using a predictive pleasantness model, which combines four psychoacoustic parameters: loudness, sharpness, roughness, and tonality. Simulations on synthetic random noise and realistic noise show that our method improves the sound quality and that ITU-R 468 noise weighting even performs better than A-weighting.

HDTV Image Compression Algorithm Using Leak Factor and Human Visual System (누설요소와 인간 시각 시스템을 이용한 HDTV 영상 압축 알고리듬)

  • 김용하;최진수;이광천;하영호
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.19 no.5
    • /
    • pp.822-832
    • /
    • 1994
  • DSC-HDTV image compression algorithm removes spatial, temporal, and amplitude redundancies of an image by using transform coding, motion-compensated predictive coding, and adaptive quantization, respectively. In this paper, leak processing method which is used to recover image quality quickly from scene change and transmission error and adaptive quantization using perceptual weighting factor obtained by HVS are proposed. Perceptual weighting factor is calculated by contrast sensitivity, spatio-temporal masking and frequency sensitivity. Adaptive quantization uses the perceptual weighting factor and global distortion level from buffer history state. Redundant bits according to adaptation of HVS are used for the next image coding. In the case of scene change, DFD using motion compensated predictive coding has high value, large bit rate and unstabilized buffer states since reconstructed image has large quantization noise. Thus, leak factor is set to 0 for scene change frame and leak factor to 15/16 for next frame, and global distortion level is calculated by using standard deviation. Experimental results show that image quality of the proposed method is recovered after several frames and then buffer status is stabilized.

  • PDF

A Fine Granular Scalable Video Coding Algorithm using Frequency Weighting (주파수 특성을 이용한 미세 계위적 동영상 부호화 방법)

  • 김승환;호요성
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.40 no.6
    • /
    • pp.124-131
    • /
    • 2003
  • In this paper, we propose a Progressive scalable video coding algorithm using frequency weighting in the DCT domain. Since the human visual system (HVS) can be modeled as a nonlinear point transformation, called the modulation transfer function (MTF), we tan use the frequency weighting matrix to enhance the video image quality. We change this frequency weighting matrix into the frequency shift matrix to apply to the bit-plane coding method for the fine granular scalable (FGS) video coding We also define a new error metric JNDE (just noticeable difference) to measure the perceptual image quality in terms of human vision.