• 제목/요약/키워드: perceptual quality

검색결과 344건 처리시간 0.025초

Discrimination of Synthesized English Vowels by American and Korean Listeners

  • Yang, Byung-Gon
    • 음성과학
    • /
    • 제13권1호
    • /
    • pp.7-27
    • /
    • 2006
  • This study explored the discrimination of synthesized English vowel pairs by twenty-seven American and Korean, male and female listeners. The average formant values of nine monophthongs produced by ten American English male speakers were employed to synthesize the vowels. Then, subjects were instructed explicitly to respond to AX discrimination tasks in which the standard vowel was followed by another one with the increment or decrement of the original formant values. The highest and lowest formant values of the same vowel quality were collected and compared to examine patterns of vowel discrimination. Results showed that the American and Korean groups discriminated the vowel pairs almost identically and their center formant frequency values of the high and low boundary fell almost exactly on those of the standards. In addition, the acceptable range of the same vowel quality was similar among the language and gender groups. The acceptable thresholds of each vowel formed oval to maintain perceptual contrast from adjacent vowels. The results suggested that nonnative speakers with high English proficiency could match native speakers' performance in discriminating vowel pairs with a shorter inter-stimulus interval. Pedagogical implications of those findings are discussed.

  • PDF

Source controlled variable bit-rate scheme을 이용한 파형 보간 부호화기의 음질 개선 기법 (Enhanced source controlled variable bit-rate scheme in a waveform interpolation coder)

  • 조근석;양희식;정상배;한민수
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2007년도 한국음성과학회 공동학술대회 발표논문집
    • /
    • pp.315-318
    • /
    • 2007
  • This paper proposes the methods to enhance the speech quality of source controlled variable bit-rate coder based on the waveform interpolation. The methods are to estimate and generate the parameters that are not transmitted from encoder to decoder by the repetition and extrapolation schemes. For the performance evaluation, the PESQ(Perceptual Evaluation of Speech Quality) scores are measured. The experimental results shows that our proposed method outperforms the conventional source controlled variable bit-rate coder. Especially, the performance of the extrapolation method is better than that of the repetition method.

  • PDF

PCS 이동전화망에서의 객관적인 음질평가척도별 성능비교 (Performance Comparison for Objective Measures of Speech Quality Evaluation in PCS Wireless Telephone Network)

  • 김낙철;김광수;정호열;정현열
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1999년도 학술발표대회 논문집 제18권 1호
    • /
    • pp.48-51
    • /
    • 1999
  • 본 연구에서는 PCS 이동전화의 객관적 통화품질평가 척도개발을 위한 기초연구로 기존의 CD(Cepstral Distance), MSD (Mel Spectral Distance), BSD(Bark Spectral Distance), PSQM (Perceptual Speech Quality Measure) 척도를 적용하여 그 성능을 비교 분석하였다. 이 척도들을 실제환경에서 수집된 PCS 음성데이터에 대해서 적용하였고 이 결과치와 청취자들의 평가 반응에 의해 얻어진 MOS 결과치와의 상관성을 조사하였다. 실험 결과, BSD와 PSQM 척도의 상관성이 0.81, 0.84로 나타나 CD, MSD보다 성능이 더 우수함을 보였다.

  • PDF

MPEG Audio을 위 한 MDCT/IMDCT의 설계에 관한 연구 (A Study on the Design of MDCT/IMDCT for MPEG Audio)

  • 김정태;방기천;이강현
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 1999년도 하계종합학술대회 논문집
    • /
    • pp.530-533
    • /
    • 1999
  • During the last decade, high quality digital audio has essentially replaced analog audio. During this period, digital audio have applied many application areas of the info-industry. These applications have created a demand for high quality digital audio. In audio compression, the methods using human auditory nervous properties are used and introduced from psychoacoustical model utilized perceptual audio coding unable to code above the limitation of human perception. The discussion concentrates on architectures and applications of those techniques which utilize psychoacoustical models to exploit efficiently masking characteristics of the human receiver. In this paper, the designed MDCT/IMBCT as a standard of current MPEG is implemented onto FPGA.

  • PDF

Spline 코드북 기반의 spectral folding을 이용한 대역폭 확장 방법 (Bandwidth Expansion Method Using Spline Codebook Based Spectral Folding)

  • 박지훈;한승호;양희식;정상배;한민수
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2006년도 추계학술대회 발표논문집
    • /
    • pp.131-134
    • /
    • 2006
  • Quality of narrowband speech $(0{\sim}4kHz)$ can be enhanced by the bandwidth expansion technique, by which the high- band components are estimated. This paper proposes the bandwidth expansion method using the spline codebook based spectral folding. For the performance evaluation, the PESQ(Perceptual Evaluation of Speech Quality) scores are measured as the objective measurement In addition, the MOS (Mean Opinion Score) and the preference tests are performed as the subjective measurement. The results show our proposed method outperforms the existing spline based one.

  • PDF

CDMA 이동전화 통화품질평가를 위한 객관적 음질평가척도별 성능 비교 (Performance Comparison of Objective Measures for Speech Quality for Evaluation in CDMA Mobile Telephone)

  • 이준희;김광수;윤정오
    • 한국산업정보학회:학술대회논문집
    • /
    • 한국산업정보학회 2001년도 춘계학술대회논문집:21세기 신지식정보의 창출
    • /
    • pp.256-260
    • /
    • 2001
  • 본 논문에서는 디지털 이동전화(CDMA) 채널환경을 통과한 왜곡된 전화음성에 대해 객관적 음질평가 척도의 개발을 위한 기초 연구로서 기존의 CD(Cepstral Distance), MSD(Mel Spectral Distance), BSD(Bark Spectral Distance), Modified BSD, PSQM(Perceptual Speech Quality Measure)를 대상으로 객관척도 알고리즘을 성능평가 하였다. 이 척도들은 실제 이동전화 환경에서 수집된 PCS 음성데이터에 대해서 적용하였으며 이 결과치를 주관적 음질평가 방법인 MU와 상관성을 비교 조사하였다. 실험 결과, BSD와 MBSD, 그리고 PSQM 척도의 상관성이 각각 0.80, 0.85, 0.84로 나타났으며 CD, MSD 보다 성능이 상대적으로 더 우수함을 보였다.

  • PDF

Watermarking for Digital Images Using Differences and Means of the Neighboring Wavelet Coefficients

  • Kim, Hyun-Soon;Bae, Sung-Ho;Yoon, Ock-Kyung;Park, Kil-Houm
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2000년도 ITC-CSCC -1
    • /
    • pp.466-469
    • /
    • 2000
  • In this paper, a watermarking technique for digital images is proposed. In our method, an image is 1-1eve1 wavelet transformed, and then the watermark of a binary stamp is embedded into the baseband. The watermark is embedded by inverting the polarities of the selected coefficient pairs. In the inverting process, we can increase perceptual image quality by finding means and differences of the selected neighboring coefficient pairs, and then adding values, which are inversely proportional to the differences, to the means. The experimental results show that the proposed method has good quality and is robust to JPEG lossy compression and various image processing operations.

  • PDF

An Adaptive Rate Control Algorithm for RCBR Transmission of Streaming Video

  • Hwangjun Song
    • 한국통신학회논문지
    • /
    • 제27권2A호
    • /
    • pp.146-156
    • /
    • 2002
  • This paper presents an adaptive H.263+ rate control algorithm for streaming video applications under the networks supporting bandwidth renegotiation, which can communicate with end-users to accommodate their time-varying bandwidth requests during the data transmission. That is, the requests of end-users can be supported adaptively according to the availability of the network resources, and thus the overall network utilization can be improved simultaneously. They are especially suitable for the transmission of non-stationary video traffics. The proposed rate control algorithm communicates with the network to renegotiate the required bandwidth fort the underlying video which are measured based on the motion change information, and choose their control strategies according to the renegotiation results. Unlike most conventional algorithms that control only the spatial quality by adjusting quantization parameters, the proposed algorithm treats both the spatial and temporal qualities at the same time to enhance human visual perceptual quality. Experimental results are provided to demonstrate that the proposed rate control algorithm can achieve superior performance to the conventional ones with low computational complexity under the networks supporting bandwidth renegotiation.

DCT영역에서의 국부 Contrast 조절 기법 (Method for Local Contrast Control in DCT Domain)

  • ;;김원하;김선국
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 2013년도 추계학술대회
    • /
    • pp.8-11
    • /
    • 2013
  • We implement the foveation and frequency sensitivity feature of human visual system in discrete cosine transform (DCT) domain. Resolution of human visual perception decays as distance from the eye-focused point, known as foveation property, and the middle frequency components give most pleasant image quality to human than the low and high frequency components, which is the frequency sensitivity property of human visual system. For satisfying the foveation property, we enhanced the local contrast at the focused regions and smoothed local contrast at the non-focused regions in the DCT domain without bringing the blocking and ringing artifacts. Moreover, the energies at each DCT frequency components is modified with various degree to fulfill the frequency sensitivity property. The proposed method is verified by the subjective and objective evaluations that it can the improve the human perceptual visual quality.

  • PDF

Weighted DCT-IF for Image up Scaling

  • Lee, Jae-Yung;Yoon, Sung-Jun;Kim, Jae-Gon;Han, Jong-Ki
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권2호
    • /
    • pp.790-809
    • /
    • 2019
  • The design of an efficient scaler to enhance the edge data is one of the most important issues in video signal applications, because the perceptual quality of the processed image is sensitively affected by the degradation of edge data. Various conventional scaling schemes have been proposed to enhance the edge data. In this paper, we propose an efficient scaling algorithm for this purpose. The proposed method is based on the discrete cosine transform-based interpolation filter (DCT-IF) because it outperforms other scaling algorithms in various configurations. The proposed DCT-IF incorporates weighting parameters that are optimized for training data. Simulation results show that the quality of the resized image produced by the proposed DCT-IF is much higher than that of those produced by the conventional schemes, although the proposed DCT-IF is more complex than other conventional scaling algorithms.