통합 검색 | Korea Science

HMM 기반의 한국어 합성음에 대한 PESQ 및 MOS 평가의 상관도 분석 (Correlation Analysis of PESQ and MOS Evaluation for HMM-based Synthetic Korean Speech)

임창송;배건성
- 말소리와 음성과학
- /
- 제2권1호
- /
- pp.71-75
- /
- 2010
The PESQ is an objective speech quality evaluation measure that is known to have a high correlation with a subjective speech quality measure such as MOS. To examine whether it could be useful as an objective quality measure of synthetic speech, we carried out both subjective evaluation tests with MOS and DMOS and an objective evaluation test with PESQ for HMM-based Korean synthetic speech signals and analyzed the correlation between them. Experimental results have shown that the PESQ has correlations of 0.87 with MOS and 0.92 with DMOS. It means that the PESQ holds much promise for evaluating the quality of synthetic Korean speech.
PDF

규칙합성음의 객관적 품질평가에 관한 연구 (A Study on Objective Quality Assessment for Synthesized speech by Rule)

홍진우;김순협
- 전자공학회논문지B
- /
- 제30B권10호
- /
- pp.42-49
- /
- 1993
In this paper, we evaluate the quality of synthesized speech by rule using the LPC CD as a objective measure, and then compare the test result with the subjective one. Speech used for the test consists of 108 words which are selected by word construction method using Korean attribute and frequency distribution, synthesized by demi-syllable rule. By evaluating the quality of synthesized speech by reule objectively, we have tried to resolve the problems such as lots of evaluation time, expansion of test scale, and variables of analysis result arised by subjective measure. We have, also, proved the validity of the objective test using the LPC CD, by comparing intelligibility which is the index for the subjective quality evaluation of synthesized speech by rule with MOS. From this results, we can provide a guide for quality assessment that would be useful in the R&D of synthesis method and the commercial products using synthesized speech.
PDF

규칙 합성음의 객관적 품질평가에 관한 연구 (A Study on Objective Quality Assessment of Synthesized Speech by Rule)

홍진우
- 한국음향학회:학술대회논문집
- /
- 한국음향학회 1991년도 학술발표회 논문집
- /
- pp.67-72
- /
- 1991
This paper evaluates thequality of synthesized speech by rule using the LPC CD in the objective measure and then compares the result with the subjective analysis. By evaluating the quality of synthesized speech by rule objectively. We have tried to resolve the problems (Evaluation time or size expansion, variables within the analysis results) that arise when the evaluation is done subjectively. Also by comparing intelligibility-the index for the subjective quality evaluation of synthesized speech by rule-with evaluation results obtained using MOS and the objective evaluation. We have proved the validity of the objective analysis and thus provides a guide that would be useful when R&D and marketing of synthesis by rule method is done.
PDF

심리음향 특성을 이용한 음성 향상 알고리즘 (A Speech Enhancement Algorithm based on Human Psychoacoustic Property)

전유용;이상민
- 전기학회논문지
- /
- 제59권6호
- /
- pp.1120-1125
- /
- 2010
In the speech system, for example hearing aid as well as speech communication, speech quality is degraded by environmental noise. In this study, to enhance the speech quality which is degraded by environmental speech, we proposed an algorithm to reduce the noise and reinforce the speech. The minima controlled recursive averaging (MCRA) algorithm is used to estimate the noise spectrum and spectral weighting factor is used to reduce the noise. And partial masking effect which is one of the human hearing properties is introduced to reinforce the speech. Then we compared the waveform, spectrogram, Perceptual Evaluation of Speech Quality (PESQ) and segmental Signal to Noise Ratio (segSNR) between original speech, noisy speech, noise reduced speech and enhanced speech by proposed method. As a result, enhanced speech by proposed method is reinforced in high frequency which is degraded by noise, and PESQ, segSNR is enhanced. It means that the speech quality is enhanced.
https://doi.org/10.5370/KIEE.2010.59.6.1120 인용 PDF KSCI

Performance Evaluation of Novel AMDF-Based Pitch Detection Scheme

Kumar, Sandeep
- ETRI Journal
- /
- 제38권3호
- /
- pp.425-434
- /
- 2016
A novel average magnitude difference function (AMDF)-based pitch detection scheme (PDS) is proposed to achieve better performance in speech quality. A performance evaluation of the proposed PDS is carried out through both a simulation and a real-time implementation of a speech analysis-synthesis system. The parameters used to compare the performance of the proposed PDS with that of PDSs that are based on either a cepstrum, an autocorrelation function (ACF), an AMDF, or circular AMDF (CAMDF) methods are as follows: percentage gross pitch error (%GPE); a subjective listening test; an objective speech quality assessment; a speech intelligibility test; a synthesized speech waveform; computation time; and memory consumption. The proposed PDS results in lower %GPE and better synthesized speech quality and intelligibility for different speech signals as compared to the cepstrum-, ACF-, AMDF-, and CAMDF-based PDSs. The computational time of the proposed PDS is also less than that for the cepstrum-, ACF-, and CAMDF-based PDSs. Moreover, the total memory consumed by the proposed PDS is less than that for the ACF- and cepstrum-based PDSs.
https://doi.org/10.4218/etrij.16.0115.0926 인용 PDF KSCI

통화품질 객관평가 모델링에 관한 연구 (A Study on the Objective Evaluation Model of Telephone Transmission Quality)

조재철;박순영;방만원
- 한국통신학회논문지
- /
- 제16권6호
- /
- pp.509-516
- /
- 1991
In this paper, we propose on objective evaluation model of telephone transmission qulity in order to estimate a satisfaction score regarding speech quality in a relephone network. As the degradantion factors of telephone transmission quality, this model takes into account transmission loss, noise, distortion, talker echo and sidetone. A performance index[PI] is introduced for five psychological factors affecting telephone speech qualty, and a Mean Opinion Score(MOS) is estimated from the sum of all Pis. The simulation results indicate theat the MOS obtained from the objective evaluation model is in good agreement with that of subjective evaluation.
PDF

명료발화와 보통발화에서 파킨슨병환자 음성의 켑스트럼 및 스펙트럼 분석 (Characteristics of voice quality on clear versus casual speech in individuals with Parkinson's disease)

신희백;심희정;정훈;고도흥
- 말소리와 음성과학
- /
- 제10권2호
- /
- pp.77-84
- /
- 2018
The purpose of this study is to examine the acoustic characteristics of Parkinsonian speech, with respect to different utterance conditions, by employing acoustic/auditory-perceptual analysis. The subjects of the study were 15 patients (M=7, F=8) with Parkinson's disease who were asked to read out sentences under different utterance conditions (clear/casual). The sentences read out by each subject were recorded, and the recorded speech was subjected to cepstrum and spectrum analysis using Analysis of Dysphonia in Speech and Voice (ADSV). Additionally, auditory-perceptual evaluation of the recorded speech was conducted with respect to breathiness and loudness. Results indicate that in the case of clear speech, there was a statistically significant increase in the cepstral peak prominence (CPP), and a decrease in the L/H ratio SD (ratio of low to high frequency spectral energy SD) and CPP F0 SD values. In the auditory-perceptual evaluation, a decrease in breathiness and an increase in loudness were noted. Furthermore, CPP was found to be highly correlated to breathiness and loudness. This provides objective evidence of the immediate usefulness of clear speech intervention in improving the voice quality of Parkinsonian speech.
https://doi.org/10.13064/KSSS.2018.10.2.077 인용 PDF KSCI

VoIP 코더들의 프레임손실은닉 알고리즘 성능평가 (Performance Evaluation of Frame Erasure Concealment Algorithms in VoIP Coders)

한승호;문광;한민수
- 대한음성학회:학술대회논문집
- /
- 대한음성학회 2004년도 춘계 학술대회 발표논문집
- /
- pp.235-238
- /
- 2004
Frame erasures cause speech quality degradation in wireless communication networks or packet networks. The degradation becomes worse when consecutive frame erasures occur. Speech coders have a frame erasure concealment(FEC) mechanism to compensate for frame erasures. It is meaningful to evaluate the performance of FEC mechanisms for frame erasures that occur in communications networks. In this paper, various frame erasures are designed. And the FEC algorithms of speech coders are evaluated and analyzed with the Perceptual Evaluation of Speech Quality(PESQ). It is found that the performances vary in accordance with frame erasure types, frame erasure rates, and utterance lengths.
PDF

Spline 코드북 기반의 spectral folding을 이용한 대역폭 확장 방법 (Bandwidth Expansion Method Using Spline Codebook Based Spectral Folding)

박지훈;한승호;양희식;정상배;한민수
- 대한음성학회:학술대회논문집
- /
- 대한음성학회 2006년도 추계학술대회 발표논문집
- /
- pp.131-134
- /
- 2006
Quality of narrowband speech $(0{\sim}4kHz)$ can be enhanced by the bandwidth expansion technique, by which the high- band components are estimated. This paper proposes the bandwidth expansion method using the spline codebook based spectral folding. For the performance evaluation, the PESQ(Perceptual Evaluation of Speech Quality) scores are measured as the objective measurement In addition, the MOS (Mean Opinion Score) and the preference tests are performed as the subjective measurement. The results show our proposed method outperforms the existing spline based one.
PDF

음성장애 환자에서 시행되는 청지각적 평가에 대한 논의 (Discussions on Auditory-Perceptual Evaluation Performed in Patients With Voice Disorders)

이승진
- 대한후두음성언어의학회지
- /
- 제32권3호
- /
- pp.109-117
- /
- 2021
The auditory-perceptual evaluation of speech-language pathologists (SLP) in patients with voice disorders is often regarded as a touchstone in the multi-dimensional voice evaluation procedures and provides important information not available in other assessment modalities. Therefore, it is necessary for the SLPs to conduct a comprehensive and in-depth evaluation of not only voice but also the overall speech production mechanism, and they often encounter various difficulties in the evaluation process. In addition, SLPs should strive to avoid bias during the evaluation process and to maintain a wide and constant spectrum of severity for each parameter of voice quality. Lastly, it is very important for the SLPs to perform a team approach by documenting and delivering important information pertaining to auditory-perceptual characteristics in an appropriate and efficient way through close communication with the laryngologists.
https://doi.org/10.22469/jkslp.2021.32.3.109 인용 PDF KSCI

검색결과 178건 처리시간 0.019초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)