• Title/Summary/Keyword: speech quality evaluation

Search Result 178, Processing Time 0.026 seconds

Comparison of Vowel and Text-Based Cepstral Analysis in Dysphonia Evaluation (발성장애 평가 시 /a/ 모음연장발성 및 문장검사의 켑스트럼 분석 비교)

  • Kim, Tae Hwan;Choi, Jeong Im;Lee, Sang Hyuk;Jin, Sung Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.26 no.2
    • /
    • pp.117-121
    • /
    • 2015
  • Background : Cepstral analysis which is obtained from Fourier transformation of spectrum has been known to be effective indicator to analyze the voice disorder. To evaluate the voice disorder, phonation of sustained vowel /a/ sound or continuous speech have been used but the former was limited to capture hoarseness properly. This study is aimed to compare the effectiveness in analysis of cepstrum between the sustained vowel /a/ sound and continuous speech. Methods : From March 2012 to December 2014, total 72 patients was enrolled in this study, including 24 unilateral vocal cord palsy, vocal nodule and vocal polyp patients, respectively. The entire patient evaluated their voice quality by VHI (Voice Handicap Index) before and after treatment. Phonation of sustained vowel /a/ sample and continuous speech using the first sentence of autumn paragraph was subjected by cepstral analysis and compare the pre-treatment group and post-treatment group. Results : The measured values of pre and post treatment in CPP-a (cepstral peak prominence in /a/ vowel sound) was 13.80, 13.91 in vocal cord palsy, 16.62, 17.99 in vocal cord nodule, 14.19, 18.50 in vocal cord polyp respectively. Values of CPP-s (cepstral peak prominence in text-based speech) in pre and post treatment was 11.11, 12.09 in vocal cord palsy, 12.11, 14.09 in vocal cord nodule, 12.63, 14.17 in vocal cord polyp. All 72 patients showed subjective improvement in VHI after treatment. CPP-a showed statistical improvement only in vocal polyp group, but CPP-s showed statistical improvement in all three groups (p<0.05). Conclusion : In analysis of cepstrum, text-based analysis is more representative in voice disorder than vowel sound speech. So when the acoustic analysis of voice by cepstrum, both phonation of sustained vowel /a/ sound and text based speech should be performed to obtain more accurate result.

  • PDF

Evaluation of the readability of self-reported voice disorder questionnaires (자기보고식 음성장애 설문지 문항의 가독성 평가)

  • HyeRim Kwak;Seok-Chae Rhee;Seung Jin Lee;HyangHee Kim
    • Phonetics and Speech Sciences
    • /
    • v.16 no.1
    • /
    • pp.41-48
    • /
    • 2024
  • The significance of self-reported voice assessments concerning patients' chief complaints and quality of life has increased. Therefore, readability assessments of questionnaire items are essential. In this study, readability analyses were performed based on text grade and complexity, vocabulary frequency and grade, and lexical diversity of the 11 Korean versions of self-reported voice disorder questionnaires (KVHI, KAVI, KVQOL, K-SVHI, K-VAPP, K-VPPC, TVSQ, K-VDCQ, K-VFI, K-VTDS, and K-VoiSS). Additionally, a comparative readability assessment was conducted on the original versions of these questionnaires to discern the differences between their Korean counterparts and the questionnaires for children. Consequently, it was determined that voice disorder questionnaires could be used without difficulty for populations with lower literacy levels. Evaluators should consider subjects' reading levels when conducting assessments, and future developments and revisions should consider their reading difficulties.

Communication Aid System For Dementia Patients (치매환자를 위한 대화 보조 시스템)

  • Sung-Ill Kim;Byoung-Chul Kim
    • Journal of Biomedical Engineering Research
    • /
    • v.23 no.6
    • /
    • pp.459-465
    • /
    • 2002
  • The goat of the present research is to improve the quality of life of both the elderly patients with dementia and their caregivers. For this Purpose, we developed a communication aid system that is consisted of three modules such as speech recognition engine, graphical agent. and database classified by a nursing schedule. The system was evaluated in an actual environment of nursing facility by introducing the system to an older mail patient with dementia. The comparison study was then carried out with and without system, respectively. The occupational therapists then evaluated subject"s reaction to the system by photographing his behaviors. The evaluation results revealed that the proposed system was more responsive in catering to needs of subject than professional caregivers. Moreover we could see that the frequency of causing the utterances of subject increased by introducing the system.

An efficient transcoding algorithm for AMR and G.723.1 speech coders and performance evaluation (AMR과 G.723.1 음성부호화기를 위한 효율적인 상호부호화 알고리듬 및 성능평가)

  • 최진규;윤성완;강홍구;윤대희
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.41 no.4
    • /
    • pp.121-130
    • /
    • 2004
  • In the application requiring the interoperability of different networks such as VoIP and wireless communication system, two speech codecs must work together with the structure of cascaded connection, tandem. Tandem has several problems such as long delay, high complexity and quality degradation due to twice complete encoding/decoding process. Transcoding is one of the best solutions to solve these problems. Transcoding algorithm is varied with the structure of source and target coder. In this paper, transcoding algorithm including the LSP conversion, the pitch estimation and new perceptual weighting filter for reducing complexity and improving qualify is proposed. These algorithms are applied to the pair of AMR md G.723.1. By employing the proposed algorithms in the transcoder, the complexity is reduced by about 20%-58% and quality is improved compared to tandem.

Development of Korean-to-English and English-to-Korean Mobile Translator for Smartphone (스마트폰용 영한, 한영 모바일 번역기 개발)

  • Yuh, Sang-Hwa;Chae, Heung-Seok
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.3
    • /
    • pp.229-236
    • /
    • 2011
  • In this paper we present light weighted English-to-Korean and Korean-to-English mobile translators on smart phones. For natural translation and higher translation quality, translation engines are hybridized with Translation Memory (TM) and Rule-based translation engine. In order to maximize the usability of the system, we combined an Optical Character Recognition (OCR) engine and Text-to-Speech (TTS) engine as a Front-End and Back-end of the mobile translators. With the BLEU and NIST evaluation metrics, the experimental results show our E-K and K-E mobile translation equality reach 72.4% and 77.7% of Google translators, respectively. This shows the quality of our mobile translators almost reaches the that of server-based machine translation to show its commercial usefulness.

Small-Aperture Adaptive Microphone Array System for High Quality Speech Acquisition (고품질 음성 취득을 위한 Small-Aper ture 적응 마이크로폰 어레이 시스템)

  • Lee, Junho;Park, Young-Cheol;Youn, Dae-Hee
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.1 no.1
    • /
    • pp.21-27
    • /
    • 2008
  • In this paper, a PC-based real-time microphone array system with small aperture is presented. The microphone array system is based on the generalized sidelobe canceler (GSC) but it employs a new adaptation mode controller (AMC). The performance of the proposed system was evaluated in the Multimedia Room modeled on an office situation. Evaluation experiments show that the proposed system can perform with stable noise suppression.

  • PDF

A Study of Subjective Quality-evaluation for Speech using VoIP Network (VoIP망을 이용한 음질의 주관적 품질평가에 관한 연구)

  • 강영도;강진석;최연성;김장형
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2001.05a
    • /
    • pp.285-290
    • /
    • 2001
  • 본 논문에서는 멀티미디어 서비스 요소 중의 하나인 VoIP(Voice Over Internet Protocol)망에서의 음성 품질에 대한 평가를 위해 VoIP망에서 송화자 내용- 발생과정에 있어서 어느 정도 완전히 표현되었는가를 나타내는 송화품질과 음성의 전송계를 통해 수화자에게 전달되는 과정에서 왜곡이나 잡음 등의 방해요인에 의해 열화되는 정도를 나타내는 전송품질, 그리고 수화자가 청각에서 신호처리 과정을 거친 송화자의 내용을 어느 징도 이해할 수 있는지를 나타내는 수화품질에 대한 주관적 방법을 평가한 후 통화품질을 측정한 내용을 분석하여 그 원인과 개선책에 대한 방법을 제시하고자 한다.

  • PDF

Comparison of Patient's Subjective Rating Scales for Voice Evaluation in Professional Voice Users with Vocal Fold Lesions (전문직 음성사용자의 주관적 음성평가도구간의 비교)

  • Kim, Jae-Ock;Choi, Sung-Hee;Lim, Sung-Eun;Choi, Jae-Nam;Choi, Hong-Shik
    • Proceedings of the KSPS conference
    • /
    • 2007.05a
    • /
    • pp.292-294
    • /
    • 2007
  • This study was designed to compare the translated patient's subjective rating scales for voice evaluation (Voice Handicap Index; VHI, Voice-Related Quality of Life; V-RQOL, Voice Rating Score; VRS) into Korean, taken from 24 professional voice users diagnosed with organic voice disorders. First, the correlation amongh those scales were observed. Second, the correlation between the patient's subjective rating scales and acoustic measures (Jitter%, Shimmer%, NHR) were examined. Third, those scales were compared by clinician's objective scale (G in GRBAS scale). Results indicated that significant correlations among the patients' subjective rating scales and significant correlations of clinician's rating scale with jitter% and Shimmer%, but not with NHR were observed. In addition, there were significant correlations of G with VHI and VHI-P (one of subscale of VHI). However, none of acoustic measures were correlated with the patient's subjective rating scales.

  • PDF

A Case Study on Vocal Aerobic Treatment Voice Therapy Development and Application for Classical Singers (성악가를 위한 VAT 음성치료 개발 및 적용 사례연구)

  • Yoo, Jae-Yeon;Lee, Ha-Na
    • 재활복지
    • /
    • v.22 no.1
    • /
    • pp.157-168
    • /
    • 2018
  • The purpose of this study is to investigate the impact of semi-closed vocal training-based Vocal Aerobic Treatment on the voice improvement of soprano. Study subject was one soprano who appealed to the suffering of her voice problem due to vocal cord nodule. A study method of conducting pre/post acoustic evaluation and subjective voice evaluation to compare the measures was used; Vocal Aerobic Treatment was carried out twice a week for a total of 32 session. In the acoustic evaluation, MDVP (multi-dimensional voice program) and VRP (voice range profile) were used to evaluate the pitch, voice quality, and voice range; in the subjective voice evaluation, SVHI (singing voice handicap index) was used to assess voice satisfaction. As a result of the pitch evaluation, the soprano maintained a proper Fo. As a result of the voice quality evaluation, the jitter, shimmer, and the noise harmonic ratio numbers decreased compared to the numbers shown before the treatment. As a result of the voice range evaluation, the scope of the range was broadened, with the number of semitone increasing from 30 to 35. As for the subjective voice evaluation, the result of the total score obtained after the survey report divided by the number of questions showed a decrease from 3.6 to 0.6. The soprano herself reported of having a minor extent of a voice problem. The summary of the above results reflects that Vocal Aerobic Treatment is useful in the voice improvement of vocalists However, as this study is case research regarding the Vocal Aerobic Treatment effect on one soprano, further research on the treatment effect covering many other vocalists is necessary. Also, there is a need for follow-up studies regarding voice management and voice treatment program on not only the vocalists but also the voice users in many other professions.

Preferred masking levels of water sounds according to various noise background levels in small scale open plan offices (소규모 개방형 사무실 배경 소음 레벨에 따른 최적 물소리 마스킹 레벨)

  • Tae-Hui Kim;Sang-Hyeon Lee;Chae-Hyun Yoon;Hyo-Won Sim;Joo-Young Hong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.42 no.6
    • /
    • pp.617-626
    • /
    • 2023
  • This study aims to investigate the preferred sound level of water sound for various levels of open-plan-office noise regarding soundscape quality and speech privacy. And assessment of the work efficiency of the water sound. For the laboratory experiment, office noise was recorded using a binaural microphone in a real open-plan office. For the assessment of the soundscape quality and speech privacy, Overall Soundscape Quality (OSQ) and Listening Difficulty (LD) were evaluated under three different sound levels (55 dBA, 60 dBA, and 65 dBA) and five different signal-to-noise ratios (SNR -10 dB, -5 dB, 0 dB, +5 dB, and +10 dB). After the evaluation, the preferred SNR was proposed according to OSQ and LD. For the assessment of to work efficiency of water sound, this study evaluated the cognitive performance of both of the condition noise only and combine the water sound with office noise. The results showed that LD increased as the water sound level increased, but OSQ decreased. When the water sound level was more than the office noise level, the OSQ decreased from noise only. Therefore, considering OSQ and LD, the preferred SNR of water sound was -5 dB for all noise levels. At the preferred level of water sound, the cognitive performance results were shown to decrease at 55 dBA compared to noise only, but at 60 dBA and 65 dBA combine the water sound results were increased than the noise only.