• Title/Summary/Keyword: speech quality evaluation

Search Result 178, Processing Time 0.025 seconds

Auditory-Perceptual and Acoustic Evaluation in Measuring Dysphonia Severity of Vocal Cord Paralysis (성대마비의 음성장애 측정을 위한 청지각적 및 음향학적 평가)

  • Kim, Geun-Hyo;Lee, Yeon-Woo;Park, Hee-June;Bae, In-Ho;Lee, Byung-Joo;Kwon, Soon-Bok
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.28 no.2
    • /
    • pp.106-111
    • /
    • 2017
  • Background and Objectives : The purpose of this study was to investigate the criterion-related concurrent validity of two standardized auditory-perceptual assessments and the Acoustic Voice Quality Index (AVQI) for measuring dysphonia severity in patients with vocal cord paralysis (VCP). Materials and Methods : Total 210 patients with VCP and 236 normal voice subjects were asked to sustain the vowel [a:] and to read aloud the Korean text "Walk". A 2 second mid-vowel portion of the sustained vowel and two sentences (with 26 syllables) were recorded. And then voice samples were edited, concatenated, and analyzed according to Praat script. Two standardized auditory-perceptual assessment (GRBAS and CAPE-V) were performed by three raters. Results : The VCP group showed higher AVQI, Grade (G) and Overall Severity (OS) values than normal voice group. And the correlation among AVQI, G, and OS ranged from 0.904 to 0.926. In ROC curve analysis, cutoff values of AVQI, G, and OS were <3.79, <0.00, and <30.00, respectively, and the AUC of each analysis was over .89. Conclusion : AVQI and auditory evaluation can improve the early screening ability of VCP voice and help to establish effective diagnosis and treatment plan for VCP-related dysphonia.

  • PDF

Transcoding Algorithm for SMV and G.729A Vocoders via Direct Parameter Transformation (G.729A와 SMV 음성부호화기를 위한 파라미터 직접 변환 방식의 상호부호화 알고리듬)

  • 장달원;서성호;이선일;유창동
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.40 no.6
    • /
    • pp.71-83
    • /
    • 2003
  • In this paper, a novel transcoding algorithm for the G.729A and the Selectable Mode Vocoder(SMV) vocoders via direct parameter transformation is proposed. In contrast to the conventional tandem transcoding algorithm, the proposed algorithm converts the parameters of one coder to the other without going through the decoding and encoding processes. In transcoder from SMV to G.729A, LSP conversion algorithm, pitch delay conversion algorithm and transcoding algorithm in lower rate are proposed, and in transcoder from G.729A to SMV, LSP conversion algorithm, pitch delay conversion algorithm and rate selection algorithm are proposed. Evaluation results show that while exhibiting better computational and delay characteristics, the proposed algorithm produces equivalent or Improved speech quality to that produced by the tandem transcoding algorithm.

Measurement and Evaluation of the Acoustic Performance in the Royal Palace Buildings of Joseon Dynasty - Focused on Pyeonjeon and Chimjeon - (조선 궁궐 건축물의 음향성능 측정 및 평가 - 편전 및 침전을 중심으로 -)

  • Kim, Nam-Wook;Kim, Myung-Jun;Han, Wook
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.19 no.12
    • /
    • pp.1269-1280
    • /
    • 2009
  • This study was performed to construct sound performance DB of royal palace buildings and to examine the special quality more scientifically. Research target of royal palace were Changdeokgung and Gyeongbokgung. Sound insulation performance between the adjacent room and facade, room acoustics of Pyeonjeon and Chimjeon which is representative building in royal palace were examined through field measurement. Measured values of RT($T_{mf}$) at Pyeonjeon were 0.78 sec. and 1.03 sec. in Seonjeongjoen and Sajeongjoen, respectively. The RTs of both Pyeonjeon buildings were estimated suitable for speech and lecture considering their volume. The RT($T_{mf}$)s at Chimjeon were measured in range of 0.29~0.55 sec. This meant that the acoustic energy in rooms was decreased by sound transmission through mulberry paper(Hanji) of traditional windows and doors. As a sound insulation performance, the single-number quantities($D_{ls,2m,nT,w}$) of the building facades in Pyeonjeon and Chimjeon were measured 4~20 dB. Also the single-number quantities($D_{p,w}$) between the adjacent rooms in Chimjeon were measured 3~18 dB. Sound insulation performance of traditional building elements such as window and door depended strongly on their layers and area.

Evaluation of Women with Myofascial Abdominal Syndrome Based on Traditional Chinese Medicine

  • Mitidieri, Andreia;Gurian, Maria Beatriz;Silva, Ana Paula;Tawasha, Kalil;Poli-Neto, Omero;Nogueira, Antonio;Reis, Francisco;Rosa-e-Silva, Julio
    • Journal of Pharmacopuncture
    • /
    • v.18 no.4
    • /
    • pp.26-31
    • /
    • 2015
  • Objectives: This study used semiology based on traditional Chinese medicine (TCM) to investigate vital energy (Qi) behavior in women with abdominal myofascial pain syndrome (AMPS). Methods: Fifty women diagnosed with chronic pelvic pain (CPP) secondary to AMPS were evaluated by using a questionnaire based on the theories of "yin-yang," "zang-fu", and "five elements". We assessed the following aspects of the illness: symptomatology; specific location of myofascial trigger points (MTrPs); onset, cause, duration and frequency of symptoms; and patient and family history. The patients tongues, lips, skin colors, and tones of speech were examined. Patients were questioned on various aspects related to breathing, sweating, sleep quality, emotions, and preferences related to color, food, flavors, and weather or seasons. Thirst, gastrointestinal dysfunction, excreta (feces and urine), menstrual cycle, the five senses, and characteristic pain symptoms related to headache, musculoskeletal pain, abdomen, and chest were also investigated. Results: Patients were between 22 and 56 years old, and most were married (78%), possessed a elementary school (66%), and had one or two children (76%). The mean body mass index and body fat were 26.86 kg/cm2 (range: 17.7 - 39.0) and 32.4% (range: 10.7 - 45.7), respectively. A large majority of women (96%) exhibited alterations in the kidney meridian, and 98% had an altered gallbladder meridian. We observed major changes in the kidney and the gallbladder Qi meridians in 76% and 62% of patients, respectively. Five of the twelve meridians analyzed exhibited Qi patterns similar to pelvic innervation Qi and meridians, indicating that the paths of some of these meridians were directly related to innervation of the pelvic floor and abdominal region. Conclusion: The women in this study showed changes in the behavior of the energy meridians, and the paths of some of the meridians were directly related to innervation of the pelvic floor and abdominal region.

Differentiation of Adductor-Type Spasmodic Dysphonia from Muscle Tension Dysphonia Using Spectrogram (스펙트로그램을 이용한 내전형 연축성 발성 장애와 근긴장성 발성 장애의 감별)

  • Noh, Seung Ho;Kim, So Yean;Cho, Jae Kyung;Lee, Sang Hyuk;Jin, Sung Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.28 no.2
    • /
    • pp.100-105
    • /
    • 2017
  • Background and Objectives : Adductor type spasmodic dysphonia (ADSD) is neurogenic disorder and focal laryngeal dystonia, while muscle tension dysphonia (MTD) is caused by functional voice disorder. Both ADSD and MTD may be associated with excessive supraglottic contraction and compensation, resulting in a strained voice quality with spastic voice breaks. The aim of this study was to determine the utility of spectrogram analysis in the differentiation of ADSD from MTD. Materials and Methods : From 2015 through 2017, 17 patients of ADSD and 20 of MTD, underwent acoustic recording and phonatory function studies, were enrolled. Jitter (frequency perturbation), Shimmer (amplitude perturbation) were obtained using MDVP (Multi-dimensional Voice Program) and GRBAS scale was used for perceptual evaluation. The two speech therapist evaluated a wide band (11,250 Hz) spectrogram by blind test using 4 scales (0-3 point) for four spectral findings, abrupt voice breaks, irregular wide spaced vertical striations, well defined formants and high frequency spectral noise. Results : Jitter, Shimmer and GRBAS were not found different between two groups with no significant correlation (p>0.05). Abrupt voice breaks and irregular wide spaced vertical striations of ADSD were significantly higher than those of MTD with strong correlation (p<0.01). High frequency spectral noise of MTD were higher than those of ADSD with strong correlation (p<0.01). Well defined formants were not found different between two groups. Conclusion : The wide band spectrograms provided visual perceptual information can differentiate ADSD from MTD. Spectrogram analysis is a useful diagnostic tool for differentiating ADSD from MTD where perceptual analysis and clinical evaluation alone are insufficient.

  • PDF

The Verification of Korean Version Swallowing Disturbance Questionnaire (K-SDQ) (한국판 삼킴 곤란 척도(K-SDQ)의 번안본 검증)

  • Jung, SoWoon;Kim, JungWan
    • 재활복지
    • /
    • v.22 no.4
    • /
    • pp.43-58
    • /
    • 2018
  • Swallowing disorders that can affect nutrient intakes and quality of life are commonly shown among the elderly as well as patients with neurogenic disorder. This study verifies the reliability and validity of the Swallowing Disturbance Questionnaire (SDQ), a subjective swallowing disability assessment tool, modified for Koreans' eating habit and cultural sentiment, against 105 stroke patients, in order to help identify early swallowing problems of the elderly. Reliability of internal consistency in the Korean version of SDQ is .601, test-retest reliability is .97, and concurrent validity is .956. Based on 8 points of cut-off score, 46.8% of sensitivity and 81.6% of specificity. Comparing the results of video fluoroscopic study (VFSS), an objective swallowing disorder test with those of Korean version of SDQ, negative predictive value (NPV) and positive predictive value (PPV) was shown as 81% and 53%. The Korean version of SDQ is expected to be a useful testing tool to discriminate swallowing disorders in stroke patients. It has great clinical significance in that swallowing difficulties shown by subjects can be sorted out to request a diagnostic assessment before clinical evaluation by a rehabilitation therapist or ruling out unnecessary exposure to additional tests by accurately identifying stroke patients without swallowing problems.

The Development of the Korean Evaluation Scale for Hearing Handicap (KESHH) for the Geriatric Hearing Los (노인성난청을 위한 청각장애평가지수(KESHH)의 개발)

  • Ku, Ho-Lim;Kim, Jin-Sook
    • 한국노년학
    • /
    • v.30 no.3
    • /
    • pp.973-992
    • /
    • 2010
  • The hearing impairment is the representative disorder that affects the quality of the routine life of the aged period. This study was aimed to develop the Korean evaluation scale for hearing handicap(KESHH) with which we can evaluate social and psychological effects of the hearing impairment. Applying this scale clinically, we can analyze the geriatric hearing loss specifically and improve the quality of the aural rehabilitation that can help the hardness of the hearing impairment. Data were collected from 288 participants(176 hearing aid users and 112 non-hearing aid users) and the average age of the participants was 67.4 years old ( 60.15 for the hearing aids users and 78.9 for the non hearing users). The composition ratio of the male and female participants were 58.0% and 42.0% and extrovert and introvert personality were 49.3% and 50.7% showing balanced formation. The tentative draft of KESHH measurements were produced with 30 items and following 5 subscales. Using factor analysis, 6 items were erased and 4 subscales - social effect, psycho/emotional effect, interpersonal effect, and perception of hearing aids - were identified. As each subscale consisted of 6 items, 24 items were corrected and remained totally. Conclusively, the KESHH was developed with 24 items and 4 subscales including 6 items on each subscale. In addition, the KESHH was divided into type-1 and 2 depending on hearing aid users and non hearing aid users. The results of this study can be summarized as the following 5 parts. Firstly, the reliabilities of the KESHH were proved to be high because the subscales' Cronbach alpha values were from 0.723 through 0.895. Secondly, the KESHH showed systematically increasing score as the hearing impairment increased. The lowest score was 24 and the highest score was 117 and the average scores of the hearing impaired and non-hearing impaired are 72.06(SD=15.67) and 66.98(SD=20.94) showing 5.08 increased score for the hearing impaired. Depending on the degree of the hearing loss, the scores recorded 52.63 at the below of the mild hearing loss, 67.29 for the moderate hearing loss, 71.89 for the moderately severe hearing loss, and 75.57 for the severe hearing loss The comparison of the scores by hearing levels indicated that the higher the hearing levels were, the higher the scores of the KESHH with statistical significance(p<0.001). Thirdly, the correlation among 4 subscales was 0.384~0.880(p<0.001). Also, the pure tone average, personality, and the four subscales correlations showed statistical significance with 0.148~0.880 except for the pure tone average and personality and the pure tone average and perception of hearing aids. Fourthly, the total variances explained for the independent subscles were analyzed with multiple regression. The social effect was explained 17.4% with pure tone average, personality, and the status of hearing aid use variances. The psycho/emotional effect was explained 14.4% with puretone average, personality, and age variances. The interpersonal effect was explained 11.2% with pure tone average, personality, and the status of hearing aid use variances. The perception of hearing aids effect was explained 2.2% with only personality. Finally, test-retest reliability was proved to be high with 0.791(p<0.001). Conclusively, the KESHH that was developed considering Korean culture can be a useful instrument for expressing the hearing handicaps of the Korean aged hearing impaired in scores for both hearing aid users and non-users. Also, it is thought that the KESHH is useful clinically for identifying the changes of the hearing handicap scores before and after wearing hearing aids and aural rehabilitation at diverse situations.

Real data-based active sonar signal synthesis method (실데이터 기반 능동 소나 신호 합성 방법론)

  • Yunsu Kim;Juho Kim;Jongwon Seok;Jungpyo Hong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.43 no.1
    • /
    • pp.9-18
    • /
    • 2024
  • The importance of active sonar systems is emerging due to the quietness of underwater targets and the increase in ambient noise due to the increase in maritime traffic. However, the low signal-to-noise ratio of the echo signal due to multipath propagation of the signal, various clutter, ambient noise and reverberation makes it difficult to identify underwater targets using active sonar. Attempts have been made to apply data-based methods such as machine learning or deep learning to improve the performance of underwater target recognition systems, but it is difficult to collect enough data for training due to the nature of sonar datasets. Methods based on mathematical modeling have been mainly used to compensate for insufficient active sonar data. However, methodologies based on mathematical modeling have limitations in accurately simulating complex underwater phenomena. Therefore, in this paper, we propose a sonar signal synthesis method based on a deep neural network. In order to apply the neural network model to the field of sonar signal synthesis, the proposed method appropriately corrects the attention-based encoder and decoder to the sonar signal, which is the main module of the Tacotron model mainly used in the field of speech synthesis. It is possible to synthesize a signal more similar to the actual signal by training the proposed model using the dataset collected by arranging a simulated target in an actual marine environment. In order to verify the performance of the proposed method, Perceptual evaluation of audio quality test was conducted and within score difference -2.3 was shown compared to actual signal in a total of four different environments. These results prove that the active sonar signal generated by the proposed method approximates the actual signal.