• Title/Abstract/Keywords: Voice Diagnosis

Search results: 151 items; processing time: 0.028 s

각종 음성분석기에 따른 음성장애 환자의 주기간 주파수 및 진폭변동률 분석 (Jitter and Shimmer Measurements of Dysphonia among the Different Voice Analysis Programs)

  • 최성희;남도현;이승훈;정원혁;김덕원;최홍식
    • 대한후두음성언어의학회지 / Vol. 16, No. 2 / pp. 140-145 / 2005
  • Background and Objectives: Voice perturbation measures such as jitter and shimmer have been widely used in diagnosing laryngeal dysfunction and evaluating treatment efficacy. This study investigated the validity of a newly developed multi-channel voice analyzer program by comparing it with MDVP, PRAAT, and TF32. In addition, we compared voice perturbation measures across the voice analysis programs by signal type. Materials and Methods: Nineteen patients with mild to severe dysphonia participated. Fundamental frequency, jitter, and shimmer values were obtained from each program using the same sustained /ah/ phonation. Results: Between the newly developed multi-channel voice analyzer program and the other programs, fundamental frequency and shimmer were highly correlated, whereas jitter was weakly correlated, although the pitch computation algorithms differed except for MDVP. In addition, Type 2 and Type 3 signals were more weakly correlated than Type 1 signals. Conclusion: In clinical settings, clinicians should have sufficient knowledge of the voice analyzer and should control conditions appropriately for the severity of the pathologic voice before measuring voice perturbation in order to obtain reliable results.
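
As a rough illustration of the perturbation measures compared in this abstract, the sketch below computes a common "local" jitter and shimmer: the mean absolute difference between consecutive cycles, expressed as a percentage of the mean. This is only one of several definitions in use and is not taken from MDVP, PRAAT, TF32, or the multi-channel analyzer; the function name `local_perturbation` and the per-cycle values are hypothetical.

```python
from statistics import mean


def local_perturbation(values):
    """Mean absolute cycle-to-cycle difference as a percentage of the mean value.

    Applied to glottal periods this gives a local jitter (%);
    applied to per-cycle peak amplitudes it gives a local shimmer (%).
    """
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return 100.0 * mean(diffs) / mean(values)


# Hypothetical per-cycle measurements from a sustained vowel (assumed values).
periods_ms = [5.0, 5.1, 4.9, 5.0]   # duration of each glottal cycle
amplitudes = [1.0, 0.9, 1.1, 1.0]   # peak amplitude of each cycle

jitter_percent = local_perturbation(periods_ms)
shimmer_percent = local_perturbation(amplitudes)
print(f"jitter: {jitter_percent:.2f}%  shimmer: {shimmer_percent:.2f}%")
```

Because both measures depend on how individual cycles are detected, programs with different pitch extraction algorithms can plausibly report different jitter values for the same recording, which is consistent with the weak jitter correlation reported above.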

성별에 따른 한국 정상 성인 음성의 음향학적 평가 기준치 (Acoustic Characteristics of the Voices of Korean Normal Adults by Gender on MDVP)

  • 김재옥
    • 말소리와 음성과학 / Vol. 1, No. 4 / pp. 147-157 / 2009
  • The purpose of this study was to develop a normal voice database and to analyze the acoustic characteristics of Korean adults' voices by gender using MDVP. Eight categories of the 34 MDVP parameters were analyzed in the voices of 170 normal Korean adults producing the vowel /a/. Among them, the Fundamental Frequency Parameters and Frequency Perturbation Parameters differed significantly by gender. In addition, the Fundamental Frequency Parameters in our data differed markedly from the reference data provided in the MDVP program, which is currently used in clinics. Therefore, the data obtained in this study can be used as standard MDVP parameter values for diagnosing voice disorders in Korean adults.

음원 파라미터 모델과 인공신경망을 이용한 음성장애 검출 (Screening of Voice Disorder using Source Parameter Model and Artificial Neural Network)

  • 파벨시틸;조철우;미샤파벨
    • 음성과학 / Vol. 15, No. 2 / pp. 89-97 / 2008
  • There are a number of clinical conditions that directly or indirectly affect the physical properties of the vocal folds and thereby the pressure waveforms of the elicited sounds. If the relationships between the clinical conditions and voice quality are sufficiently reliable, it should be possible to detect these diseases or disorders. The focus of this paper is to determine the set of features and their values that characterize the state of the speaker's vocal folds. To the extent that these features capture the anatomical, physiological, and neurological aspects of the speaker, they can potentially be used to support an unobtrusive approach to diagnosis. We present a new approach to this problem, supported by results obtained from two disordered voice corpora.

보툴리눔독소 주입에 의한 음성장애 및 언어장애의 치료 (Botulinum Toxin Injection for the Treatment of Voice and Speech Disorders)

  • 최홍식
    • 음성과학 / Vol. 3 / pp. 5-17 / 1998
  • Botulinum toxin, a neurotoxin derived from Clostridium botulinum, has been injected into the target muscle(s) to treat several kinds of voice and speech disorders at the Voice Clinic, Yonsei Institute of Logopedics and Phoniatrics, since December 1995. Criteria for diagnosis and methods of injection for spasmodic dysphonia, mutational dysphonia, muscle tension dysphonia, dysphonia after total laryngectomy, and stuttering are summarized. Among 144 patients with adductor type spasmodic dysphonia, who received between one and eight injections during the 27 months, 90% were judged to show at least slight improvement. Although the number of injected cases was small, results for abductor type spasmodic dysphonia, intractable mutational dysphonia, and muscle tension dysphonia resistant to voice therapy suggested that botulinum toxin injection could be another treatment option. Patients who cannot phonate after total laryngectomy and some adults who stutter can also be candidates for botulinum toxin injection.

혈압 상승이 성대 진동 및 음성 에너지 크기에 미치는 영향 분석 (Analysis for the Effect of Blood Pressure Increase on Vocal Cord Vibration and Voice Intensity)

  • 김봉현
    • 한국정보통신학회논문지 / Vol. 17, No. 2 / pp. 431-437 / 2013
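  • Although the quality of healthy living is improving, suffering caused by chronic disease is increasing day by day. The main factors in chronic disease include stress, blood pressure, and obesity, and the incidence of chronic disease caused by hypertension is very high. This paper therefore analyzes the voice as blood pressure rises and proposes a method for early diagnosis and prevention of sustained blood pressure elevation. To this end, blood pressure was raised through aerobic exercise, voice samples were then collected, and two voice analysis techniques were applied to analyze the effect of elevated blood pressure on the voice: Pitch, which measures vocal cord vibration, and Intensity, which measures the magnitude of voice energy.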
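
The abstract above does not say how Pitch and Intensity were computed. As a minimal sketch under that caveat, the code below estimates pitch by picking the autocorrelation peak within an assumed 75-500 Hz search range and reports intensity as the RMS level in dB relative to full scale. The function names, the search range, and the synthetic test tone are all assumptions for illustration.

```python
import math


def estimate_pitch_hz(samples, sample_rate, fmin=75.0, fmax=500.0):
    """Estimate pitch as sample_rate / lag, where lag maximizes the autocorrelation."""
    min_lag = int(sample_rate / fmax)
    max_lag = int(sample_rate / fmin)
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        # Unnormalized autocorrelation at this lag.
        score = sum(samples[n] * samples[n + lag]
                    for n in range(len(samples) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag


def intensity_db(samples):
    """RMS level in dB relative to an amplitude of 1.0 (full scale)."""
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return float("-inf") if rms == 0 else 20.0 * math.log10(rms)


# Synthetic stand-in for a voice frame: 0.1 s of a 200 Hz tone at 8 kHz.
sample_rate = 8000
tone = [0.5 * math.sin(2 * math.pi * 200 * n / sample_rate) for n in range(800)]

pitch = estimate_pitch_hz(tone, sample_rate)
level = intensity_db(tone)
print(f"pitch: {pitch:.1f} Hz  intensity: {level:.2f} dB")
```

Real recordings would need framing, voicing detection, and a calibrated reference level before values like these could be compared across sessions.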

후두음성 질환에 대한 인공지능 연구 (Artificial Intelligence for Clinical Research in Voice Disease)

  • 석준걸;권택균
    • 대한후두음성언어의학회지 / Vol. 33, No. 3 / pp. 142-155 / 2022
  • Diagnosis using voice is non-invasive and can be implemented with various voice recording devices; it can therefore be used as a screening or diagnostic support tool for laryngeal voice disease to help clinicians. Artificial intelligence algorithms such as machine learning, led by the latest deep learning technology, began with binary classification that distinguishes normal from pathological voices and have since contributed to improving the accuracy of multi-class classification of various types of pathological voices. However, no conclusions that can be applied in the clinical field have yet been reached. Most studies on pathological voice classification have used the sustained short vowel /ah/, which is relatively easier to work with than continuous or running speech. However, continuous speech has the potential to yield more accurate results because additional information can be obtained from the change in the voice signal over time. This review explains terms related to artificial intelligence research, surveys the latest trends in machine learning and deep learning algorithms, and introduces recent research results and limitations to provide future directions for researchers.

발성장애: 후두내시경 검사에서 놓치기 쉬운 성대점막질환 (Dysphonia : Vocal Fold Mucosal Lesions Easily Missed in Laryngoscopy)

  • 김한수
    • 대한후두음성언어의학회지 / Vol. 21, No. 1 / pp. 17-21 / 2010
  • Dysphonia is a medical term for voice disorders characterized by hoarseness, harshness, weakness, or even loss of voice; that is, any impairment in the ability to produce voice sounds using the vocal organs, particularly the larynx. The causes of dysphonia can be classified into two groups, organic and functional. Functional dysphonia includes spasmodic dysphonia, muscle tension dysphonia, mutational dysphonia, and conversion dysphonia. Laryngoscopic findings in these types of dysphonia are almost normal. Therefore, physicians should diagnose these diseases through careful history taking and a thorough understanding of the phonation pattern. Organic dysphonia is caused by anatomical problems in the larynx, especially on the vocal fold. Some lesions, however, are not easily found because they are too small or are located on the lower lip of the vibrating vocal fold. Laryngitis induced by laryngopharyngeal reflux, vascular lesions, sulcus vocalis, vocal atrophy including presbylaryngis, and mucosal tears are common lesions easily missed in laryngoscopy. Therefore, a high index of suspicion is necessary to avoid missing vocal fold mucosal lesions, and strobovideolaryngoscopy is indispensable in making the diagnosis.

Wav2vec을 이용한 오디오 음성 기반의 파킨슨병 진단 (Diagnosis of Parkinson's disease based on audio voice using wav2vec)

  • 윤희진
    • 디지털융복합연구 / Vol. 19, No. 12 / pp. 353-358 / 2021
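  • Parkinson's disease is the second most common degenerative brain disease in old age, after Alzheimer's disease. Its symptoms, such as hand tremor, slowed movement, and cognitive decline, reduce quality of life in daily living. Parkinson's disease is a condition whose progression can be slowed through early diagnosis. For early diagnosis, an algorithm was implemented that takes audio voice files as input, extracts features using wav2vec, and diagnoses the presence or absence of Parkinson's disease with deep learning (ANN). In experiments diagnosing Parkinson's disease from audio voice files, the accuracy was 97.47%. This was better than the results of diagnosing Parkinson's disease with existing neural networks. Applying wav2vec to the audio voice files simplified the experimental process and improved the experimental results.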

후두외근 과긴장에 대한 음성피로도 검사의 유용성 (Usefulness of Vocal Fatigue Index for Hypertension of Extrinsic Laryngeal Muscles)

  • 김지성;이동욱
    • 대한후두음성언어의학회지 / Vol. 32, No. 3 / pp. 124-129 / 2021
  • Background and Objectives: This study compares Vocal Fatigue Index (VFI) scores according to the presence or absence of extrinsic laryngeal muscle tension in hyperfunctional voice disorder, in order to confirm the usefulness of the VFI for hypertension of the extrinsic laryngeal muscles. Materials and Method: The subjects were 61 women diagnosed with hyperfunctional voice disorder (hypertension group, 41; non-hypertension group, 20). The author palpated the extrinsic laryngeal muscles to evaluate hypertension and classified it as present or absent. The voice measurements were jitter, shimmer, the Korean-Voice Handicap Index-10 (K-VHI-10), and the Korean-Vocal Fatigue Index (K-VFI). Voice measurements were compared according to the diagnosis and the presence of hypertension, only in patients with hyperfunctional voice disorder. Results: When the voice measurements were compared according to the presence or absence of hypertension, there was no significant difference in the acoustic variables, K-VHI-10, K-VFI-Total, or K-VFI-Fatigue, whereas K-VFI-Physical (p=0.006) and K-VFI-Rest (p=0.022) were significantly higher in the hypertension group. Conclusion: These results indicate that the hypertension group has more physical discomfort and less voice recovery than the non-hypertension group. This suggests that the K-VFI can measure the physical discomfort and limited voice recovery associated with hypertension of the extrinsic laryngeal muscles. The VFI can be used as one method for evaluating hypertension of the extrinsic laryngeal muscles in hyperfunctional voice disorder.

SVM을 이용한 음성 사상체질 분류 알고리즘 (Voice Classification Algorithm for Sasang Constitution Using Support Vector Machine)

  • 강재환;도준형;김종열
    • 사상체질의학회지 / Vol. 22, No. 1 / pp. 17-25 / 2010
  • 1. Objectives: Voice diagnosis has been used to classify individuals by Sasang constitution in SCM (Sasang Constitution Medicine) and to assess a person's health condition in TKM (Traditional Korean Medicine). In this paper, we proposed a new speech classification algorithm for Sasang constitution. 2. Methods: The algorithm is based on the SVM (Support Vector Machine) technique, a classification method that separates two distinct groups by finding an arbitrary nonlinear boundary in vector space. It showed high classification performance with a small training data set. We designed the algorithm with three SVM classifiers to classify voices into four groups: three constitutional groups and an additional indecision group. 3. Results: At optimal performance, 32.2% of the voice data were classified into the three constitutional groups, and 79.8% of those were grouped correctly. 4. Conclusions: This new classification method, which includes an indecision group, appears efficient compared with the standard classification algorithm that classifies only into three constitutional groups. A more thorough investigation of voice features is required to improve the efficiency of classification into Sasang constitution.
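
The abstract above does not describe how the three SVM outputs are combined, so the following is only a sketch of one plausible arrangement: three one-vs-rest classifiers, each producing a signed decision score, with a sample assigned to a group only when exactly one score clears a margin and sent to the indecision group otherwise. The function name, the placeholder labels `group_1` to `group_3`, the margin value, and the scores are all hypothetical; in practice the scores would come from trained SVMs.

```python
def classify_with_indecision(scores, margin=0.5):
    """Return the single confidently predicted group, or "indecision".

    scores maps each group label to the signed decision score of that
    group's one-vs-rest classifier (assumed arrangement, not the paper's).
    """
    confident = [label for label, score in scores.items() if score >= margin]
    return confident[0] if len(confident) == 1 else "indecision"


# Hypothetical decision scores for three voice samples.
clear_case = {"group_1": 1.2, "group_2": -0.8, "group_3": -1.1}
conflicting = {"group_1": 0.9, "group_2": 0.7, "group_3": -1.0}
weak = {"group_1": 0.2, "group_2": -0.1, "group_3": 0.1}

for sample in (clear_case, conflicting, weak):
    print(classify_with_indecision(sample))
```

Raising the margin in a rule like this sends more samples to the indecision group while making the remaining assignments more reliable, the same kind of trade-off between coverage (32.2%) and accuracy (79.8%) that the abstract reports.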