• 제목/요약/키워드: pathological speech

검색결과 44건 처리시간 0.023초

HOS 특징 벡터를 이용한 장애 음성 분류 성능의 향상 (Performance Improvement of Classification Between Pathological and Normal Voice Using HOS Parameter)

  • 이지연;정상배;최흥식;한민수
    • 대한음성학회지:말소리
    • /
    • 제66호
    • /
    • pp.61-72
    • /
    • 2008
  • This paper proposes a method to improve pathological and normal voice classification performance by combining multiple features such as auditory-based and higher-order features. Their performances are measured by Gaussian mixture models (GMMs) and linear discriminant analysis (LDA). The combination of multiple features proposed by the frame-based LDA method is shown to be an effective method for pathological and normal voice classification, with a 87.0% classification rate. This is a noticeable improvement of 17.72% compared to the MFCC-based GMM algorithm in terms of error reduction.

  • PDF

비대칭 4 질량 성대 모델에 의한 쉰목소리 분석 (Hoarse Speech Analysis Using Dissymmetric Four-Mass Model of Vocal Cords)

  • 장강의;진혜방;최태영
    • 한국음향학회지
    • /
    • 제14권5호
    • /
    • pp.94-101
    • /
    • 1995
  • 본 논문에서는 쉰 목소리 메커니즘 분석을 위한 4질량 성대 모델을 제안하였다. 쉰 목소리가 성대의 병리학적 변화에 기인한다는 것과 성문 파형이 성대의 움직임 상태를 반영한다는 사실에서, 병든 성대를 비대칭 구조이고 4질량형으로 가정하였다. 정상 목소리와 쉰 목소리에 대한 모델 변수들과 성문 파형을 분석하여 모델 변수와 병리학 사이의 관계를 검토하였다. 실험 결과 쉰 목소리의 음향 특징과 병리학간의 관계를 밝힐 수 있었고 후두 질병 진단과 쉰 목소리의 음질 향상에도 본 논문에서 제안한 방법이 사용될 수 있음을 알았다.

  • PDF

양성후두 질환 음성에 대한 여러 기존 피치검출 알고리즘의 성능 평가 (Performance Assessment of Several Established Pitch Detection Algorithms in Voices of Benign Vocal Fold Lesions)

  • 장승진;최성희;김효민;최홍식;윤영로
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2007년도 하계종합학술대회 논문집
    • /
    • pp.407-408
    • /
    • 2007
  • Robust pitch estimation is an important study in many areas of speech processing. In voice pathology, diverse statistics extracted form pitch were commonly used to test voice quality. In this study, we compared several established pitch detection algorithms (PDAs) for verification of adequacy of the PDAs. In the database of total pathological voices of 99 and normal voices of 30, an analysis of errors related with pitch detection was evaluated between pathological and normal voices, or among the types of pathological voices such as benign vocal fold lesions; polyp, nodule, and cysts. Consequently, it is required to survey the severity of tested voice in order to obtain accurate pitch estimates.

  • PDF

병리적 음성에 대한 언어습득 이후 인공와우이식 성인의 청지각적 변별특성과 중재 프로그램의 효과 (The Effect on Intervention Program and Auditory-Perceptual Discrimination Feature of Postlingual Cochlear Implant Adults about Pathological Voice)

  • 배인호;김근효;이연우;박희준;김진동;이일우;권순복
    • 말소리와 음성과학
    • /
    • 제7권2호
    • /
    • pp.9-17
    • /
    • 2015
  • In the present study, we investigated ability of recognition of auditory perception with regards to the quality of voice in postlingual CI adults and proposed a training program to improve within subject reliability. A prospective case-control study was conducted in adults with 7 postlingual deaf who received a CI surgery and 10 normal hearing controls. The pre and post test and training program included parameters of consensus auditory-perceptual evaluation of voice(CAPE-V) with pathological voice sample by using Alvin. In results of pre-post test for monitoring improvements of internal reliability for listeners via the training program, there was statistically significant difference in both test and group. There was statistically significant difference in internal reliability between pre-post test in the normal hearing group, the result was no significant in the CI group. The present study found that CI adults showed less ability in awareness of voice quality compared to normal hearing group. Also the training program improved pitch and loudness in CI adults.

후두질환에 대한 술전 술후 음성의 음향적 특성비교 분석 (Analysis and Comparisons of Acoustical Characteristics of Pathologic Voice before and after Surgery)

  • 김대현;조철우;백무진;왕수건
    • 음성과학
    • /
    • 제7권3호
    • /
    • pp.285-294
    • /
    • 2000
  • In this paper the acoustic characteristics of pathological voice, which are measured before and after surgical operation, are compared. This experiment is conducted for the purpose of predicting patients' speech after operation. The voices are recorded from the same patients. Jitter, shimmer and other parameters are. computed and their statistical characteristics are compared. Also spectral changes, such as formant frequency shift and spectral slope change, are compared. From the experimental results, it is verified that not only source characteristics but also vocal tract components vary. And this indicates that the modification of source parameters are not enough for the prediction. Also the result indicates that the operation causes change to both the physical shape of vocal folds and the manner of articulation.

  • PDF

음원 파라미터 모델과 인공신경망을 이용한 음성장애 검출 (Screening of Voice Disorder using Source Parameter Model and Artificial Neural Network)

  • 파벨시틸;조철우;미샤파벨
    • 음성과학
    • /
    • 제15권2호
    • /
    • pp.89-97
    • /
    • 2008
  • There is a number of clinical conditions that affect directly or indirectly the physical properties of the vocal folds and thereby the pressure waveforms of elicited sounds. If the relationships between the clinical conditions and the voice quality are sufficiently reliable, it should be possible to detect these diseases or disorders. The focus of this paper is to determine the set of features and their values that would characterize the speaker's state of vocal folds. To the extent that these features can capture the anatomical, physiological, and neurological aspects of the speaker they can be potentially used to mediate an unobtrusive approach to diagnosis. We will show a new approach to this problem supported with results obtained from two disordered voice corpora.

  • PDF

소음환경이 정상 및 병적음성에 미치는 영향 (The Effect of Noise on the Normal and Pathological Voice)

  • 홍기환;양윤수;김현기
    • 음성과학
    • /
    • 제9권4호
    • /
    • pp.27-38
    • /
    • 2002
  • The purpose of this article is to present the acoustic parameters (VOT, jitter, shimmer, vF0, vAm, NHR, SPI, VTI, DVB, DSH) for consonants (/pipi/, /$p^{h}ip^{h}i$/, /p'ip'i/) and sustained vowels (/a/, /e/, /i/) produced by normal subjects and dysphonia patients at two vocal effort(normal, high) by Lombard effect using 60dB white noise. Lombard effect indicates the vocal effort increase in noisy situation. At normal vocal effort, in general the acoustic parameter values of patients are greater than normal. And in noisy situation, significant decrease of acoustic values is seen in normal compared with in dysphonia patients. The clinical implication of this finding, the vocal quality in dysphonia is not compensated by vocal effort as well as normal subjects because of the inefficiency caused by abnormal vocal fold appearance and function. And with this result, we can counsel that the voice quality can not be improved as well as the patient expect.

  • PDF

Wavelet 변환과 신경회로망을 이용한 후두의 양성종양의 식별에 관한 연구 (Classification of Pathological Speech Signals Using Wavelet Transform and Neural Network)

  • 김대현
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1998년도 학술발표대회 논문집 제17권 2호
    • /
    • pp.395-398
    • /
    • 1998
  • 본 논문에서는 웨이브렛 변환에서 구해진 파라미터와 신경회로망을 이용하여 후두의 양성종양과 정상상태를 구분하는 실험을 행하였다. 식별 파라미터로는 웨이브렛변환으로부터 도출된 ECS 파라미터와 jitter, shimmer를 이용하였으며 신경회로망은 한 개의 은닉층을 갖는 다층구조 신경망을 이용하였다. 신경망의 입력으로는 세가지 파라미터의 조합을 두 개 또는 세 개를 입력하여 각각의 경우의 식별율을 조사하였다. 실험결과 75%에서 93%에 이르는 식별율을 얻었다.

  • PDF

장애음성의 분류방법에 관한 연구 (On the Classification of the Pathological Speech)

  • 김대현
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1998년도 제15회 음성통신 및 신호처리 워크샵(KSCSP 98 15권1호)
    • /
    • pp.388-391
    • /
    • 1998
  • jitter, shimmer 및 켑스트럼 방식의 음원분석에 의한 파라미터를 이용하여 장애음성을 진단, 식별하는 방법을 제안한다. 먼저 통계적 처리결과르 바탕으로 식별에 유효한 파라미터들을 선택하고 이들 파라미터들을 이용하여 최종 진단한다. 식별방법으로는 신경회로망을 이용한다. 입력파라미터로는 jitter, shimmer, HNRR을 사용한다. 신경회로망은 1 은닉층을 갖는 3- layer 신경회로망을 사용한다. 실험결과 효과적으로 정상음성과 장애음성의구분이 가능해졌다.

  • PDF

The Latency of Distortion Product Otoacoustic Emissions in Ears with Hearing Impairment

  • Lee, Jung-Hak;Cho, Soo-Jin;Kim, Jin-Sook
    • 음성과학
    • /
    • 제7권1호
    • /
    • pp.77-87
    • /
    • 2000
  • Distortion Product Otoacoustic Emissions (DPOAEs) can be measured in the external ear canal two fold: amplitude and latency, but most DPOAE studies deal with amplitude aspects. The purpose of this study was to investigate the latency of the 2f1-f2 DPOAEs in ears with hearing losses and to see if it could be a clinically useful method to distinguish normal from abnormal ears. For this purpose, DPOAE latency were measured as a function of frequency from 1 to 8 kHz in 30 ears with conductive and sensorineural hearing losses (SNHLs). DPOAEs were recorded with Otodynamic Analyzer ILO92. Results showed that the latency decreased as the frequency increased up to 8 kHz. The mean values of DPOAE latency for ears of SNHLs were shorter at all frequencies when they were compared to the mean values of normal ears. The latency in ears of conductive hearing losses was shorter than normal ears at the selective frequencies, as well. The results support the hypothesis that latency values are shorter in pathological ears.

  • PDF