• 제목/요약/키워드: normal voices

검색결과 46건 처리시간 0.024초

후두음성 질환에 대한 인공지능 연구 (Artificial Intelligence for Clinical Research in Voice Disease)

  • 석준걸;권택균
    • 대한후두음성언어의학회지
    • /
    • 제33권3호
    • /
    • pp.142-155
    • /
    • 2022
  • Diagnosis using voice is non-invasive and can be implemented through various voice recording devices; therefore, it can be used as a screening or diagnostic assistant tool for laryngeal voice disease to help clinicians. The development of artificial intelligence algorithms, such as machine learning, led by the latest deep learning technology, began with a binary classification that distinguishes normal and pathological voices; consequently, it has contributed in improving the accuracy of multi-classification to classify various types of pathological voices. However, no conclusions that can be applied in the clinical field have yet been achieved. Most studies on pathological speech classification using speech have used the continuous short vowel /ah/, which is relatively easier than using continuous or running speech. However, continuous speech has the potential to derive more accurate results as additional information can be obtained from the change in the voice signal over time. In this review, explanations of terms related to artificial intelligence research, and the latest trends in machine learning and deep learning algorithms are reviewed; furthermore, the latest research results and limitations are introduced to provide future directions for researchers.

동일 후적자가 산출하는 기관식도 발성($PROVOX^{(R)}$ 발성)과 식도 발성에 대한 음향학적 및 공기역학적 특성 비교 (The Comparison of the Acoustic and Aerodynamic Characteristics of $PROVOX^{(R)}$ Voice and Esophageal Voice Produced by the Same Laryngectomee)

  • 표화영;최홍식;임성은;최성희
    • 음성과학
    • /
    • 제5권1호
    • /
    • pp.121-139
    • /
    • 1999
  • Our experimental subject was a laryngectomee who had undergone total laryngectomy with $PROVOX^{(R)}$ insertion, and learned esophageal speech after the surgery, so he could produce both $PROVOX^{(R)}$ voice and esophageal voice. With this subject's production of $PROVOX^{(R)}$ and esophageal voice, we are to compare the acoustic and aerodynamic characteristics of the two voices, under the same physical conditions of the same person. As a result, the fundamental frequency of esophageal voice was 137.2 Hz, and that of $PROVOX^{(R)}$ was 97.5 Hz. $PROVOX^{(R)}$ voice showed lower jitter, shimmer and NHR than esophageal voice, which means that $PROVOX^{(R)}$ voice showed better voice quality than esophageal voice. In spectrographic analysis, the formation of formants and pseudoformants were more distinct in esophageal voice and several temporal aspects of acoutic features such as VOT and closure duration were more similar with normal voice in $PROVOX^{(R)}$ voice. During the sentence utterance, esophageal voice showed longer pause or silence duration than $PROVOX^{(R)}$ voice. Maximum phonation time and mean flow rate of $PROVOX^{(R)}$ voice were much longer and larger than esophageal voice, but mean and range of sound pressure level, subglottic pressure and voice efficiency were similar in the two voices. Glottal resistance of esophageal voice was much larger than $PROVOX^{(R)}$ voice which showed still larger glottal resistance than normal voice.

  • PDF

학령기 인공와우 이식 아동과 일반 건청 아동의 음악 선호도에 대한 비교 연구 (A Comparative Study on Music Preference between School-Aged Children with Cochlear Implant and Normal Hearing)

  • 황선영
    • 인간행동과 음악연구
    • /
    • 제8권2호
    • /
    • pp.47-64
    • /
    • 2011
  • 본 연구는 학령기의 인공와우를 이식한 아동과 건청 아동의 음악 선호도에 대해 비교 연구하는데 목적이 있다. 연구 대상은 인공와우를 이식 받은 지 최소 2년 이상 된 아동 12명과 안양에 위치한 A 초등학교에 재학 중인 1~5학년 학생 155명을 무작위로 선정하여 1:1 설문조사를 시행하였다. 연구 결과 음성 선호도에서 인공와우 이식 아동그룹은 아동 목소리를 선호한다고 응답한 반면 건청 아동그룹은 어른 목소리를 선호한다고 응답하였고, 성별에 따른 음성 선호도는 두 그룹 모두 여자 목소리를 선호한다고 응답하였으며, 음고에 따른 선호도 역시 두 그룹 모두 높은 소리를 선호한다고 응답하였다. 연주 형태에 따른 선호도는 인공와우 이식 아동과 건청 아동 두 그룹 모두 악기와 목소리가 공존하는 소리를 선호한다고 응답하였다. 장르별 음악 선호도는 리커드(Likert scale) 5점 척도에서 인공와우 이식 아동그룹은 민요를 선호하는 반면, 건청 아동그룹은 대중음악과 TV음악을 선호하는 것으로 나타났다. 이러한 결과들은 인공와우의 특성과 음악 환경에 영향을 받아 나타난 결과로 음악 환경에 의해 인공와우를 이식 받은 아동의 음악 선호도가 변화 될 수 있음을 나타낸다. 본 연구는 인공와우를 이식한 아동과 건청 아동의 음악 선호도를 비교 분석한 것으로 인공와우이식 아동의 청각 재활에 기초 자료가 될 수 있을 것이다.

Electroglottography를 사용한 한국어 폐쇄자음의 특성 및 임상적 적용 (Characteristics of Korean Stop Consonants by Using Electroglottography and Its Clinical Application)

  • 채윤정;김현기;홍기환
    • 음성과학
    • /
    • 제4권2호
    • /
    • pp.157-177
    • /
    • 1998
  • An electroglottography (EGG) was used to investigate the function of the vocal folds during their vibration. In this study, four Korean native speakers and 10 vocal polyp patients were selected. To investigate the dynamic change of EGG waveforms for the three-way distinction of Korean stops, a DSP-Sona graph model 5500, a Rino- Laryngeal stroboscope, a CSL model 4300B and a Laryngograph were used. An EGG Model 4338 was used to exam the vocal polyp of patients' voices during high, low, comfortable pitch production. The purpose of this study is to investigate the characteristics of Korean stop consonants in relation to pitch and to observe laryngeal movement during vocal fold vibration and speech production. The basic data accumulated during this research can be applied in clinical treatment. The results are as follows: on the Korean stop consonants, the aspirated stop is the highest in the GOT and PC1. On the angle of vowel contour, the angle of lenis is smaller than the angle of heavily aspirated and glottalized stops. The fundamental frequency is lowest at the lenis stop, In vocal polyp patients', the low pitch range is smaller than in normal speakers'. The pitch break and the vocal fry were observed. The jitter and OQ value are higher in vocal polyp patients than in those of normal speakers'.

  • PDF

The Impact of COVID-19 Pandemic on Indonesia's Economy and Alternative Prospects for Untact Society

  • Lee, Kyungchan
    • 수완나부미
    • /
    • 제13권2호
    • /
    • pp.7-35
    • /
    • 2021
  • This research is an attempt to understand the economic and social consequences that are occurring in Indonesia due to the spread of COVID-19. Indonesia, which has maintained solid economic growth since the inauguration of President Jokowi's government, is also experiencing difficulties to deal with unexpected COVID-19 pandemic as the global economic turmoil has had a very significant impact on its economy. The economic impact of COVID-19 can be felt, starting from the phenomenon of panic buying, the free fall of the stock price index, the depreciation of the Rupiah against the Dollar, sluggish activities in the processing industry, and ultimately it has an impact on slowing economic growth. Various policies and measures have been taken by the Indonesian government to minimize the negative impact caused by the COVID-19 pandemic on the economy. One such area is electronic commerce business or e-commerce that witnessed a vast increase of online and non-cash transaction amid rising voices that the country needs to prepare for the advent of a new economic system, the so-called New Normal era. The Covid-19 pandemic will temporarily slow economic growth and delay some development projects and policy initiatives as the Indonesian government diverts capital from infrastructure development to help respond to the crisis. However, the Jokowi administration's efforts for continuous reform are expected to accelerate the transition to the digital economy.

외 후두부 길이와 발화기본주파수 간의 상관관계 (Correlation Between the External Laryngeal Length and the Habitual Speaking Fundamental Frequency)

  • 남도현;임성수;최홍식
    • 말소리와 음성과학
    • /
    • 제1권4호
    • /
    • pp.187-193
    • /
    • 2009
  • For this study, the external laryngeal lengths of 9 females and 9 males with normal voices were measured together with their ages, heights, and weights, and after they read aloud sentences for 3 minutes, their habitual speaking fundamental frequencies, speaking low pitches, speaking high pitches, and vocal fold closed quotients were measured. The Spearman rank correlation analysis on these data showed a significant negative correlation between the external laryngeal length and the habitual speaking fundamental frequency for both females and males, a significant negative correlation between the external laryngeal length and the speaking high pitch for only males, a significant negative correlation between the external laryngeal length and the speaking low pitch for both females and males, and a significant positive correlation between the external laryngeal length and the vocal fold closed quotient for only males.

  • PDF

Dr. Speech Science의 음성합성프로그램을 이용하여 합성한 정상음성과 병적음성(Pathologic Voice)의 음향학적 분석 (Acoustic Analysis of Normal and Pathologic Voice Synthesized with Voice Synthesis Program of Dr. Speech Science)

  • 최홍식;김성수
    • 대한후두음성언어의학회지
    • /
    • 제12권2호
    • /
    • pp.115-120
    • /
    • 2001
  • In this paper, we synthesized vowel /ae/ with voice synthesis program of Dr. Speech Science, and we also synthesized pathologic vowel /ae/ by some parameters such as high frequency gain (HFG), low frequency gain(LFG), pitch flutter(PF) which represents jitter value and flutter of amplitude(FA) which represents shimmer value, and grade ranked as mild, moderate and severe respectively. And then we analysed all pathologic voice by analysis program of Dr. Speech Science. We expect that this synthesized pathologic voices are useful for understanding the parameter such as noise, jitter and shimmer and feedback effect to patient with voice disorder.

  • PDF

후두미세수술 전후 /아/의 음향적 특성 비교 (Comparative Study on the Acoustic Characteristics of the Korean Vowel /a/ before and after LMS)

  • 황연시;성철재
    • 대한음성학회지:말소리
    • /
    • 제67호
    • /
    • pp.33-60
    • /
    • 2008
  • The aim of this study is to show the differences in acoustic parameters between a pathological voice /a/ caused by vocal polyp and a normal voice /a/ produced after LMS (Laryngeal Microscopic Surgery). It was expected that voices of two kinds could be analyzed effectively in terms of HNR in specific frequency bands than in all frequency bands. For this study, 10 patients' voice were recorded before and after LMS and then were manipulated in terms of four acoustic parameter. It was found out that (a) frequency bands of 500Hz in the range of 1,000Hz to 4,000Hz were very useful to obtain HNR values; (b) frequency bands in the range of 1,248Hz to 5,500Hz on a log scale were very useful to obtain HNR values; (c) F0 dropped after LMS but not significantly; (d) the bandwidth of the second formant (B2) decreased significantly after LMS, while that of the first formant (B1) decreased after LMS but not significantly.

  • PDF

변성발성장애 환자에 대한 음성치료의 효과 (The Efficiency of Voice Therapy for the Patients with Mutational Falsetto)

  • 표화영
    • 대한후두음성언어의학회지
    • /
    • 제9권2호
    • /
    • pp.134-141
    • /
    • 1998
  • Mutational falsetto is a kind of voice disorders due to the failure to acquire proper low-pitched voice during the puberty. The patients with mutational falsetto can produce the normal low-pitched voice by the surgical treatment, like the type III-thyroplasty, or the voice therapy. The present study is, focusing on the latter treatment, to consider the efficiency of voice therapy for the mutational falsetto. The 7 patients who were diagnosed as mutational falsetto by the laryngologists, and treated by the voice therapist were selected as subjects. Their voices of pretherapy and posttherapy were analyzed on the aspects of acoustics and aerodynamics. Acoustic analysis was done by the MDVP(Multidimensional Voice Program) of CSL(Computerized Speech Lab, Kay Elemetrics, Co.), and aerodynamic analysis, by the Maximum Sustained Phonation of Aerophone II(Kay Elemetrics, Co.). By these measurements, we could find that fundamental frequency(F0) was significantly lowered, on the average, 65Hz. Maximum phonation time(MPT) was increased 4.57 second, and shimmer was decreased 1.644%, respectively, and each changes was statistically significant, too. On the average, jitter was decreased 0.499%, mean flow rate(MFR) was decreased 27.71ml/sec, and NHR was increased 0.023 which was the only parameter not showing improvement. But the changes of jitter, MFR and NHR were not statistically significant.

  • PDF

병적음성에 대한 지속 모음 및 이음절어 발화시 나타나는 음향학적 차이에 대한 연구 (A Study of Acoustic Characteristics of Two Syllables Words and Sustained Vowel)

  • 채윤정;김범규;홍기환
    • 대한후두음성언어의학회지
    • /
    • 제11권1호
    • /
    • pp.104-112
    • /
    • 2000
  • An evaluation of voice disorder has two methods. One is a perceptual analysis and the other is an acoustic analysis. All of these methods are just focused on sustained vowel. The analysis of conversational speech levels in voice disorder has not been achieved enough. The purpose of the present study is to compare two syllable words and sustained vowel in the vocal polyp patients and normal male speakers and to be applied on the vocal assessment and the voice therapy as a basic data. fifteen male patients with vocal polyp were the subject group. Fifteen healthy male were the control group for this study. The voices of the subject and control group, saved in MDVP of CSL were analyzed by its own analysis program. As a results, in subject group, the voice qualities between the vowel following lenis stop and the sustained vowel had no differences, and the voice qualities were different significantly between the vowel following heavily aspirated stop and the sustained vowel. In the control group the vowel fllowing stops and sustained vowel had also many differences in their voice quality, especially significant between the vowel following glottal stop and e sustained vowel.

  • PDF