• Title/Abstract/Keywords: Korean speech

Search results: 5,286 (processing time: 0.026 seconds)

음성인식프로그램을 이용한 무후두 음성의 말 명료도와 병적 음성의 수술 전후 개선도 측정 (Speech Intelligibility of Alaryngeal Voices and Pre/Post Operative Evaluation of Voice Quality using the Speech Recognition Program (HUVOIS))

  • 김한수;최성희;김재인;임재열;최홍식
    • 대한후두음성언어의학회지 / Vol. 15, No. 2 / pp. 92-97 / 2004
  • Background and Objectives: The purpose of this study was to objectively evaluate pre- and postoperative voice quality and the intelligibility of alaryngeal voices using the speech recognition program HUVOIS. Materials and Methods: Two laryngologists and one speech pathologist rated 'G', 'R', and 'B' on the GRBAS scale and rated speech intelligibility on the NTID rating scale, using a standard paragraph. Acoustic estimates such as jitter, shimmer, and HNR were also obtained with Lx Speech Studio. Results: The speech recognition rate did not differ significantly between pre- and postoperative pathological voice samples, although voice quality (G, B) and acoustic values (jitter, HNR) improved significantly after surgery. Among alaryngeal voices, the reed-type electrolarynx 'Moksori' scored highest in both speech intelligibility and speech recognition rate, whereas esophageal speech scored lowest. A correlation between speech intelligibility and speech recognition rate was found for alaryngeal voices but not for pathological voices. Conclusion: The current study did not show that the speech recognition program HUVOIS, used over a telephone-based program, is an objective and efficient method for assisting the subjective GRBAS scale.

언어치료에 대한 장애아동 어머니의 이해도와 상담 만족도 (A study of the understanding about speech therapy and the satisfaction about counseling for mothers who have children with disability)

  • 박진원
    • 한국임상보건과학회지 / Vol. 9, No. 1 / pp. 1469-1477 / 2021
  • Purpose: This study investigates mothers' understanding of speech therapy and their satisfaction with speech therapy counseling according to the characteristics of mothers of children with disabilities, and seeks clinical instruction methods for providing effective speech therapy by identifying the correlation between the two variables. Methods: A survey was conducted among 78 mothers of children with disabilities who use university speech therapy labs. Seventeen questions measured the degree of understanding of speech therapy, and 24 questions measured the degree of satisfaction with speech therapy counseling. Results: First, mothers with a higher education level showed a higher degree of understanding of language (p<0.01). Second, mothers with a higher education level showed lower satisfaction with the counseling process (p<0.5). With respect to job status, mothers who have a job showed higher satisfaction with counseling time (p<0.5). Third, regarding the relationship between understanding and satisfaction, mothers with a higher degree of understanding of language, speech therapy tools, and speech therapy areas showed higher satisfaction with counseling. Conclusions: This study showed the need to understand the subjects' needs accurately and to communicate actively with mothers. In addition, concrete and varied methods should be devised to increase understanding of speech therapy and satisfaction with counseling about the clinical practice environment and the language therapy process.

자동차 주행 환경에서의 음성 전달 명료도와 음성 인식 성능 비교 (Comparison of Speech Intelligibility & Performance of Speech Recognition in Real Driving Environments)

  • 이광현;최대림;김영일;김봉완;이용주
    • 대한음성학회지:말소리 / No. 50 / pp. 99-110 / 2004
  • The normal transmission characteristics of sound are difficult to obtain in a running car because of various noises and structural factors. The resulting channel distortion of the original source sound recorded by microphones seriously degrades speech recognition performance in real driving environments. In this paper, we analyze the degree of intelligibility under various channel-dependent sound distortion conditions according to driving speed, using the speech transmission index (STI), and compare the STI with speech recognition rates. We examine the correlation between intelligibility measures, which depend on sound pick-up patterns, and speech recognition performance, and thereby consider the optimal microphone location in a single-channel environment. In our experiments, we find a high correlation between STI and speech recognition rates.

적응 콤 필터링을 이용한 이동 통신 환경에서의 강인한 음성 인식 (Robust Speech Recognition using Adaptive Comb Filtering in Mobile Communication Environment)

  • 박정식;정규준;오영환
    • 대한음성학회지:말소리 / No. 46 / pp. 65-76 / 2003
  • In this paper, we employ adaptive comb filtering for effective noise reduction in a mobile communication environment. Adaptive comb filtering is a well-known noise reduction method, but it requires a correct pitch period and must be applied only to voiced speech frames. To satisfy these requirements, we use two kinds of information extracted from speech packets: the pitch period information measured precisely by a speech coder, and the frame rate information related to the speech-or-silence decision for each frame. Experiments on a speech recognition system confirm the efficiency of this method. Feature parameters obtained with this method give better performance in noisy environments than those extracted directly from the output speech.
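
The comb-filtering idea in the abstract above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes a three-tap comb with hypothetical weights (0.25, 0.5, 0.25), and it takes the per-frame pitch periods and voiced flags as plain lists, standing in for the values the abstract says are extracted from the speech packets. All function and parameter names are invented for this sketch.

```python
def comb_filter_frame(signal, start, end, pitch_period, weights=(0.25, 0.5, 0.25)):
    """Average each sample in [start, end) with the samples one pitch period
    before and after it. Taps that fall outside the signal are skipped and the
    remaining weights are renormalized."""
    out = []
    n_total = len(signal)
    for n in range(start, end):
        acc = 0.0
        norm = 0.0
        for k, w in zip((-1, 0, 1), weights):
            idx = n + k * pitch_period
            if 0 <= idx < n_total:
                acc += w * signal[idx]
                norm += w
        out.append(acc / norm)
    return out


def adaptive_comb_filter(signal, frame_len, pitch_periods, voiced_flags):
    """Filter each voiced frame with its own pitch period (in samples).
    Frames not flagged as voiced are passed through unchanged."""
    output = list(signal)
    for i, (period, voiced) in enumerate(zip(pitch_periods, voiced_flags)):
        start = i * frame_len
        end = min(start + frame_len, len(signal))
        if voiced and period > 0:
            # Always read from the original signal, never from filtered output.
            output[start:end] = comb_filter_frame(signal, start, end, period)
    return output
```

Because the taps are spaced exactly one pitch period apart, a perfectly periodic voiced segment passes through unchanged, while components that do not repeat at the pitch period are attenuated.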

변곡점 및 단구간 에너지평가에 의한 음성의 천이구간 특징분석 (Analysis of Transient Features in Speech Signal by Estimating the Short-term Energy and Inflection points)

  • 최일홍;장승관;차태호;최웅세;김창석
    • 음성과학 / Vol. 3 / pp. 156-166 / 1998
  • In this paper, I propose a segmentation method based on estimating the inflection points and the average magnitude energy of speech signals. The proposed method not only resolves the problems of segmentation by zero-crossing rate but also allows the features of the transient period to be estimated after separating the starting point and the transient period that precede the steady state. In experiments with monosyllabic speech, the starting and ending points of the speech signals were located exactly, even when the speech samples had a D.C. level. In addition, features such as the length of the transient period, the short-term energy, and the frequency characteristics could be compared across speech signals.
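
As a rough illustration of the short-term energy part of the abstract above, the sketch below computes an average magnitude per frame and marks the first and last frames that exceed a threshold. It is not the paper's method: the inflection-point analysis is omitted, and the D.C. removal by subtracting the global mean, the non-overlapping frames, and the fixed threshold are all assumptions made for this example. The function names are hypothetical.

```python
def average_magnitude(signal, frame_len):
    """Mean absolute amplitude of each non-overlapping frame, computed after
    subtracting the global mean so that a constant D.C. level does not count
    as energy. A trailing partial frame is ignored."""
    dc = sum(signal) / len(signal)
    energies = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        energies.append(sum(abs(s - dc) for s in frame) / frame_len)
    return energies


def find_endpoints(signal, frame_len, threshold):
    """Return (start_sample, end_sample) spanning the first through last frame
    whose average magnitude exceeds the threshold, or None if no frame does."""
    energies = average_magnitude(signal, frame_len)
    active = [i for i, e in enumerate(energies) if e > threshold]
    if not active:
        return None
    return active[0] * frame_len, (active[-1] + 1) * frame_len
```

The returned end index is exclusive, so `signal[start:end]` yields the detected segment at frame resolution.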

비인강 폐쇄부전 환자에서 발음보조장치의 치료효과 (The Effect of Speech Aids in Velopharyngeal Incompetency Patients)

  • 고승오;신효근;김현기;홍기환;서정환;고도흥
    • 음성과학 / Vol. 3 / pp. 57-69 / 1998
  • Velopharyngeal function refers to the combined activity of the soft palate and pharynx in closing and opening the velopharyngeal port to the required degree. In normal speech, during the production of oral consonant sounds, elevation of the soft palate, together with the superior constrictor muscle, closes off the oropharynx from the nasopharynx. Inadequate velopharyngeal function caused by congenital or acquired insufficiency or incompetency may result in abnormal speech characterized by hypernasality, nasal emission, and decreased speech intelligibility due to weak consonant production. A speech aid is often helpful in improving the speech of individuals with velopharyngeal incompetency. In this article, the pathogenesis and treatment of velopharyngeal incompetency are discussed, and a speech aid appliance constructed for a patient is described.

연속구어 내 발성 종결-개시의 음향학적 특징 - 말더듬 화자와 비말더듬 화자 비교 - (Acoustic Features of Phonatory Offset-Onset in the Connected Speech between a Female Stutterer and Non-Stutterers)

  • 한지연;이옥분
    • 음성과학 / Vol. 13, No. 2 / pp. 19-33 / 2006
  • The purpose of this paper was to examine the acoustic characteristics of the phonatory offset-onset mechanism in the connected speech of female adults with stuttering and with normal nonfluency. The phonatory offset-onset mechanism refers to the laryngeal articulatory gestures required to mark word boundaries in the phonetic contexts of connected speech. This mechanism comprised 7 patterns based on the speech spectrogram. This study examined the acoustic features of connected speech produced by female adults with stuttering (n=1) and with normal nonfluency (n=3). Speech tokens in V_V, V_H, and V_S contexts were selected for analysis. Speech samples were recorded with Sound Forge, and the spectrographic analysis was conducted using Praat. Results revealed that the female speaker who stutters (block type) exhibited more laryngealization gestures in the V_V context. In both the V_H and V_S contexts, the laryngealization gesture was characterized more by a complete glottal stop or glottal fry. The results were discussed from theoretical and clinical perspectives.

말속도가 인공와우 청각장애인의 문장지각에 미치는 영향 (Effects of Speech Rate on the Sentence Perception of Adults with Cochlear Implantation)

  • 신수진;신지철;윤미선;김덕용
    • 음성과학 / Vol. 13, No. 2 / pp. 47-58 / 2006
  • People tend to control their speech rate to help listeners with listening problems, such as people with hearing impairment. The aim of this study was to investigate the effects of speech rate on sentence perception in 10 adults with cochlear implants. The speech samples comprised 42 sentences at normal, slow, and very slow rates, focusing on overall duration and on vowel or pause duration. The subjects listened to the speech and wrote down what they heard. Each correct syllable of the content words in a sentence was counted to obtain the score, and partial points were given for incomplete syllables. The results were as follows: 1. Changes in speech rate had some influence on the sentence perception scores of the cochlear implant users. 2. In the slow pause condition, the controlled speech rate had a positive effect on the perception score.

확률적 목표 음성 검출을 통한 다채널 입력 기반 음성개선 (Probabilistic Target Speech Detection and Its Application to Multi-Input-Based Speech Enhancement)

  • 이영재;김수환;한승호;한민수;김영일;정상배
    • 말소리와 음성과학 / Vol. 1, No. 3 / pp. 95-102 / 2009
  • In this paper, an efficient target speech detection algorithm is proposed to improve the performance of multi-input speech enhancement. Using the normalized cross-correlation value between two selected channels, the proposed algorithm estimates the probability distribution function of that value from the pure noise interval. Log-likelihoods are then calculated from this function and the normalized cross-correlation value to detect the target speech interval precisely. The detection results are applied to a generalized sidelobe canceller-based algorithm. Experimental results show that the proposed algorithm significantly improves speech recognition performance and signal-to-noise ratios.
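
The detection steps named in the abstract above might look roughly like the sketch below. Several choices here are assumptions rather than details from the paper: the cross-correlation is taken at zero lag, the noise-interval distribution is modeled as a single Gaussian, and a frame is flagged as target speech when its log-likelihood under that noise model falls below a hand-picked threshold. All names are hypothetical.

```python
import math


def normalized_cross_correlation(x, y):
    """Zero-lag normalized cross-correlation of two equal-length frames."""
    num = sum(a * b for a, b in zip(x, y))
    den = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    return num / den if den > 0 else 0.0


def fit_gaussian(values):
    """Mean and (floored) variance of the values seen in the noise-only interval."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return mean, max(var, 1e-6)


def log_likelihood(value, mean, var):
    """Log of the Gaussian density N(value; mean, var)."""
    return -0.5 * (math.log(2 * math.pi * var) + (value - mean) ** 2 / var)


def detect_target_frames(ncc_values, noise_ncc_values, threshold):
    """Flag frames whose correlation value is unlikely under the noise model."""
    mean, var = fit_gaussian(noise_ncc_values)
    return [log_likelihood(v, mean, var) < threshold for v in ncc_values]
```

In use, `noise_ncc_values` would come from frames known to contain only noise, and the resulting flags would gate whatever enhancement stage follows.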

PROSODY CONTROL BASED ON SYNTACTIC INFORMATION IN KOREAN TEXT-TO-SPEECH CONVERSION SYSTEM

  • Kim, Yeon-Jun;Oh, Yung-Hwan
    • 한국음향학회:학술대회논문집 / 한국음향학회, 1994, Fifth Western Pacific Regional Acoustics Conference, Seoul, Korea / pp. 937-942 / 1994
  • A text-to-speech (TTS) conversion system can convert any words or sentences into speech. To synthesize speech as human beings produce it, careful prosody control, including intonation, duration, accent, and pause, is required. It helps listeners understand the speech clearly and makes the speech sound more natural. In this paper, a prosody control scheme that makes use of function-word information is proposed. Among the many factors of prosody, intonation, duration, and pause are closely related to syntactic structure, and their relations have been formalized and embodied in the TTS system. To evaluate the speech synthesized with the proposed prosody control, the Mean Opinion Score (MOS) method, a subjective evaluation method, was used. The synthesized speech was tested on 10 listeners, and each listener scored the speech between 1 and 5. The evaluation experiments show that the proposed prosody control helps the TTS system synthesize more natural speech.
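
For readers unfamiliar with the MOS evaluation mentioned in the abstract above, the score is simply the arithmetic mean of the listeners' ratings. The sketch below assumes one rating per listener on the 1 to 5 scale the abstract describes; the function name is hypothetical and no ratings from the paper are reproduced here.

```python
from statistics import mean


def mean_opinion_score(ratings):
    """Arithmetic mean of listener ratings, each required to lie in [1, 5]."""
    if not ratings:
        raise ValueError("at least one rating is required")
    for r in ratings:
        if not 1 <= r <= 5:
            raise ValueError(f"rating out of range: {r}")
    return mean(ratings)
```

Comparing the MOS of two synthesis conditions rated by the same listeners is the usual way such a score is put to use.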
