• 제목/요약/키워드: Speech improvement

검색결과 610건 처리시간 0.021초

다계통위축증 환자를 대상으로 한 마비말장애 집중 치료의 효과 (Efficacy of intensive treatment of dysarthria for people with multiple system atrophy)

  • 박영미
    • 말소리와 음성과학
    • /
    • 제10권4호
    • /
    • pp.163-171
    • /
    • 2018
  • A mixed dysarthria with combinations of hypokinetic, ataxic, and spastic components is a common clinical feature of multiple system atrophy (MSA). Due to the rapid progress of dysarthria after diagnosis, people with MSA experience difficulty with verbal communication, which eventually affects their quality of life negatively. In this study, SPEAK $OUT!^{(R)}$, an intensive 1:1 treatment of dysarthria for improving functional communicative ability, was provided to twelve people with MSA. To evaluate the efficacy of SPEAK $OUT!^{(R)}$ in people with MSA, aerodynamic, acoustic, and perceptual analyses were conducted. Pre-and post-therapy data included maximum phonation time, vocal intensity, and fundamental frequency during /a/ sustained phonation and passage reading; frequency range between high /a/ and low /a/ phonation; jitter, shimmer, and HNR for vocal quality; speech rate during passage reading; and perceptual evaluation scores for articulation precision and intonation. The participants achieved statistically significant improvement in vocal intensity, pitch range, vocal quality, speech rate, and speech intelligibility. In conclusion, SPEAK $OUT!^{(R)}$ is a feasible treatment for people with MSA to efficaciously improve their speech ability.

복식호흡 훈련과 Self Voice Feedback 프로그램이 성대결절 환자의 음성개선에 미치는 효과 (Effects of Abdominal Respiration and Self Voice Feedback Therapy on the Voice Improvement of Patients with Vocal Nodules)

  • 권순복;왕수건;양병곤;전계록
    • 음성과학
    • /
    • 제13권3호
    • /
    • pp.133-149
    • /
    • 2006
  • This study attempted to compare acoustic parameters, physiological observation and perceptual evaluation values obtained from the treatment and control groups in order to find out which of the self voice feedback therapies was better and which methods to train them were more effective. The experimental group carried out various self voice feedback therapies while the control group did only vocal hygiene. The acoustic measurement and voice manipulation for providing the patients visual, auditory feedback were done by a speech analysis software, Praat. The authors designed vocal hygiene, abdominal respiration and Praat self voice feedback therapies and applied them to 15 patients while applying only one vocal hygiene to 15 of the control group. For the purpose of examining the degree of their voice improvement after the treatment, pre- mid- and final evaluations were made for the two groups at the beginning, the 6th week and immediately after the 8th treatment session. Results of this study were as follows: The treatment group showed much improvement after receiving the voice treatment. In particular, acoustical and physiological indices from the optical endoscopy, pitch variation(Jitter), amplitude variation (Shimmer), maximum phonation time(MPT), and psychoacoustic evaluation showed statistically significant improvements over the control groups.

  • PDF

음성 합성 시스템의 품질 향상을 위한 한국어 문장 기호 전처리 시스템 (Korean Sentence Symbol Preprocess System for the Improvement of Speech Synthesis Quality)

  • 이호준
    • 한국컴퓨터정보학회논문지
    • /
    • 제20권2호
    • /
    • pp.149-156
    • /
    • 2015
  • 본 논문에서는 한국어 문장 기호의 처리를 통해 자연스러운 음성 합성 결과를 생성하는 방법에 대해서 논의한다. 이를 위해 한국어 위키피디아 문서를 분석하여 문장 기호의 사용을 8가지 형태로 분류하고, 11개의 정규표현식 규칙으로 문장 기호를 처리하는 방안을 제시한다. 그 결과 63,000 문장에 대해 56%의 정확도와 71.45%의 재현율을 달성하였으며, 문장 기호 처리 결과를 SSML 기반의 음성 합성 표현으로 변환하여 음성 합성 결과의 품질을 향상시키는 방법을 제안한다.

음성 인식용 데이터베이스 검증시스템을 위한 새로운 음성 인식 성능 지표 (A New Speech Quality Measure for Speech Database Verification System)

  • 지승은;김우일
    • 한국정보통신학회논문지
    • /
    • 제20권3호
    • /
    • pp.464-470
    • /
    • 2016
  • 본 논문에서는 음성의 특성 지표를 이용한 음성 인식용 데이터베이스 검증 시스템의 개발 내용을 소개하고 이 시스템의 핵심 기술인 음성 특성 지표 추출 알고리즘을 설명한다. 선행 연구에서는 본 시스템에 필요한 효과적인 음성 인식 성능 지표를 생성하기 위해 대표적인 음성 인식 성능 지표인 단어 오인식률(Word Error Rate, WER)과 상관도가 높은 여러 가지 음성 특성 지표들을 조합하여 새로운 성능 지표를 생성하였다. 생성된 음성 인식 성능 지표는 다양한 잡음 환경에서 각 음성 특성 지표를 단독으로 사용할 때보다 단어 오인식률과 높은 상관도를 나타내어 음성 인식 성능을 예측하는데 효과적임을 입증 하였다. 본 실험에서는 선행 연구에서 조합에 사용한 이차적인 음성 인식기에서 추출된 음향 모델 확률 값을 GMM(Gaussian Mixture Model) 음향 모델 확률 값으로 대체해 조합함으로써 시스템 구축 시 다른 음성 인식기에 대한 의존성을 감소시킨다.

HMM 기반 한국어 음성합성에서의 화자적응 방식 성능비교 및 지속시간 모델 개선 (Performance Comparison and Duration Model Improvement of Speaker Adaptation Methods in HMM-based Korean Speech Synthesis)

  • 이혜민;김형순
    • 말소리와 음성과학
    • /
    • 제4권3호
    • /
    • pp.111-117
    • /
    • 2012
  • In this paper, we compare the performance of several speaker adaptation methods for a HMM-based Korean speech synthesis system with small amounts of adaptation data. According to objective and subjective evaluations, a hybrid method of constrained structural maximum a posteriori linear regression (CSMAPLR) and maximum a posteriori (MAP) adaptation shows better performance than other methods, when only five minutes of adaptation data are available for the target speaker. During the objective evaluation, we find that the duration models are insufficiently adapted to the target speaker as the spectral envelope and pitch models. To alleviate the problem, we propose the duration rectification method and the duration interpolation method. Both the objective and subjective evaluations reveal that the incorporation of the proposed two methods into the conventional speaker adaptation method is effective in improving the performance of the duration model adaptation.

화자적응화 연속음성 인식 시스템의 구현에 관한 연구 (A Study on Realization of Continuous Speech Recognition System of Speaker Adaptation)

  • 김상범;김수훈;허강인;고시영
    • 한국음향학회지
    • /
    • 제18권3호
    • /
    • pp.10-16
    • /
    • 1999
  • 본 연구에서는 소량의 음성 데이터만으로 적응화가 가능한 MAPE(최대사후확률추정)을 이용한 연속음성 인식시스템 개발에 대해 연구하였다. 음절단위 모델을 구축한 후 적응화 하고자 하는 화자의 데이터를 연결학습법과 Viterbi 알고리즘으로 음절단위의 추출을 자동화 한 후 MAPE로 적응화하였다. 자동차 제어문에 대해 화자 적응화한 경우의 인식률(O(n)DP인 경우)은 77.18%로 적응화 전의 결과보다 약 6%향상되었다.

  • PDF

A Study of the Effects of Similarity on L2 Phone Acquisition: An Experimental Study of the Korean Vowels Produced by Japanese Learners

  • Kwon, Sung-Mi
    • 음성과학
    • /
    • 제14권1호
    • /
    • pp.93-103
    • /
    • 2007
  • The aims of this study were to examine the acoustic features of Korean and Japanese vowels, and to determine whether new phones that do not have counterparts in Japanese or similar phones that have counterparts improve more from learning. This study consisted of three parts. In Experiment I, a speech production test was performed to observe the acoustic features of Korean and Japanese vowels. In Experiment II, the speech production of Korean vowels produced by Koreans, advanced Japanese learners of Korean, and beginning Japanese learners of Korean was investigated. In Experiment III, a speech perception study of Korean vowels produced by the two Japanese learner groups was conducted to observe the effect of learning on acquiring L2 phones. The conclusion drawn from the study was that the similar phones produced by Japanese show more similarity with those of Koreans than new phones in terms of F1 and F2, but Japanese learners of Korean displayed more improvement in new phones from learning.

  • PDF

한국어 음성인식을 위한 음성학 기반의 유사음소단위 집합 설계 (A Phonetics Based Design of PLU Sets for Korean Speech Recognition)

  • 홍혜진;김선희;정민화
    • 대한음성학회지:말소리
    • /
    • 제65호
    • /
    • pp.105-124
    • /
    • 2008
  • This paper presents the effects of different phone-like-unit (PLU) sets in order to propose an optimal PLU set for the performance improvement of Korean automatic speech recognition (ASR) systems. The examination of 9 currently used PLU sets indicates that most of them include a selection of allophones without any sufficient phonetic base. In this paper, a total of 34 PLU sets are designed based on Korean phonetic characteristics arid the effects of each PLU set are evaluated through experiments. The results show that the accuracy rate of each phone is influenced by different phonetic constraint(s) which determine(s) the PLU sets, and that an optimal PLU set can be anticipated through the phonetic analysis of the given speech data.

  • PDF

대화형 코퍼스의 설계 및 구조적 문서화에 관한 연구 (A Study in Design and Construction of Structured Documents for Dialogue Corpus)

  • 강창규;남명우;양옥렬
    • 한국콘텐츠학회논문지
    • /
    • 제4권4호
    • /
    • pp.1-10
    • /
    • 2004
  • 음성인식의 연구 대상은 낭독음성에서 대화음성으로 발전해가고 있다. 이를 위해서는 대량의 대화코퍼스가 필요하다. 그러나 아직 충분한 양의 대화코퍼스가 구축되어 있지 못하며 코퍼스의 주석 정보 또한 복잡하고 다양하게 표현하고 있어 효율적인 활용이 어렵다. 따라서 본 논문에서는 TEI를 기반으로 하여 대화 영역을 텔레뱅킹으로 설정하고 대화코퍼스를 구축하여 구축된 대화코퍼스의 주석 정보를 XML(extensible Markup Language)로 표준화할 수 있도록 DTD (Document Type Definition) 정의하고 저장 시스템을 설계하였다.

  • PDF

FSN 기반의 대어휘 연속음성인식 시스템 개발 (Development of FSN-based Large Vocabulary Continuous Speech Recognition System)

  • 박전규;이윤근
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2007년도 한국음성과학회 공동학술대회 발표논문집
    • /
    • pp.327-329
    • /
    • 2007
  • This paper presents a FSN-based LVCSR system and it's application to the speech TV program guide. Unlike the most popular statistical language model-based system, we used FSN grammar based on the graph theory-based FSN optimization algorithm and knowledge-based advanced word boundary modeling. For the memory and latency efficiency, we implemented the dynamic pruning scheduling based on the histogram of active words and their likelihood distribution. We achieved a 10.7% word accuracy improvement with 57.3% speedup.

  • PDF