• Title/Summary/Keyword: Speech Confidence

A Study of the use of allophonic cues in the perception of English word boundaries by Korean learners of English (한국인 영어 학습자의 영어 단어 경계 인지 시 변이음 단서 사용 연구)

  • Chang, Soo-Young;Park, Han-Sang
    • Phonetics and Speech Sciences / v.3 no.3 / pp.63-68 / 2011
  • This study investigates how Korean students use acoustic-phonetic cues to perceive word boundaries in near-homophonous English phrases. Sixty Korean college students took part in an experiment in which they discriminated word boundaries for 42 pairs of stimuli containing the allophonic cues of aspiration and glottal stop. Results were analysed in terms of the correctness of responses and the correlation between correctness and confidence. Stimulus pairs with the glottal stop cue yielded higher correctness, while those with the aspiration cue yielded relatively lower correctness. A comparison with previous studies of English and Japanese speakers showed that Korean and Japanese speakers of English were substantially less accurate than native speakers of English, and that Korean learners of English as a foreign language were less accurate than Japanese speakers of English as a second language.

DTW based Utterance Rejection on Broadcasting News Keyword Spotting System (방송뉴스 핵심어 검출 시스템에서의 오인식 거부를 위한 DTW의 적용)

  • Park, Kyung-Mi;Park, Jeong-Sik;Oh, Yung-Hwan
    • Proceedings of the KSPS conference / 2005.11a / pp.155-158 / 2005
  • Keyword spotting is effective for finding keywords in continuous speech. However, non-keywords may be accepted as keywords when environmental noise occurs or the speaker changes. To overcome this performance degradation, utterance rejection techniques that apply a confidence measure to the recognition result have been developed. In this paper, we apply DTW to an HMM-based broadcast news keyword spotting system to reject non-keywords. Experimental results show that the false acceptance rate is decreased to 50%.
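
The abstract above does not say how DTW is combined with the HMM output, so the following is only a minimal sketch of the general idea: compute a dynamic time warping (DTW) distance between a detected segment and a reference template, and reject the detection when the length-normalized distance is too large. The function names, the 1-D feature sequences, and the threshold value are all assumptions made for illustration, not the authors' method.

```python
import math


def dtw_distance(a, b):
    """Classic DTW distance between two 1-D numeric sequences."""
    n, m = len(a), len(b)
    # cost[i][j] holds the minimal accumulated cost of aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Allowed steps: insertion, deletion, or match.
            cost[i][j] = local + min(
                cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1]
            )
    return cost[n][m]


def accept_keyword(candidate, template, threshold):
    """Hypothetical rejection rule: accept only if the
    length-normalized DTW distance stays at or below the threshold."""
    score = dtw_distance(candidate, template) / (len(candidate) + len(template))
    return score <= threshold
```

With this sketch, `accept_keyword([1, 2, 3], [1, 2, 2, 3], 0.1)` accepts a time-stretched copy of the template, while `accept_keyword([0, 0], [1, 1], 0.1)` rejects a segment that differs in value. A real system would compare multi-dimensional feature frames rather than scalars.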

The Effect of Voice Generalization on Puberphonia Patients via Generalization-Reinforced Visual Feedback Program: A Case Study (일반화를 강화한 시각적 피드백 프로그램이 무변성 환자의 음성 일반화에 미치는 영향 : 사례연구)

  • Kwon, Soon-Bok;Park, Hee-June;Jeong, Ok-Ran;Wang, Soo-Geun
    • Speech Sciences / v.15 no.2 / pp.145-156 / 2008
  • The purpose of this study was to investigate why puberphonia patients revisit hospitals after completing treatment, and the effect of visual voice therapy on voice improvement. The subjects were two puberphonia patients who had been diagnosed by laryngologists, treated by a speech pathologist, and who revisited the hospital after completing their treatment. Laryngoscopy and acoustic and aerodynamic analyses were performed before and after voice treatment to investigate what changes occurred and why the treatment effect did not generalize naturally to daily life. As a result, the fundamental frequency (Fo) was significantly lowered in the acoustic analysis, and the maximum phonation time (MPT) increased to some extent in the aerodynamic analysis. In addition, there was a laryngoscopic change: the commissure glottic chink generally disappeared during phonation. Generalization did not occur naturally in daily life mainly because the high-pitched voice had been used for a long time. Beyond that, negative reactions or attitudes from surrounding people and a lack of confidence also contributed to the failure of generalization.

Academic Performance, Communication, and Psychosocial Development of Prelingual Deaf Children with Cochlear Implants in Mainstream Schools

  • Choi, Ji Eun;Hong, Sung Hwa;Moon, Il Joon
    • Journal of Audiology & Otology / v.24 no.2 / pp.61-70 / 2020
  • Background and Objectives: To assess the academic performance, communication skills, and psychosocial development of prelingual deaf children with cochlear implants (CIs) attending mainstream schools, and to evaluate the impact of auditory speech perception on their classroom performance. Subjects and Methods: The participants were 67 children with CIs attending mainstream schools. A survey was conducted using a structured questionnaire on academic performance in the native language, second language, mathematics, social studies, science, art, communication skills, self-esteem, and social relations. Additionally, auditory and speech performances at the last follow-up were reviewed retrospectively. Results: Most implanted children attending mainstream schools appeared to have positive self-esteem and confidence, and had little difficulty conversing in a quiet classroom. Also, about half of the implanted children (38/67) scored above average in general academic achievement. However, academic achievement in the second language (English), social studies, and science was usually poorer than general academic achievement. Furthermore, about half of the implanted children had difficulty understanding the class content (30/67) or conversing with peers in a noisy classroom (32/67). These difficulties were significantly associated with poor speech perception. Conclusions: Improving the listening environment for implanted children attending mainstream schools is necessary.

Academic Performance, Communication, and Psychosocial Development of Prelingual Deaf Children with Cochlear Implants in Mainstream Schools

  • Choi, Ji Eun;Hong, Sung Hwa;Moon, Il Joon
    • Korean Journal of Audiology / v.24 no.2 / pp.61-70 / 2020
  • Background and Objectives: To assess the academic performance, communication skills, and psychosocial development of prelingual deaf children with cochlear implants (CIs) attending mainstream schools, and to evaluate the impact of auditory speech perception on their classroom performance. Subjects and Methods: The participants were 67 children with CIs attending mainstream schools. A survey was conducted using a structured questionnaire on academic performance in the native language, second language, mathematics, social studies, science, art, communication skills, self-esteem, and social relations. Additionally, auditory and speech performances at the last follow-up were reviewed retrospectively. Results: Most implanted children attending mainstream schools appeared to have positive self-esteem and confidence, and had little difficulty conversing in a quiet classroom. Also, about half of the implanted children (38/67) scored above average in general academic achievement. However, academic achievement in the second language (English), social studies, and science was usually poorer than general academic achievement. Furthermore, about half of the implanted children had difficulty understanding the class content (30/67) or conversing with peers in a noisy classroom (32/67). These difficulties were significantly associated with poor speech perception. Conclusions: Improving the listening environment for implanted children attending mainstream schools is necessary.

New Postprocessing Methods for Rejecting Out-of-Vocabulary Words

  • Song, Myung-Gyu
    • The Journal of the Acoustical Society of Korea / v.16 no.3E / pp.19-23 / 1997
  • The goal of postprocessing in automatic speech recognition is to improve recognition performance through utterance verification at the output of the recognition stage. It focuses on the effective rejection of out-of-vocabulary words based on the confidence score of the hypothesized candidate word. We present two methods for computing confidence scores. Both methods are based on the distance between each observation vector and the representative code vector, which is defined as the most likely code vector at each state. While the first method employs simple time normalization, the second uses a normalization technique based on the concept of the on-line garbage model[1]. In a speaker-independent isolated word recognition experiment with discrete-density HMMs, the second method outperforms both the first one and the conventional likelihood ratio scoring method[2].
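
The abstract gives no formula, so the snippet below is only one plausible reading of the first method (simple time normalization): sum the distance between each observation vector and the representative code vector of its aligned state, then divide by the number of frames. The Euclidean distance, the `state_path` alignment input, and the rejection threshold are assumptions introduced for illustration.

```python
import math


def time_normalized_distance(observations, state_path, code_vectors):
    """Average per-frame distance between observations and the
    representative code vector of the state each frame is aligned to."""
    if not observations or len(observations) != len(state_path):
        raise ValueError("need one aligned state per observation frame")
    total = 0.0
    for obs, state in zip(observations, state_path):
        # Euclidean distance is an assumed choice of metric.
        total += math.dist(obs, code_vectors[state])
    # Dividing by the frame count makes scores comparable across durations.
    return total / len(observations)


def reject_as_oov(observations, state_path, code_vectors, threshold):
    """Hypothetical decision rule: a large average distance means
    low confidence, so the candidate word is rejected."""
    return time_normalized_distance(observations, state_path, code_vectors) > threshold
```

For example, two frames `(0, 0)` and `(3, 4)` aligned to states whose code vectors are both `(0, 0)` give an average distance of 2.5, which would be rejected at a threshold of 2.0.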

Improvement of Confidence Measure Performance using Background Model Set Algorithm (BMS 알고리즘을 이용한 거절기능 성능 향상)

  • Kim ByoungDon;Lee KyongRok;Kim JinYoung;Choi SeungHo
    • Proceedings of the KSPS conference / 2003.05a / pp.79-82 / 2003
  • In this paper, we propose the Background Model Set (BMS) algorithm for speaker verification to address a shortcoming in the calculation of the conventional confidence measure (CM). The CM expresses the relative likelihood between recognized models and unrecognized models; the unrecognized models are known as antiphone models. In the conventional approach, the probability and standard deviation are calculated using all phonemes when composing an antiphone model. This degrades the antiphone CM and also increases recognition time. To address this problem, we studied a method that uses the BMS algorithm to re-estimate the mean and standard deviation from the antiphonemes close to the phoneme used in the CM calculation.

Development of Autonomous Mobile Robot with Speech Teaching Command Recognition System Based on Hidden Markov Model (HMM을 기반으로 한 자율이동로봇의 음성명령 인식시스템의 개발)

  • Cho, Hyeon-Soo;Park, Min-Gyu;Lee, Hyun-Jeong;Lee, Min-Cheol
    • Journal of Institute of Control, Robotics and Systems / v.13 no.8 / pp.726-734 / 2007
  • Generally, a mobile robot moves according to a preloaded program. However, it is very hard for a non-expert to change the program that generates the robot's moving path, because non-experts know little about the teaching commands and operating methods for driving the robot. Therefore, a teaching method based on speech commands is increasingly needed for people who cannot use their hands and for non-experts who lack the expert knowledge to generate a path. In this study, to make it easy to teach the moving path of an autonomous mobile robot, an autonomous mobile robot with a speech recognition function is developed. Using the human voice as the teaching method provides a more convenient user interface for the mobile robot. To implement the teaching function, the designed robot system is composed of three separate control modules: a speech preprocessing module, a DC servo motor control module, and a main control module. We design and implement a speaker-dependent isolated word recognition system for creating the moving path of an autonomous mobile robot in an unknown environment. The system uses word-level Hidden Markov Models (HMMs) for the designated command vocabulary, and applies neural-network postprocessing conditionally, based on the confidence score. For spectral analysis, we use a filter-bank analysis model to extract features of the voice. The proposed word recognition system is tested using 33 Korean words for controlling mobile robot navigation, and we also evaluate the navigation performance of a mobile robot using only voice commands.

Articulation error of children with adenoid hypertrophy

  • Eom, Tae-Hoon;Jang, Eun-Sil;Kim, Young-Hoon;Chung, Seung-Yun;Lee, In-Goo
    • Clinical and Experimental Pediatrics / v.57 no.7 / pp.323-328 / 2014
  • Purpose: Adenoid hypertrophy is a physical alteration that may affect speech, and a speech disorder can have other negative effects on a child's life. Airway obstruction leads to constricted oral breathing and causes postural alterations of several oro-facial structures, including the mouth, tongue, and hyoid bone. The postural modifications may affect several aspects of speech production. Methods: In this study, we compared articulation errors in 19 children with adenoid hypertrophy (subject group) to those of 33 children with functional articulation disorders independent of anatomical problems (control group). Results: The mean age of the subject group was significantly higher (P=0.016). Substitution was more frequent in the subject group (P=0.003; odds ratio [OR], 1.80; 95% confidence interval [CI], 1.23- 2.62), while omission was less frequent (P<0.001; OR, 0.43; 95% CI, 0.27-0.67). Articulation errors were significantly less frequent in the palatal affricative in the subject group (P=0.047; OR, 0.25; 95% CI, 0.07-0.92). The number of articulation errors in other consonants was not different between the two groups. Nasalization and aspiration were significantly more frequent in the subject group (P=0.007 and 0.014; OR, 14.77 and 0.014; 95% CI, [1.62-135.04] and NA, respectively). Otherwise, there were no differences between the two groups. Conclusion: We identified the characteristics of articulation errors in children with adenoid hypertrophy, but our data did not show the relationship between adenoid hypertrophy and oral motor function that has been observed in previous studies. The association between adenoid hypertrophy and oral motor function remains doubtful.

A Method of Automated Quality Evaluation for Voice-Based Consultation (음성 기반 상담의 품질 평가를 위한 자동화 기법)

  • Lee, Keonsoo;Kim, Jung-Yeon
    • Journal of Internet Computing and Services / v.22 no.2 / pp.69-75 / 2021
  • In a contact-free society, online services are becoming more important than classic offline services. At the same time, the role of the contact center, which carries out customer relationship management (CRM), is increasingly essential. To support CRM tasks and make them more effective, process automation techniques need to be applied. Quality assurance (QA) is a typical time- and resource-consuming process that is well suited to automation. In this paper, a method of automatic quality evaluation for voice-based consultations is proposed. First, the speech in a consultation is transcribed into text by speech recognition. Then a quantitative evaluation is carried out based on QA metrics, including checking the required elements of the opening and closing statements, whether mandatory information was requested, and the agent's listening and speaking attitude. Of the automated evaluations, 92.7% matched the results produced by human experts. The non-matching cases were mainly caused by errors in the Speech-to-Text (STT) output. Provided the STT results are reliable, the proposed method can be used to improve the efficiency of the QA process in contact centers.