• 제목/요약/키워드: Speech Feedback

Search Result 86, Processing Time 0.022 seconds

Acoustic Analysis of Normal and Pathologic Voice Synthesized with Voice Synthesis Program of Dr. Speech Science (Dr. Speech Science의 음성합성프로그램을 이용하여 합성한 정상음성과 병적음성(Pathologic Voice)의 음향학적 분석)

  • 최홍식;김성수
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.12 no.2
    • /
    • pp.115-120
    • /
    • 2001
  • In this paper, we synthesized vowel /ae/ with voice synthesis program of Dr. Speech Science, and we also synthesized pathologic vowel /ae/ by some parameters such as high frequency gain (HFG), low frequency gain(LFG), pitch flutter(PF) which represents jitter value and flutter of amplitude(FA) which represents shimmer value, and grade ranked as mild, moderate and severe respectively. And then we analysed all pathologic voice by analysis program of Dr. Speech Science. We expect that this synthesized pathologic voices are useful for understanding the parameter such as noise, jitter and shimmer and feedback effect to patient with voice disorder.

  • PDF

Effects on the Speech Enhancement Algorithms for Sensorineural Hearing Impairment and Normal Listeners (배경잡음하에서의 감음신경성난청과 정상청력자의 어음인지향상 연구)

  • Kim, D.W.;Kim, I.Y.;Youn, G.W.
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1998 no.11
    • /
    • pp.171-172
    • /
    • 1998
  • Recent development of digital technology has offered new possibilities for noticeable advances of hearing aids. Using the digital technology, it is possible to equip hearing aids with powerful features such as multi-channel nonlinear compression amplification and the feedback cancellation, these are often difficult to implement with analog circuits. Still, speech in noise is one of the major complaints of not only hearing impaired persons but also normal listeners. This paper describes speech intelligibility in background noise for both normal and hearing impaired listeners. Speech enhancement algorithms were implemented and compared for normal and sensorineural hearing impairment listeners.

  • PDF

COMPUTER AND INTERNET RESOURCES FOR PRONUNCIATION AND PHONETICS TEACHING

  • Makarova, Veronika
    • Proceedings of the KSPS conference
    • /
    • 2000.07a
    • /
    • pp.338-349
    • /
    • 2000
  • Pronunciation teaching is once again coming into the foreground of ELT. Japan is, however, lagging far behind many countries in the development of pronunciation curricula and in the actual speech performance of the Japanese learners of English. The reasons for this can be found in the prevalence of communicative methodologies unfavorable for pronunciation teaching, in the lack of trained professionals, and in the large numbers of students in Japanese foreign language classes. This paper offers a way to promote foreign language pronunciation teaching in Japan and other countries by means of employing computer and internet facilities. The paper outlines the major directions of using modem speech technologies in pronunciation classes, like EVF (electronic visual feedback) training at segmental and prosodic levels; automated error detection, testing, grading and fluency assessment. The author discusses the applicability of some specific software packages (CSLU, SUGIspeech, Multispeech, Wavesurfer, etc.) for the needs of pronunciation teaching. Finally, the author talks about the globalization of pronunciation education via internet resources, such as computer corpora and speech and pronunciation training related web pages.

  • PDF

A Study on Automatic Measurement of Pronunciation Accuracy of English Speech Produced by Korean Learners of English (한국인 영어 학습자의 발음 정확성 자동 측정방법에 대한 연구)

  • Yun, Weon-Hee;Chung, Hyun-Sung;Jang, Tae-Yeoub
    • Proceedings of the KSPS conference
    • /
    • 2005.11a
    • /
    • pp.17-20
    • /
    • 2005
  • The purpose of this project is to develop a device that can automatically measure pronunciation of English speech produced by Korean learners of English. Pronunciation proficiency will be measured largely in two areas; suprasegmental and segmental areas. In suprasegmental area, intonation and word stress will be traced and compared with those of native speakers by way of statistical methods using tilt parameters. Durations of phones are also examined to measure speakers' naturalness of their pronunciations. In doing so, statistical duration modelling from a large speech database using CART will be considered. For segmental measurement of pronunciation, acoustic probability of a phone, which is a byproduct when doing the forced alignment, will be a basis of scoring pronunciation accuracy of a phone. The final score will be a feedback to the learners to improve their pronunciation.

  • PDF

Improvement of acoustic feedback stability by bandwidth compression and expansion

  • 염동홍;안수길
    • The Journal of the Acoustical Society of Korea
    • /
    • v.4 no.1
    • /
    • pp.16-21
    • /
    • 1985
  • Both shifiting the input signal's frequencies by a fixed frequency and compressing the input signal's bandwidth have been known to be effective in improving the stability margin of public adress systems operating in reverberant spaces. This paper describes the effect of an alternative approach of improving the acoustic-feedback stability and yet maintaining speech inteligibility by bandwidth compression and expansion. Conditions are derived for this technizue to be realized and an experimental system has been made - up. A series of experiments has been performed in small spaces and the results have shown that more than 5dB improvement can be obtained in the stability margin.

  • PDF

The Effect of Semantic Neighborhood Density in Korean Visual Word Recognition (한국어 시각단어재인에서 의미 이웃크기 효과)

  • Kwon, You-An;Nam, Ki-Chun
    • Proceedings of the KSPS conference
    • /
    • 2007.05a
    • /
    • pp.173-175
    • /
    • 2007
  • The lexical decision task (LDT) commonly postulates the activation of semantic level. However, there are few studies for the feedback effect from semantic level. The purpose of the present study is to investigate whether the feedback effect from semantic level is facilitatory or inhibitory in Korean LDT. In Experiment 1, we manipulated the number of phonological syllable neighbors (PSN) and the number of semantic neighbors (SEN) orthogonally while orthographic syllable neighbor (OSN) is dense. In the results, the significant facilitatory effect was shown in words with many SEN. In Experiment 2, we examined same conditions as Experiment 1 but OSN was sparse. Although the similar lexical decision latency pattern was shown, there was no statistical significance. These results can be explained by the feedback activation from semantic level. If a target has many SENs and many PSNs, it receives more feedback activation from semantic level than a target with few SENs and PSNs.

  • PDF

A Full Body Gumdo Game with an Intelligent Cyber Fencer using Multi-modal(3D Vision and Speech) Interface (멀티모달 인터페이스(3차원 시각과 음성 )를 이용한 지능적 가상검객과의 전신 검도게임)

  • 윤정원;김세환;류제하;우운택
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.9 no.4
    • /
    • pp.420-430
    • /
    • 2003
  • This paper presents an immersive multimodal Gumdo simulation game that allows a user to experience the whole body interaction with an intelligent cyber fencer. The proposed system consists of three modules: (i) a nondistracting multimodal interface with 3D vision and speech (ii) an intelligent cyber fencer and (iii) an immersive feedback by a big screen and sound. First, the multimodal Interface with 3D vision and speech allows a user to move around and to shout without distracting the user. Second, an intelligent cyber fencer provides the user with intelligent interactions by perception and reaction modules that are created by the analysis of real Gumdo game. Finally, an immersive audio-visual feedback by a big screen and sound effects helps a user experience an immersive interaction. The proposed system thus provides the user with an immersive Gumdo experience with the whole body movement. The suggested system can be applied to various applications such as education, exercise, art performance, etc.

The Study on Automatic Speech Recognizer Utilizing Mobile Platform on Korean EFL Learners' Pronunciation Development (자동음성인식 기술을 이용한 모바일 기반 발음 교수법과 영어 학습자의 발음 향상에 관한 연구)

  • Park, A Young
    • Journal of Digital Contents Society
    • /
    • v.18 no.6
    • /
    • pp.1101-1107
    • /
    • 2017
  • This study explored the effect of ASR-based pronunciation instruction, using a mobile platform, on EFL learners' pronunciation development. Particularly, this quasi-experimental study focused on whether using mobile ASR, which provides voice-to-text feedback, can enhance the perception and production of target English consonants minimal pairs (V-B, R-L, and G-Z) of Korean EFL learners. Three intact classes of 117 Korean university students were assigned to three groups: a) ASR Group: ASR-based pronunciation instruction providing textual feedback by the mobile ASR; b) Conventional Group: conventional face-to-face pronunciation instruction providing individual oral feedback by the instructor; and the c) Hybrid Group: ASR-based pronunciation instruction plus conventional pronunciation instruction. The ANCOVA results showed that the adjusted mean score for pronunciation production post-test on the Hybrid instruction group (M=82.71, SD =3.3) was significantly higher than the Conventional group (M=62.6, SD =4.05) (p<.05).

Education System to Learn the Skills of Management Decision-Making by Using Business Simulator with Speech Recognition Technology

  • Sakata, Daiki;Akiyama, Yusuke;Kaneko, Masaaki;Kumagai, Satoshi
    • Industrial Engineering and Management Systems
    • /
    • v.13 no.3
    • /
    • pp.267-277
    • /
    • 2014
  • In this paper, we propose an educational system that involves a business game simulator and related curriculum. To develop these two elements, we examined the decision-making process related to business management and identified some significant skills thereby. In addition, we created an original simulator, named BizLator (http://bizlator.com), to help students develop these skills efficiently. Next, we developed a curriculum suitable for the simulator. We confirmed the effectiveness of the simulator and curriculum in a business-game-based class at Aoyama Gakuin University in Tokyo. On the basis of this, we compared our education system with a conventional system. This allowed us to identify advantages of and issues with our proposed system. Furthermore, we proposed a speech recognition support system named BizVoice in order to provide the teachers with more meaningful feedback, such as level of students' understanding. Concretely, BizVocie fetches students' speech of discussion during the game and converts the voice data to text data with speech recognition technology. Finally, teachers can grasp students' parameters of understanding, and thereby, the students also can take more effective class using BizLator. We also confirmed the effectiveness of the system in the class of Aoyama Gakuin Universiry.

Adaptive Speech Emotion Recognition Framework Using Prompted Labeling Technique (프롬프트 레이블링을 이용한 적응형 음성기반 감정인식 프레임워크)

  • Bang, Jae Hun;Lee, Sungyoung
    • KIISE Transactions on Computing Practices
    • /
    • v.21 no.2
    • /
    • pp.160-165
    • /
    • 2015
  • Traditional speech emotion recognition techniques recognize emotions using a general training model based on the voices of various people. These techniques can not consider personalized speech character exactly. Therefore, the recognized results are very different to each person. This paper proposes an adaptive speech emotion recognition framework made from user's' immediate feedback data using a prompted labeling technique for building a personal adaptive recognition model and applying it to each user in a mobile device environment. The proposed framework can recognize emotions from the building of a personalized recognition model. The proposed framework was evaluated to be better than the traditional research techniques from three comparative experiment. The proposed framework can be applied to healthcare, emotion monitoring and personalized service.