Search | Korea Science

An automatic pronunciation evaluation system using non-native teacher's speech model (비원어민 교수자 음성모델을 이용한 자동발음평가 시스템)

Park, Hye-bin;Kim, Dong Heon;Joung, Jinoo
- The Journal of the Institute of Internet, Broadcasting and Communication / v.16 no.2 / pp.131-136 / 2016
Appropriate evaluation of a learner's pronunciation has long been an important part of foreign language education. Learners should be evaluated and receive proper feedback to improve their pronunciation. Because human evaluation is costly and inconsistent, automatic pronunciation evaluation systems have been studied. Most current automatic evaluation systems build on Automatic Speech Recognition (ASR) technology. In this work, we propose evaluating a learner's pronunciation accuracy and fluency at the word level using ASR and a non-native teacher's speech model. A performance evaluation of our system confirms that the overall evaluation result for pronunciation accuracy and fluency represents the learner's English skill level quite accurately.
https://doi.org/10.7236/JIIBC.2016.16.2.131

Speech Recognition Website for Korean Pronunciation Training - Baleum (한국어 발음 훈련을 위한 음성 인식 웹 사이트 - 바름)

Junghye Min;Gyo Jin Kang;In Gi Kim
- Proceedings of the Korean Society of Computer Information Conference / 2023.07a / pp.29-32 / 2023
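This paper introduces a website that records the speech of foreigners and of Koreans who have difficulty with pronunciation and returns a score. The website aims to help users improve their pronunciation. By using a speech recognition API and a pronunciation assessment API to evaluate users' pronunciation accurately and provide feedback, it helps foreign-language learners and Koreans with pronunciation difficulties communicate more smoothly. As future work, we plan to add features that motivate users toward learning achievement in order to improve learning outcomes.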

Pronunciation Dictionary For Continuous Speech Recognition (한국어 연속음성인식을 위한 발음사전 구축)

이경님;정민화
- Proceedings of the Korean Information Science Society Conference / 2000.10b / pp.197-199 / 2000
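Continuous speech recognition requires a pronunciation dictionary and a language model. Because the decoding unit must match between the two, the dictionary entry is chosen as the decoding unit when building the pronunciation dictionary, and the dictionary must reflect the phonological changes that occur between entries. We analyze the phonological changes applicable to Korean, automatically generate pronunciation strings for training, and build the pronunciation dictionary from them. As preprocessing, symbols, units, and numbers are normalized and morphological analysis is performed; rule-based tagging is then applied to produce pseudo-morpheme units, which serve as the decoding units. The result is fed to the pronunciation string generator, whose output takes the form of either training pronunciation strings or pronunciation dictionary entries. Because these entries reflect cross-entry phonological changes, their forms differ from those of entries to which no such changes have been applied. These changes arise in continuous speech, so actual recognition requires a dictionary that reflects them. To verify the usefulness of the generated dictionary, we evaluated its performance as follows. For acoustic training, 17,200 read PBS (Phonetically Balanced Sentence) sentences were recorded and used together with their transcriptions, and 3,100 of these sentences were used for each dictionary evaluation. We compared the optimal (1-Best) pronunciation dictionary and the multiple-pronunciation (Multi) dictionary, both of which use morpheme tag information to reflect cross-entry phonological changes, with a standard pronunciation dictionary created manually according to linguistic criteria and with a dictionary generated from isolated words without considering cross-entry changes. The word recognition rate was 43.21% when cross-entry phonological changes were not reflected, versus 48.99% for the 1-Best dictionary and 50.19% for the Multi dictionary, an improvement of about 5 to 6%, and about 3 to 4% better than the 45.90% obtained with the manually built standard pronunciation dictionary.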
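
The reported word recognition rates can be compared with simple arithmetic. The sketch below only restates the four figures from the abstract; the dictionary labels and the helper name `gain` are illustrative and not taken from the paper, and the differences are treated as absolute percentage points, which is an assumption.

```python
# Word recognition rates (%) as reported in the abstract above.
# The dictionary keys are illustrative labels, not names from the paper.
rates = {
    "isolated": 43.21,  # no cross-entry phonological changes
    "manual": 45.90,    # hand-built standard pronunciation dictionary
    "1-best": 48.99,    # cross-entry changes, single best pronunciation
    "multi": 50.19,     # cross-entry changes, multiple pronunciations
}


def gain(system: str, reference: str) -> float:
    """Absolute difference in percentage points, rounded to 2 decimals."""
    return round(rates[system] - rates[reference], 2)


for system in ("1-best", "multi"):
    print(system,
          "vs isolated:", gain(system, "isolated"),
          "| vs manual:", gain(system, "manual"))
```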

Syllabus Design for Teaching Pronunciation in Korean EFL Classroom (한국인을 위한 영어 발음지도안 개발)

Park Jookyung
- Proceedings of the KSPS conference / 1996.10a / pp.142-148 / 1996
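The purpose of this study is to examine the problems that arise when Koreans pronounce English and, with communicative-competence-centered English education in mind, to draw up a teaching plan for training more accurate English pronunciation. We first review the characteristics and issues of English pronunciation teaching for Koreans, then identify concrete pronunciation teaching goals and matching teaching methods for more effective instruction. The primary goal is for learners to attain pronunciation that native speakers of English can hear and understand. To that end, instruction should cover (1) discriminating and pronouncing English consonants and vowels, (2) recognizing and producing correct stress and intonation, and (3) recognizing and producing linking and other major pronunciation phenomena, with more emphasis on (2) and (3) than on (1). To teach each of these more effectively, various communicative teaching methods and learning activities are introduced. The study also stresses the importance of assessing what has been taught and proposes how to do so, and it emphasizes the need for teacher training aimed at preparing more practical pronunciation lesson plans and for putting the prepared plans to use.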

Pronunciation Variation Modeling for Korean Point-of-Interest Data Using Prosodic Information (운율 정보를 이용한 한국어 위치 정보 데이터의 발음 모델링)

Kim, Sun-Hee;Park, Jeon-Gue;Jeon, Je-Hun;Na, Min-Soo;Chung, Min-Hwa
- Annual Conference on Human and Language Technology / 2006.10e / pp.51-56 / 2006
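Whereas most studies that apply prosodic information to speech recognition use acoustic prosodic information, this study shows that structural prosodic information, such as prosodic words and syllable count, contributes to recognition accuracy. The paper aims to evaluate recognizer performance when pronunciation modeling uses two kinds of prosodic information, prosodic words and syllable count. First, all possible pronunciations of the point-of-interest data are generated using prosodic words; next, a method is presented for adjusting the number of pronunciation variants according to syllable count; finally, speech recognition performance is evaluated with the pronunciation dictionaries generated by the proposed method. In the experiments, every dictionary built with prosodic words improved on the baseline, with a maximum WER reduction of 8.4% from the baseline WER of 4.63%. When the number of pronunciation variants was adjusted according to the syllable count of the point-of-interest data, the best recognition performance was obtained by limiting the number to 3 overall and to 4 for words of six or more syllables, which shows that adjusting the number of pronunciation variants by syllable count is effective.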

PESAA - Computer Assisted English Speaking Training system (PESAA - 컴퓨터 보조 영어 말하기 훈련 시스템)

Bang, Jeesoo;Lee, Jonghoon;Kang, Sechun;Lee, Geunbae Gary
- Annual Conference on Human and Language Technology / 2012.10a / pp.73-76 / 2012
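As the need for English education grows and demand rises, computer-based foreign language education systems are being introduced as a means of individual English learning. Pronunciation is one of the hardest parts of acquiring a new foreign language and an important factor in speaking ability, so it requires dedicated training. With this problem in mind, we developed a computer-assisted pronunciation training system to help improve foreign-language pronunciation. The system includes pronunciation training and intonation training, namely sentence stress training and phrasing (pause placement) training, and provides appropriate evaluation and feedback on the user's utterances. This paper focuses on the components and operation of the pronunciation training system.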

Pronunciation Variation Modeling for Korean Point-of-Interest Data Using Prosodic Information (운율 정보를 이용한 한국어 위치 정보 데이타의 발음 모델링)

Kim, Sun-He;Park, Jeon-Gue;Na, Min-Soo;Jeon, Je-Hun;Chung, Min-Wha
- Journal of KIISE:Software and Applications / v.34 no.2 / pp.104-111 / 2007
This paper examines how the performance of an automatic speech recognizer was improved for Korean Point-of-Interest (POI) data by modeling pronunciation variation using structural prosodic information such as prosodic words and syllable length. First, multiple pronunciation variants were generated using prosodic words, given that each POI word can be broken down into prosodic words. Cross-prosodic-word variations were then modeled by taking the syllable length of the word into account. A total of 81 experiments were conducted using 9 test sets (3 baseline and 6 proposed) on 9 trained sets (3 baseline and 6 proposed). The results show that (i) performance improved when the pronunciation lexica were generated using prosodic words; (ii) the best performance was achieved when the maximum number of variants was constrained to 3 based on syllable length; and (iii) compared to the baseline word error rate (WER) of 4.63%, a maximum WER reduction of 8.4% was achieved when both prosodic words and syllable length were considered.
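
For readers unfamiliar with the metric, the following is a minimal sketch of how WER is conventionally computed (word-level edit distance divided by reference length) and of what the reported figures would imply if the 8.4% is read as a relative reduction. That reading is an assumption, since the abstract does not state it, and the function names are illustrative rather than taken from the paper.

```python
def word_error_rate(reference: list[str], hypothesis: list[str]) -> float:
    """WER = (substitutions + deletions + insertions) / len(reference)."""
    if not reference:
        raise ValueError("reference must not be empty")
    # prev[j] is the edit distance between the reference prefix processed
    # so far (before the current word) and hypothesis[:j].
    prev = list(range(len(hypothesis) + 1))
    for i, ref_word in enumerate(reference, start=1):
        curr = [i]  # distance to an empty hypothesis: i deletions
        for j, hyp_word in enumerate(hypothesis, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,           # deletion
                            curr[j - 1] + 1,       # insertion
                            prev[j - 1] + cost))   # match or substitution
        prev = curr
    return prev[-1] / len(reference)


def apply_relative_reduction(baseline_wer: float, reduction: float) -> float:
    """WER after a relative reduction (assumed interpretation of '8.4%')."""
    return baseline_wer * (1.0 - reduction)


# Toy example: one substitution and one deletion against four reference words.
print(word_error_rate("a b c d".split(), "a x c".split()))

# Baseline 4.63% with an assumed 8.4% relative reduction.
print(round(apply_relative_reduction(4.63, 0.084), 2))
```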

Comparing the effects of letter-based and syllable-based speaking rates on the pronunciation assessment of Korean speakers of English (철자 기반과 음절 기반 속도가 한국인 영어 학습자의 발음 평가에 미치는 영향 비교)

Hyunsong Chung
- Phonetics and Speech Sciences / v.15 no.4 / pp.1-10 / 2023
This study investigated the relative effectiveness of letter-based versus syllable-based measures of speech rate and articulation rate in predicting the articulation score, prosody fluency, and rating sum using "English speech data of Koreans for education" from AI Hub. We extracted and analyzed 900 utterances from the training data, including three balanced age groups (13, 19, and 26 years old). The study built three models that best predicted the pronunciation assessment scores using linear mixed-effects regression and compared the predicted scores with the actual scores from the validation data (n=180). The correlation coefficients between them were also calculated. The findings revealed that syllable-based measures of speech and articulation rates were more effective than letter-based measures in all three pronunciation assessment categories. The correlation coefficients between the predicted and actual scores ranged from .65 to .68, indicating the models' good predictive power. However, it remains inconclusive whether speech rate or articulation rate is more effective.
https://doi.org/10.13064/KSSS.2023.15.4.001
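
As a rough illustration of the four measures compared in this abstract, the sketch below uses the common definitions: speech rate divides the unit count by total utterance duration including pauses, and articulation rate divides it by the duration with pauses removed. How the study counted syllables and detected pauses is not stated, so the syllable count and pause duration are taken as given inputs, and all names here are hypothetical.

```python
def count_letters(text: str) -> int:
    """Letter-based unit count: alphabetic characters in the transcript."""
    return sum(1 for ch in text if ch.isalpha())


def speech_rate(n_units: int, total_duration_s: float) -> float:
    """Units per second over the whole utterance, pauses included."""
    if total_duration_s <= 0:
        raise ValueError("total duration must be positive")
    return n_units / total_duration_s


def articulation_rate(n_units: int, total_duration_s: float,
                      pause_duration_s: float) -> float:
    """Units per second over speaking time only, pauses excluded."""
    speaking_time = total_duration_s - pause_duration_s
    if speaking_time <= 0:
        raise ValueError("speaking time must be positive")
    return n_units / speaking_time


# Hypothetical utterance: 12 syllables (assumed to be annotated elsewhere),
# 5.0 s long with 1.0 s of silent pauses.
transcript = "The quick brown fox jumps over the lazy dog today"
n_letters = count_letters(transcript)
n_syllables = 12

for label, n in (("letter-based", n_letters), ("syllable-based", n_syllables)):
    print(label,
          "speech rate:", round(speech_rate(n, 5.0), 2),
          "articulation rate:", round(articulation_rate(n, 5.0, 1.0), 2))
```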

Analysis on Vowel and Consonant Sounds of Patient's Speech with Velopharyngeal Insufficiency (VPI) and Simulated Speech (구개인두부전증 환자와 모의 음성의 모음과 자음 분석)

Sung, Mee Young;Kim, Heejin;Kwon, Tack-Kyun;Sung, Myung-Whun;Kim, Wooil
- Journal of the Korea Institute of Information and Communication Engineering / v.18 no.7 / pp.1740-1748 / 2014
This paper presents a listening test and an acoustic analysis of speech from patients with velopharyngeal insufficiency (VPI) and of speech simulated by normal speakers. A set consisting of 50 words, vowels, and single syllables was defined for constructing the speech database, and a web-based listening evaluation system was developed to make the evaluation procedure convenient and automated. The analysis shows that the patterns of incorrect recognition for VPI speech and for simulated speech are similar. This similarity is also confirmed by comparing the formant locations of vowels and the spectra of consonant sounds. These results show that the simulation method is effective at generating speech signals similar to actual VPI patients' speech. The simulated speech data are expected to be useful in our future work, such as acoustic model adaptation.
https://doi.org/10.6109/jkiice.2014.18.7.1740

Automatic Generation of Pronunciation Variants for Korean Continuous Speech Recognition (한국어 연속음성 인식을 위한 발음열 자동 생성)

이경님;전재훈;정민화
- The Journal of the Acoustical Society of Korea / v.20 no.2 / pp.35-43 / 2001
Many speech recognition systems use a pronunciation lexicon with possibly multiple phonetic transcriptions for each word. The pronunciation lexicon is often created manually, a process that requires a great deal of time and effort and makes it very difficult to keep the lexicon consistent. To address these problems, we present a model based on morphophonological analysis for automatically generating Korean pronunciation variants. By analyzing phonological variations frequently found in spoken Korean, we derived about 700 phonemic contexts that trigger the multilevel application of the corresponding phonological processes, which consist of phonemic and allophonic rules. When generating pronunciation variants, morphological analysis is performed first to handle variations of phonological words. According to the morphological category, a set of tables reflecting phonemic context is looked up to generate the pronunciation variants. Our experiments show that the proposed model produces mostly correct pronunciation variants of phonological words. We then evaluated the usefulness of the pronunciation lexicon and the training phonetic transcriptions generated with the proposed system.
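
To give a feel for context-triggered variant generation, here is a toy sketch that applies a single textbook rule, obstruent nasalization (a syllable-final ㄱ, ㄷ, or ㅂ becomes ㅇ, ㄴ, or ㅁ before an initial ㄴ or ㅁ), to orthographic Hangul. It is not the paper's model: the roughly 700 contexts, the morphological tables, and the phone set are not given in the abstract, and the names `nasalize` and `variants` are hypothetical.

```python
HANGUL_BASE = 0xAC00          # first precomposed Hangul syllable, '가'
N_MEDIAL, N_FINAL = 21, 28    # vowels and finals (including "no final")
N_SYLLABLES = 19 * N_MEDIAL * N_FINAL

# Final-consonant index -> nasalized final index (Unicode jamo ordering).
FINAL_NASALIZED = {1: 21, 7: 4, 17: 16}   # ㄱ→ㅇ, ㄷ→ㄴ, ㅂ→ㅁ
NASAL_INITIALS = {2, 6}                   # initial ㄴ, ㅁ


def decompose(ch: str):
    """Return (initial, medial, final) indices, or None if not a syllable."""
    idx = ord(ch) - HANGUL_BASE
    if not 0 <= idx < N_SYLLABLES:
        return None
    return idx // (N_MEDIAL * N_FINAL), (idx // N_FINAL) % N_MEDIAL, idx % N_FINAL


def compose(initial: int, medial: int, final: int) -> str:
    """Inverse of decompose."""
    return chr(HANGUL_BASE + (initial * N_MEDIAL + medial) * N_FINAL + final)


def nasalize(text: str) -> str:
    """Apply obstruent nasalization across adjacent syllables."""
    out = list(text)
    for k in range(len(out) - 1):
        cur, nxt = decompose(out[k]), decompose(out[k + 1])
        if cur and nxt and cur[2] in FINAL_NASALIZED and nxt[0] in NASAL_INITIALS:
            out[k] = compose(cur[0], cur[1], FINAL_NASALIZED[cur[2]])
    return "".join(out)


def variants(word: str) -> list[str]:
    """Citation form plus the rule-applied form when the rule fires."""
    surface = nasalize(word)
    return [word] if surface == word else [word, surface]


print(variants("국물"))
print(variants("바다"))
```

A real lexicon generator would emit phone strings rather than respelled Hangul and would order many interacting rules; the respelling here is only a convenient way to make the rule's effect visible.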

Search Result 125, Processing Time 0.023 seconds
