Search | Korea Science

A Korean Flight Reservation System Using Continuous Speech Recognition

Choi, Jong-Ryong;Kim, Bum-Koog;Chung, Hyun-Yeol;Nakagawa, Seiichi
- The Journal of the Acoustical Society of Korea, v.15 no.3E, pp.60-65, 1996
This paper describes a Korean continuous speech recognition system for flight reservation. It adopts a frame-synchronous One-Pass DP search algorithm driven by the syntactic constraints of a context-free grammar (CFG). For recognition, 48 phoneme-like units (PLUs) were defined and used as the basic units for acoustic modeling of Korean. Acoustic modeling used HMMs, where each model has 4 states, 3 continuous output probability distributions, and 3 discrete duration distributions. Language modeling by CFG was applied to the flight reservation task domain, which consists of 346 words and 422 rewriting rules. In the tests, a sentence recognition rate of 62.6% was obtained after speaker adaptation.

Brief Commentary on Philological Value of "EuiBangYooChui"(Classified Assemblage of Medical Prescriptions) ("의방유취(醫方類聚)"의 문헌가치(文獻價値)에 관한 관견(管見))

Hu, Sen
- Korean Journal of Oriental Medicine, v.14 no.2, pp.151-154, 2008
"EuiBangYooChui" (Classified Assemblage of Medical Prescriptions) preserves important historical documents on herbal medical prescriptions up to the beginning of the Ming dynasty. Miki Sakae, a well-known Japanese scholar of medical history, placed a high value on "EuiBangYooChui", stating that it summarized the medical knowledge of all of China and raised Korean medicine to a world-class level. "EuiBangYooChui" thoroughly cites the herbal prescriptions of 150 Chinese medical books, and its contents run to as many as 9.5 million characters. It also identifies the sources of all its contents, which made it easy for later scholars to use. The surviving block-printed editions are valuable in many areas of philology, such as the recovery of lost documents and the study of block-printed editions, textual content, medical history, characters, phonemes, and scholia.

Branch Algorithm for Phoneme Segmentation in Korean Speech Recognition System (한국어 음성인식 시스템에서 음소 경계 검출을 위한 Branch 알고리즘)

서영완;한승진;장흥종;이정현
- Proceedings of the Korean Information Science Society Conference, 2000.04b, pp.357-359, 2000
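Speech data built at the phoneme level is very important in fields such as speech recognition, synthesis, and analysis. Phonemes are generally divided into voiced and unvoiced sounds. Although voiced and unvoiced sounds differ in many characteristics, existing phoneme boundary detection algorithms do not take this into account and determine phoneme boundaries only by comparing parameters (spectra) with the previous frame along the time axis. In this paper, we design a block-based Branch algorithm for phoneme boundary detection that takes the characteristic differences between voiced and unvoiced sounds into account. The spectral comparison used by the Branch algorithm is a distance measure based on MFCCs (Mel-Frequency Cepstrum Coefficients), and formant frequencies are used to distinguish voiced from unvoiced sounds. In experiments on isolated words of 3 to 4 syllables, an accuracy of about 78% was obtained.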
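
The conventional approach this abstract contrasts itself with, comparing each frame's spectral parameters with the previous frame and marking a boundary where they differ sharply, can be sketched roughly as follows. This is only an illustration of that baseline idea, not the paper's Branch algorithm: the function names, the Euclidean distance, the threshold value, and the toy two-dimensional feature vectors (standing in for MFCC frames) are all assumptions.

```python
import math


def frame_distance(a, b):
    """Euclidean distance between two feature vectors (e.g. MFCC frames)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def detect_boundaries(frames, threshold):
    """Return every index i where frame i differs from frame i-1 by more than threshold."""
    boundaries = []
    for i in range(1, len(frames)):
        if frame_distance(frames[i - 1], frames[i]) > threshold:
            boundaries.append(i)
    return boundaries


# Hypothetical toy features: three steady segments of different character.
frames = [
    [1.0, 0.0], [1.1, 0.1], [0.9, 0.0],   # segment A
    [5.0, 4.0], [5.1, 4.1],               # segment B
    [0.0, 9.0], [0.1, 9.1], [0.0, 8.9],   # segment C
]
print(detect_boundaries(frames, threshold=2.0))  # → [3, 5]
```

A single global threshold of this kind ignores the differences between voiced and unvoiced sounds, which is the limitation the abstract points to.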

The Role of Linguistic Knowledge in the Perception of English Stops after /s/

Kim, Dae-Won
- Speech Sciences, v.3, pp.71-82, 1998
Five sets of nonsense acoustic stimuli, $[sp\varepsilon, st\varepsilon, sk\varepsilon]$, $[p\varepsilon, t\varepsilon, k\varepsilon]$, $[sb\varepsilon, sd\varepsilon, sg\varepsilon]$, $[b\varepsilon, d\varepsilon, g\varepsilon]$, and $['\varepsilon b\varepsilon, '\varepsilon d\varepsilon, '\varepsilon g\varepsilon]$, were presented to native speakers of English, Chinese, and Korean for identification of English stops. The English speakers perceived stops after /s/ as /p, t, k/ and stops in other contexts as /b, d, g/. Speakers of languages in which other distinctions exist, however, judged the stimuli differently. The results suggest that in English the cue for stops after /s/ is a syllable structure constraint (initial /s/ is always followed by /p, t, k/), while the cue for initial stops is aspiration. On the basis of these results, it was concluded that the unaspirated voiceless stops in English initial /s/-stop clusters should be classified under the same phonemes as [$p^{h}, t^{h}, k^{h}$], and that perception is not only language specific but also context specific.

A Comparison on /ㅅ/ and /ㅆ/ in Daegu and Seoul dialect (대구 방언과 서울 방언의 /ㅅ/와 /ㅆ/의 실현 양상 비교)

Jang, Hye-Jin;Shin, Ji-Young
- Proceedings of the KSPS conference, 2006.11a, pp.84-87, 2006
It has been known that the Daegu dialect does not have /ㅆ/ as a phoneme. However, /ㅅ/ and /ㅆ/ appear to be phonemically distinct among the younger generation. In this paper, we investigate the realization of /ㅅ/ and /ㅆ/ by Daegu dialect speakers in their 20s and compare it with that of Seoul dialect speakers in their 20s. The results showed that /ㅅ/ and /ㅆ/ did not differ significantly between the Daegu and Seoul dialects except in pitch. Therefore, /ㅅ/ and /ㅆ/ are phonemically distinct among younger speakers of the Daegu dialect, as they are in the Seoul dialect.

Changes in Features of Korean Vowels with Age and Sex of Speakers and Their Recognition (한국어 단모음의 성별, 연령별 특징변화 및 인식)

이용주;김경태;차균현
- Journal of the Korean Institute of Telematics and Electronics, v.25 no.12, pp.1503-1512, 1988
As a basic analysis toward handling within- and cross-speaker variability in phoneme-based speech recognition, changes in the pitch and formant frequencies of 8 Korean vowels with the age and sex of the speaker were investigated by analyzing a large number of samples. The conclusions are as follows: 1) For children, changes in pitch frequency with age and sex are hard to distinguish, and the difference before and after the voice change is approximately 0.2 oct. for females and 0.9 oct. for males. 2) While most vowel formants change considerably with the speaker's age, the change becomes smaller as age increases. 3) While there is an indirect correlation between pitch and formants as age changes, a direct correlation is hard to see. 4) When the recognition experiment using pitch and formants covers various speakers of each age and sex, pitch also works as an effective recognition parameter.

Algorithm for Concatenating Multiple Phonemic Units for Small Size Korean TTS Using RE-PSOLA Method

Bak, Il-Suh;Jo, Cheol-Woo
- Speech Sciences, v.10 no.1, pp.85-94, 2003
This paper proposes an algorithm to reduce the size of a Text-to-Speech database. The algorithm is based on the characteristics of Korean phonemic units. From the initial database, a reduced set of phoneme units is derived from the articulatory similarity of concatenated phonemes. The speech data consist of 1000 phonetically balanced sentences read by one female announcer. All of the recorded speech was then segmented by phoneticians. The total size of the original speech data is about 640 MB, including the laryngograph signal. To synthesize the waveform, RE-PSOLA (Residual-Excited Pitch Synchronous Overlap and Add) was used. The voice quality of the synthesized speech was compared with the original speech in terms of spectrographic information and objective tests. The quality of the synthesized speech was not much degraded when the size of the synthesis DB was reduced from 320 MB to 82 MB.

A STUDY ON THE SIMULATED ANNEALING OF SELF ORGANIZED MAP ALGORITHM FOR KOREAN PHONEME RECOGNITION

Kang, Myung-Kwang;Ann, Tae-Ock;Kim, Lee-Hyung;Kim, Soon-Hyob
- Proceedings of the Acoustical Society of Korea Conference, 1994.06c, pp.407-410, 1994
In this paper, we describe a new unsupervised learning algorithm, SASOM. It addresses a defect of the conventional SOM, namely that the state of the network may fail to converge to the minimum point. The proposed algorithm uses an objective function that evaluates the state of the network during learning and adjusts the learning rate flexibly according to that evaluation. We implement simulated annealing, applied to the conventional network, using the objective function and the learning rate. As a result, the proposed algorithm can make the state of the network converge to the global minimum. Using two-dimensional input vectors with a uniform distribution, we graphically compared the ordering ability of SOM with that of SASOM. We carried out recognition experiments with the new algorithm on all Korean phonemes and some continuous speech.

The Basic Study on making biphone for Korean Speech Recognition (한국어 음성 인식용 biphone 구성을 위한 기초 연구)

Hwang YoungSoo;Song Minsuck
- Proceedings of the Acoustical Society of Korea Conference, spring, pp.99-102, 2000
When building a large-vocabulary speech recognition system, it is better to use the segment than the syllable or the word as the recognition unit. In this paper, we present a basic study on constructing biphones for Korean speech recognition. For the experiments, we use the speech toolkit from OGI in the U.S.A. The results show that the recognition rate is higher when a diphthong is treated as a single unit than when it is treated as two units, i.e. a glide plus a vowel. The recognition rate is also better when the biphone is used as the recognition unit than when the mono-phoneme is used.

Korean Phoneme Sequence based Word Embedding (한국어 음소열 기반 워드 임베딩 기술)

Chung, Euisok;Jeon, Hwa Jeon;Lee, Sung Joo;Park, Jeon-Gue
- 한국어정보학회:학술대회논문집, 2017.10a, pp.225-227, 2017
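This paper deals with Korean subword-based word embedding. To apply to Korean a new word embedding technique that can replace existing word embedding techniques, which suffer from the out-of-vocabulary problem, we evaluate phoneme-sequence-based subword features. Existing subword features use character n-grams. In Korean, the pronunciation of a given single syllable varies depending on the word. Phoneme sequence n-grams therefore have the advantage of securing the discriminative power of particular subword features. This paper reimplements the subword embedding technique and shows a performance advantage over existing word embedding in an English setting. In addition, experiments using Korean phoneme sequence features show that semantically more similar words are placed closer together in the vector space.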
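
The difference between character n-grams and n-grams over smaller sound-like units can be illustrated with a short sketch. Everything here is an assumption for illustration: the function names `to_jamo` and `ngrams` are hypothetical, the `<` and `>` boundary markers follow a common subword-embedding convention, and the decomposition is purely orthographic (Unicode Hangul syllable arithmetic). It does not model pronunciation changes such as 국물 being pronounced [궁물], so the paper's actual phoneme sequences would need a grapheme-to-phoneme step that is not shown.

```python
# Jamo tables in Unicode Hangul-syllable order (19 initials, 21 vowels, 27 finals + none).
CHO = list("ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ")
JUNG = list("ㅏㅐㅑㅒㅓㅔㅕㅖㅗㅘㅙㅚㅛㅜㅝㅞㅟㅠㅡㅢㅣ")
JONG = [""] + list("ㄱㄲㄳㄴㄵㄶㄷㄹㄺㄻㄼㄽㄾㄿㅀㅁㅂㅄㅅㅆㅇㅈㅊㅋㅌㅍㅎ")


def to_jamo(word):
    """Split each precomposed Hangul syllable into its orthographic jamo."""
    units = []
    for ch in word:
        offset = ord(ch) - 0xAC00
        if 0 <= offset < 11172:          # precomposed syllable block U+AC00..U+D7A3
            units.append(CHO[offset // 588])
            units.append(JUNG[(offset % 588) // 28])
            if offset % 28:              # a final consonant is present
                units.append(JONG[offset % 28])
        else:
            units.append(ch)             # leave non-Hangul characters untouched
    return units


def ngrams(units, n):
    """All n-grams of a unit sequence, padded with word-boundary markers."""
    padded = ["<"] + list(units) + [">"]
    return ["".join(padded[i:i + n]) for i in range(len(padded) - n + 1)]


print(ngrams("한국", 3))           # character trigrams
print(ngrams(to_jamo("한국"), 3))  # jamo trigrams
```

A two-syllable word yields only two character trigrams but six jamo trigrams, which gives an embedding model more, and finer-grained, subword features to share across words.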
