Search | Korea Science

A Prosodic Characteristics of Korean Read Sentences in Discourse Context (한국어 낭독체 담화문의 운율적 특징)

성철재
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.08a
- /
- pp.209-213
- /
- 1998
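We examined the prosodic features of the first and last syllables in the first and last word phrases (eojeol) of 50 sentences, each read both in isolation and as part of continuous discourse. To examine these systematically, we computed, for each word phrase, the ratio of the first syllable's acoustic parameter to that of the last syllable, and then obtained the means and distributions of these ratios. For duration, there was no notable difference between the two styles, although speakers' articulatory timing appeared slightly less well coordinated at the beginning of sentences in continuous discourse. For F0, the ratio in the final word phrase differed significantly between the two styles, suggesting that it could function as a prosodic feature. Energy showed a distribution similar to that of F0. The last syllable of the sentence-final word phrase was produced with about 85% of the energy of the first syllable, and the final word phrase was articulated relatively more strongly in continuous discourse than in isolated sentences.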
PDF
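The ratio-based summary described in the abstract above can be illustrated with a minimal sketch. This is not the author's procedure: the function names `first_to_last_ratio` and `summarize` and the duration values are hypothetical, chosen only to show how a first-to-last syllable ratio, its mean, and its spread might be computed per word phrase.

```python
from statistics import mean, stdev

def first_to_last_ratio(first_value: float, last_value: float) -> float:
    """Ratio of the first syllable's acoustic value to the last syllable's."""
    if last_value == 0:
        raise ValueError("last-syllable value must be non-zero")
    return first_value / last_value

def summarize(pairs):
    """Return (mean, sample standard deviation) of the ratios
    for a list of (first_syllable, last_syllable) measurements."""
    ratios = [first_to_last_ratio(first, last) for first, last in pairs]
    return mean(ratios), stdev(ratios)

# Hypothetical durations in milliseconds for three word phrases:
# (first syllable, last syllable). These are NOT data from the paper.
isolated = [(120.0, 150.0), (100.0, 125.0), (90.0, 120.0)]
avg, spread = summarize(isolated)
print(f"mean ratio = {avg:.3f}, sd = {spread:.3f}")
```

The same two functions could be applied to F0 or energy measurements, and to each speaking style separately, before comparing the resulting distributions.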

Effective Syllable Modeling for Korean Speech Recognition Using Continuous HMM (연속 은닉 마코프 모델을 이용한 한국어 음성 인식을 위한 효율적 음절 모델링)

김봉완;이용주
- The Journal of the Acoustical Society of Korea
- /
- v.22 no.1
- /
- pp.23-27
- /
- 2003
Recently, attempts to use the syllable as the recognition unit to enhance performance in continuous speech recognition have been reported. However, syllables are less trainable than phones, and context-dependent modeling across the syllable boundary is difficult because the number of models is much larger for syllables than for phones. In this paper, we propose a method to enhance the trainability of syllables in Korean, together with phoneme-context-dependent syllable modeling across the syllable boundary. An experiment in which the proposed method is applied to word recognition shows an average error reduction of 46.23% in comparison with common syllable modeling. The right-phone-dependent syllable model showed a 16.7% error reduction compared with a triphone model.
PDF KSCI

A Recognition of Word Spacing Errors Using By Syllable Bigram (음절 bigram 특성을 이용한 띄어쓰기 오류의 인식)

강승식
- Proceedings of the Korean Society for Cognitive Science Conference
- /
- 2000.06a
- /
- pp.85-88
- /
- 2000
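We extracted co-occurrence frequencies of adjacent syllables from a large corpus and examined the syllable bigram characteristics of Hangul. Syllable bigram characteristics are expected to be useful in a variety of applications, such as automatic word spacing for documents in which spacing has been ignored, deciding whether a given word phrase (eojeol) contains a word-spacing error, and correcting misspelled words in a spelling checker. In this paper, we conducted experiments applying Hangul syllable bigram characteristics to automatic word spacing and to deciding whether an input word phrase contains a word-spacing error. The experimental results confirmed that syllable bigram characteristics can be very useful.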
PDF

A Recognition of Word Spacing Errors Using By Syllable Bigram (음절 bigram 특성을 이용한 띄어쓰기 오류의 인식)

Kang, Seung-Shik
- Annual Conference on Human and Language Technology
- /
- 2000.10d
- /
- pp.85-88
- /
- 2000
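We extracted co-occurrence frequencies of adjacent syllables from a large corpus and examined the syllable bigram characteristics of Hangul. Syllable bigram characteristics are expected to be useful in a variety of applications, such as automatic word spacing for documents in which spacing has been ignored, deciding whether a given word phrase (eojeol) contains a word-spacing error, and correcting misspelled words in a spelling checker. In this paper, we conducted experiments applying Hangul syllable bigram characteristics to automatic word spacing and to deciding whether an input word phrase contains a word-spacing error. The experimental results confirmed that syllable bigram characteristics can be very useful.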
PDF
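As a rough illustration of how adjacent-syllable co-occurrence counts might be used to flag a word-spacing error, the sketch below counts how often each syllable pair appears joined inside a word phrase versus split by a space, and marks positions where the split count is higher. This is an assumed, simplified decision rule, not the method of the paper; the function names `train` and `suspicious_boundaries` and the three-sentence corpus are hypothetical.

```python
from collections import Counter

def train(sentences):
    """Count adjacent syllable pairs seen joined (inside a word phrase)
    and seen split (across a space). Assumes precomposed Hangul, where
    one syllable is one character."""
    joined, split = Counter(), Counter()
    for sentence in sentences:
        words = sentence.split()
        for word in words:
            for a, b in zip(word, word[1:]):
                joined[a + b] += 1
        for left, right in zip(words, words[1:]):
            split[left[-1] + right[0]] += 1
    return joined, split

def suspicious_boundaries(eojeol, joined, split):
    """Return character offsets inside `eojeol` where the syllable pair
    was seen split by a space more often than joined."""
    positions = []
    for i, (a, b) in enumerate(zip(eojeol, eojeol[1:]), start=1):
        pair = a + b
        if split[pair] > joined[pair]:
            positions.append(i)
    return positions

# Tiny hypothetical corpus; a real system would use a large corpus.
corpus = ["나는 학교에 간다", "나는 집에 간다", "학교에 간다"]
joined, split = train(corpus)
print(suspicious_boundaries("학교에간다", joined, split))  # prints [3]
```

An input word phrase with a non-empty result would be treated as a candidate spacing error, and the returned offsets are the places where a space could be inserted. A raw count comparison is used here only for brevity; a probability threshold would be a natural refinement.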

A Phonetic Study of Korean Rhythm (한국어 리듬의 음성학적 연구)

LEE H.B.
- MALSORI
- /
- no.4
- /
- pp.31-48
- /
- 1982
This paper describes the rhythmic structure of the Korean standard speech of Seoul in terms of what the writer calls the 'Speech Segment' as the basic unit of the rhythm. The speech segment consists of a 'Nucleus' (a stressed syllable) with or without one or more weak syllable(s). The nucleus is always long and the weak syllables are short, except the last syllable of the speech rhythm, which may be realized nearly as long as the nuclear syllable.
PDF

Pronunciation Variation Modeling for Korean Point-of-Interest Data Using Prosodic Information (운율 정보를 이용한 한국어 위치 정보 데이타의 발음 모델링)

Kim, Sun-He;Park, Jeon-Gue;Na, Min-Soo;Jeon, Je-Hun;Chung, Min-Wha
- Journal of KIISE:Software and Applications
- /
- v.34 no.2
- /
- pp.104-111
- /
- 2007
This paper examines how the performance of an automatic speech recognizer was improved for Korean Point-of-Interest (POI) data by modeling pronunciation variation using structural prosodic information such as prosodic words and syllable length. First, multiple pronunciation variants were generated using prosodic words, given that each POI word can be broken down into prosodic words. Second, the cross-prosodic-word variations were modeled considering the syllable length of the word. A total of 81 experiments were conducted using 9 test sets (3 baseline and 6 proposed) on 9 trained sets (3 baseline and 6 proposed). The results show that: (i) performance improved when the pronunciation lexica were generated using prosodic words; (ii) the best performance was achieved when the maximum number of variants was constrained to 3 based on the syllable length; and (iii) compared to the baseline word error rate (WER) of 4.63%, a maximum WER reduction of 8.4% was achieved when both prosodic words and syllable length were considered.
PDF KSCI

An acoustic study on the intonation pattern of Cheju dialects in Korean (제주방언 억양패턴의 실험음성학적 연구)

Lee Sook-hyang
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.369-372
- /
- 1999
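This study presents an experimental phonetic analysis of the intonation patterns of the Cheju dialect. Previous phonetic and phonological studies of the Cheju dialect have been limited almost entirely to segments. The intonation patterns were analyzed using the tone labels of the K-ToBI labeling system. As in Seoul Korean and the Chonnam dialect, the prosodic structure of the Cheju dialect consists of two levels: the intonational phrase and, below it, the accentual phrase. The study covers two main topics: the types of intonational-phrase boundary tones and the tones of accentual phrases. As in Seoul Korean, the boundary tones are based on $L\%,\;H\%$, with types such as $HL\%,\;LHL\%,\;HLHL\%,\;LHLHL\%,\;LH\%,\;HLH\%,\;LHLH\%,\;HLTLH\%$ also observed, along with types found only in the Cheju dialect. The tonal patterns of accentual phrases were examined with the number of syllables and the position of the accentual phrase within the intonational phrase as variables. The basic accentual-phrase tone of the Cheju dialect is 'LH', a pattern in which 'H' is realized on the final syllable. As the number of syllables increases, a very gentle pitch rise appears on the penultimate syllable, but it was not appropriate to describe it as 'H'. As in the Seoul dialect, the domain of voicing was the accentual phrase. When a strong consonant occurred at the beginning of an accentual phrase, the phrase began with an 'H' tone; depending on the speaker, the 'H' tone was either realized only on the first syllable and fell immediately, or was sustained through the second syllable.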
PDF

Phonetic Tied-Mixture Syllable Model for CSR (연속 음성 인식을 위한 PTM 음절 모델)

Kim Bong-Wan;Lee Yong-Ju
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.33-36
- /
- 2004
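Recently, efforts to use the syllable as the recognition unit to improve performance in continuous speech recognition have been reported. However, syllables are less trainable than phonemes, and because the number of models is large, context-dependent modeling at syllable boundaries is difficult. To overcome these drawbacks, this paper proposes a method of synthesizing syllable models from monophones and triphones. The proposed model improves recognition speed by an average of $55\%$ over triphones and by an average of $13\%$ over PTM, and at equal speed it improves ERR by about $8\%$ relative to both the PTM and triphone models.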
PDF

The Syllable Type and Token Frequency Effect in Naming Task (명명 과제에서 음절 토큰 및 타입 빈도 효과)

Kwon, Youan
- Korean Journal of Cognitive Science
- /
- v.25 no.2
- /
- pp.91-107
- /
- 2014
The syllable frequency effect is defined as an inhibitory effect in which words starting with a high-frequency syllable produce longer lexical decision latencies and higher error rates than words starting with a low-frequency syllable. Researchers agree that the inhibitory effect arises from interference, at the lexical level, from syllable neighbors sharing the target's first syllable, and that the degree of interference correlates with the number of syllable neighbors or with the presence of stronger syllable neighbors that have a higher word frequency. However, although syllable frequency can be divided into syllable type frequency and syllable token frequency, previous studies in visual word recognition have used syllable frequency without this distinction. Recently, Conrad, Carreiras, & Jacobs (2008) demonstrated that syllable type frequency might reflect a sub-lexical processing level, including the mapping from letters to syllables, and that syllable token frequency might reflect competition between a target and higher-frequency syllable neighbors at the whole-word lexical processing level. The present study therefore investigated their proposals using word naming tasks. Word naming tasks are generally more sensitive to sub-lexical processing, so the present study expected a facilitative effect of high syllable type frequency and a null effect of high syllable token frequency. In Experiment 1, words starting with a syllable of high type frequency were named faster than words starting with a syllable of low type frequency, with syllable token frequency held constant. In Experiment 2, high syllable token frequency also produced shorter naming times than low syllable token frequency, with syllable type frequency held constant. For that reason, we rejected the proposal of Conrad et al. and suggested that both syllable type and token frequency could relate to sub-lexical processing.
PDF KSCI

Analysis of the durational characteristics of monosyllabic interjections in Natural spoken language (자연발화상에 나타난 단음절 단일간투사의 길이특성 분석)

김기호
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1994.06c
- /
- pp.95-98
- /
- 1994
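The aim of this study is to aid spoken language recognition by analyzing duration, the most clearly distinguishing phonetic characteristic of interjections, which occur in natural speech and hinder spoken language recognition. The study is limited to monosyllabic single interjections, which account for most interjections, and analyzes the frequency of interjections in recordings of real conversations and their durational characteristics by social status, gender, and interjection type. Interjections were also divided by position into phonological-phrase-initial and phonological-phrase-final interjections, and their durations were compared with the average syllable duration and with the duration of phrase-initial or phrase-final syllables to measure the lengthening rate of interjections. The most frequent monosyllabic single interjection was '어'; the lengthening rate was greatest for '그' relative to the syllable average and for '응' relative to the phrase-initial average. Overall, the lengthening rate relative to phrase-initial syllable duration was greater than the lengthening rate relative to average syllable duration. These results make it possible to distinguish interjections that can be removed at a lower level from those that must be handled at a higher level requiring syntactic or semantic analysis. If research progresses on the other phonetic characteristics of interjections, and on polysyllabic single interjections and double interjections, it should become possible to effectively exclude the interjections that hinder spoken language recognition.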
PDF
