A Study of Fundamental Frequency for Focused Word Spotting in Spoken Korean

Kwon, Soon-Il;Park, Ji-Hyung;Park, Neung-Soo;

doi:10.3745/KIPSTB.2008.15-B.6.595

The KIPS Transactions:PartB (정보처리학회논문지B)

Volume 15B, Issue 6 / Pages 595-602 / 2008 / 1598-284X (pISSN)

Korea Information Processing Society (한국정보처리학회)

한국어 발화음성에서 중점단어 탐색을 위한 기본주파수에 대한 연구

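Kwon, Soon-Il (Intelligent Interaction Research Center, Korea Institute of Science and Technology);
Park, Ji-Hyung (HCI and Robotics, University of Science and Technology);
Park, Neung-Soo (Division of Computer Engineering, College of Information and Communication, Konkuk University)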

Published : 2008.12.31

https://doi.org/10.3745/KIPSTB.2008.15-B.6.595

Abstract

The focused word of a sentence helps in recognizing and understanding spoken Korean. As part of an effort to find a method for spotting focused words in a speech signal, we analyzed the mean and variance of the fundamental frequency (F0) and the average energy of the focused word and of the other words in each sentence, using speech data from 100 spoken Korean sentences. The results showed that, in most cases, focused words have either a higher relative average F0 or a higher relative variance of F0 than the other words. These findings can contribute to characterizing the prosody of spoken Korean and to keyword extraction based on natural language processing.
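The comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not define "relative", so the sketch assumes it means a word's statistic divided by the average of that statistic over the other words in the sentence. The decision rule (take the word with the highest relative mean F0 and the word with the highest relative F0 variance), the function names, and the toy F0 and energy values are all hypothetical and chosen only for illustration.

```python
from statistics import mean, pvariance


def word_stats(f0_hz, energy):
    """Return (mean F0, F0 variance, mean energy) for one word's voiced frames."""
    return mean(f0_hz), pvariance(f0_hz), mean(energy)


def relative_stats(words):
    """Divide each word's statistics by the average over the other words.

    `words` maps a word label to (f0_hz, energy), two lists of frame values.
    Assumption: "relative" is a ratio against the other words in the sentence.
    Needs at least two words, and the other words' averages must be non-zero.
    """
    stats = {w: word_stats(f0, en) for w, (f0, en) in words.items()}
    rel = {}
    for w, s in stats.items():
        others = [t for v, t in stats.items() if v != w]
        rel[w] = tuple(s[i] / mean(o[i] for o in others) for i in range(3))
    return rel


def focus_candidates(words):
    """Hypothetical rule: words with the top relative mean F0 or F0 variance."""
    rel = relative_stats(words)
    top_mean = max(rel, key=lambda w: rel[w][0])
    top_var = max(rel, key=lambda w: rel[w][1])
    return {top_mean, top_var}


# Toy data (invented): per-word F0 in Hz and frame energy in arbitrary units.
sentence = {
    "w1": ([120.0, 122.0, 118.0, 121.0], [0.5, 0.6, 0.5, 0.5]),
    "w2": ([150.0, 170.0, 140.0, 165.0], [0.7, 0.8, 0.7, 0.6]),
    "w3": ([115.0, 117.0, 116.0, 114.0], [0.4, 0.5, 0.4, 0.4]),
}
print(focus_candidates(sentence))
```

In this toy sentence, "w2" has both the highest mean F0 and the highest F0 variance, so it is the only candidate. In real use the F0 and energy frames would come from a pitch tracker and a word-level segmentation, which this sketch does not cover.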
