통합 검색 | Korea Science

한국어 연속음성 인식을 위한 발음열 자동 생성 (Automatic Generation of Pronunciation Variants for Korean Continuous Speech Recognition)

이경님;전재훈;정민화
- 한국음향학회지
- /
- 제20권2호
- /
- pp.35-43
- /
- 2001
음성 인식이나 음성 합성시 필요한 발음열을 수작업으로 작성할 경우 작성자의 음운변화 현상에 대한 전문적 언어지식을 비롯하여 많은 시간과 노력이 요구되며 일관성을 유지하기도 쉽지 않다. 또한 한국어의 음운 변화 현상은 단일 형태소의 내부와 복합어에서 결합된 형태소의 경계점, 여러 형태소가 결합해서 한 어절을 이룰 경우 그 어절 내부의 형태소의 경계점, 여러 어절이 한 어절을 이룰 때 구성 어절의 경계점에서 서로 다른 적용 양상을 보인다. 본 논문에서는 이러한 문제를 해결하기 위해서 형태음운론적 분석에 기반하여 문자열을 자동으로 발음열로 변환하는 발음 생성 시스템을 제안하였다. 이 시스템은 한국어에서 빈번하게 발생하는 음운변화 현상의 분석을 통해 정의된 음소 변동 규칙과 변이음 규칙을 다단계로 적용하여 가능한 모든 발음열을 생성한다. 각 음운변화 규칙을 포함하는 대표적인 언절 리스트를 이용하여 구성된 시스템의 안정성을 검증하였고, 발음사전 구성과 학습용 발음열의 유용성을 인식 실험을 통해 평가하였다. 그 결과 표제어 사이의 음운변화 현상을 반영한 발음사전의 경우 5-6％ 정도 나은 단어 인식률을 얻었으며, 생성된 발음열을 학습에 사용한 경우에서도 향상된 결과를 얻을 수 있었다.
PDF

통신언어의 모음변이와 음성학적 유사성 (Vowel Variation in PC Communication Language and Phonetic Similarity)

지윤주;김일규
- 말소리와 음성과학
- /
- 제7권1호
- /
- pp.133-138
- /
- 2015
The purpose of this study is to provide deeper understanding of how it is possible for people to understand PC communication language they have never seen or heard before without any problem. In order to answer this question, we focused on the vowel variation through which new variants are created (for PC communication), and hypothesized that there is a phonetic constraint which requires the vowel of the variant to be phonetically similar (to the maximum) to the vowel of the original word. Through the corpus analysis of the dictionary of PC communication language, we show that our hypothesis is justified by the fact that most of the variants we collected from the dictionary, that is, 90% of them, conformed to the phonetic constraint we postulated.
https://doi.org/10.13064/KSSS.2015.7.1.133 인용 PDF KSCI

Age and gender differences in the spectral characteristics of Korean sibilants

Kong, Eun Jong;Kang, Jieun
- 말소리와 음성과학
- /
- 제13권1호
- /
- pp.37-44
- /
- 2021
While recent acoustic studies have reported associations of fronted sibilants (fricatives /s s⁎/ and affricates /tɕ tɕ⁎/) with gender in Seoul Korean, there have not been any studies examining the relationship of the variants with adult speakers' ages. The current study analyzes sibilant productions from 39 adult speakers born between 1942 and 2008 (19 females) in terms of spectral peak frequencies (SPFs) in frication, an acoustic index of place of articulation (POA). The results indicate some phonetic contexts where higher sibilant SPFs, i.e., fronter POAs, are associated with younger adults and those fronted variants are realized in a gender-differentiated manner -- tense affricates and word-initial tense fricatives before /i/ in the females' productions, and word-medial tense fricatives before /a/ in the males' productions. The findings confirm that the distributions of the fronted sibilants are accounted for not only by the speakers' gender but also by their ages, indicating that the fronted variants are innovative forms of realizing sibilants in Seoul Korean. In addition, the current results convincingly show that the fronted sibilant variants are not mere reflections of individuals' physiological differences since they are not observed across all of the examined phonetic contexts.
https://doi.org/10.13064/KSSS.2021.13.1.037 인용 PDF KSCI

발음 변이의 발음사전 포함 결정 조건을 통한 발음사전 최적화 (Pronunciation Lexicon Optimization with Applying Variant Selection Criteria)

전재훈;정민화
- 대한음성학회:학술대회논문집
- /
- 대한음성학회 2006년도 추계학술대회 발표논문집
- /
- pp.24-27
- /
- 2006
This paper describes how a domain dependent pronunciation lexicon is generated and optimized for Korean large vocabulary continuous speech recognition(LVCSR). At the level of lexicon, pronunciation variations are usually modeled by adding pronunciation variants to the lexicon. We propose the criteria for selecting appropriate pronunciation variants in lexicon: (i) likelihood and (ii) frequency factors to select variants. Our experiment is conducted in three steps. First, the variants are generated with knowledge-based rules. Second, we generate a domain dependent lexicon which includes various numbers of pronunciation variants based on the proposed criteria. Finally, the WERs and RTFs are examined with each lexicon. In the experiment, 0.72% WER reduction is obtained by introducing the variants pruning criteria. Furthermore, RTF is not deteriorated although the average number of variants is higher than that of compared lexica.
PDF

음소변동규칙의 적합도 조정을 통한 연속음성인식 성능향상 (Improving the Performance of the Continuous Speech Recognition by Estimating Likelihoods of the Phonetic Rules)

나민수;정민화
- 대한음성학회:학술대회논문집
- /
- 대한음성학회 2006년도 추계학술대회 발표논문집
- /
- pp.80-83
- /
- 2006
The purpose of this paper is to build a pronunciation lexicon with estimated likelihoods of the phonetic rules based on the phonetic realizations and therefore to improve the performance of CSR using the dictionary. In the baseline system, the phonetic rules and their application probabilities are defined with the knowledge of Korean phonology and experimental tuning. The advantage of this approach is to implement the phonetic rules easily and to get stable results on general domains. However, a possible drawback of this method is that it is hard to reflect characteristics of the phonetic realizations on a specific domain. In order to make the system reflect phonetic realizations, the likelihood of phonetic rules is reestimated based on the statistics of the realized phonemes using a forced-alignment method. In our experiment, we generates new lexica which include pronunciation variants created by reestimated phonetic rules and its performance is tested with 12 Gaussian mixture HMMs and back-off bigrams. The proposed method reduced the WER by 0.42%.
PDF

Parallel sound change between segmental and suprasegmental properties: An individual level observation

Lee, Hyunjung
- 말소리와 음성과학
- /
- 제8권4호
- /
- pp.23-29
- /
- 2016
The present study tested if individual speakers showing great sound change in segments (i.e., vowels and fricatives) also had innovative changing patterns in suprasegmental properties (i.e., lexical pitch accents) in Kyungsang Korean. The acoustic analysis at a group level first confirmed the presence of group level differences in distinguishing /ɨ-ʌ/ and /s-s'/ both of which had different phonemic distinction from Seoul Korean. Younger speakers had more innovative segmental change than older speakers, and even within the younger generation, female speakers produced more innovative phonetic variants than male speakers. Regarding the individual observation within the younger group, the younger speakers with large acoustic distinction in vowels and fricatives also showed acoustically less distinct accent patterns, indicating the innovative sound change pattern consistent across segment and suprasegmental properties. The group and individual observations suggested that linguistic innovators introduced new phonetic variants with consistent degree of changing pattern between segment and suprasegmental properties.
https://doi.org/10.13064/KSSS.2016.8.4.023 인용 PDF KSCI

서울말 어간말 자음의 음성 실현 (The Phonetic Realization of Syllable Codas in Korean)

강은지;이호영;김주원
- 대한음성학회지:말소리
- /
- 제49호
- /
- pp.1-30
- /
- 2004
Although Standard Korean is based on Seoul Korean, the phonetic realization of syllable codas in Seoul Korean has not been satisfactorily investigated. This paper aims to study how Seoul speakers pronounce syllable codas in certain phonetic contexts and what pronunciation they prefer among variants. It is noted that the realization of a syllable coda is different word by word and generation by generation. It is also noted that the syllable coda of a word is realized differently depending on the following vowel. And we discussed how the Principles of Standard Korean Pronunciation should be revised in the future, based on the results of this study.
PDF

Considering Dynamic Non-Segmental Phonetics

Fujino, Yoshinari
- 대한음성학회:학술대회논문집
- /
- 대한음성학회 2000년도 7월 학술대회지
- /
- pp.312-320
- /
- 2000
This presentation aims to explore some possibility of non-segmental phonetics usually ignored in phonetics education. In pedagogical phonetics, especially ESL/EFL oriented phonetics speech sounds tend to be classified in two criteria 1) 'pronunciation' which deals with segments and 2) 'prosody' or 'suprasegmentals', a criterion that deals with non-segmental elements such as stress and intonation. However, speech involves more dynamic processing. It is non-linear and multi-dimensional in spite of the linear sequence of symbols in phonetic/phonological transcriptions. No word is without pitch or voice quality apart from segmental characteristics whether it is spoken in isolation or cut out from continuous speech. This simply tells the dichotomy of pronunciation and prosody is merely a useful convention. There exists some room to consider dynamic non-segmental phonetics. Examples of non-segmental phonetic investigation, some of the analyses conducted within the frame of Firthian Prosodic Analysis, especially of the relation between vowel variants and foot types, are examined and we see what kind of auditory phonetic training is required to understand impressionistic transcriptions which lie behind the non-segmental phonetics.
PDF

발음열 자동 변환을 이용한 한국어 음운 변화 규칙의 통계적 분석 (Statistical Analysis of Korean Phonological Rules Using a Automatic Phonetic Transcription)

이경님;정민화
- 대한음성학회:학술대회논문집
- /
- 대한음성학회 2002년도 11월 학술대회지
- /
- pp.81-85
- /
- 2002
We present a statistical analysis of Korean phonological variations using automatic generation of phonetic transcription. We have constructed the automatic generation system of Korean pronunciation variants by applying rules modeling obligatory and optional phonemic changes and allophonic changes. These rules are derived from knowledge-based morphophonological analysis and government standard pronunciation rules. This system is optimized for continuous speech recognition by generating phonetic transcriptions for training and constructing a pronunciation dictionary for recognition. In this paper, we describe Korean phonological variations by analyzing the statistics of phonemic change rule applications for the 60,000 sentences in the Samsung PBS(Phonetic Balanced Sentence) Speech DB. Our results show that the most frequently happening obligatory phonemic variations are in the order of liaison, tensification, aspirationalization, and nasalization of obstruent, and that the most frequently happening optional phonemic variations are in the order of initial consonant h-deletion, insertion of final consonant with the same place of articulation as the next consonants, and deletion of final consonant with the same place of articulation as the next consonants. These statistics can be used for improving the performance of speech recognition systems.
PDF

이상은(李商隱) 시(詩) 구주(舊注) 중에 나타난 시어(詩語)의 음의관계(音義關係) 연구(硏究) (A Phonetic and Semantic Analysis on the Annotations of Li ShangYin (李商隱)'s Poetry)

염재웅
- 비교문화연구
- /
- 제52권
- /
- pp.341-369
- /
- 2018
이상은(李商隱)은 만당(晩唐)시기를 대표하는 시인으로 590여수의 시를 남겼다. 본 논문에서는 이상은(李商隱) 시(詩)에 대한 역대 학자들의 주석(注釋)을 통하여 시어(詩語) 속에 담긴 다양한 음의관계(音義關係)와 특징을 탐색했다. 그 결과 "시어(詩語)의 음의관계(音義關係)를 설명(說明)한 용례" 12개와 "시어(詩語)의 특징(特徵) 및 운율(韻律)을 설명(說明)한 용례" 5개의 핵심적인 용례를 찾아냈다. 특히 "시어(詩語)의 음의관계(音義關係)를 설명(說明)한 용례"를 분석해보니 이상은(李商隱) 시어(詩語)의 주석(注釋)과 고대(古代) 중국어의 음의관계가 일치하는 유형과 그렇지 않은 유형으로 분류되었다. 본 연구에서는 각 유형에 대한 세부 분석을 위해서 시율(詩律)의 평측(平仄)을 적용했다.

검색결과 23건 처리시간 0.019초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)