• Title/Abstract/Keywords: Phonetic Variants

한국어 연속음성 인식을 위한 발음열 자동 생성 (Automatic Generation of Pronunciation Variants for Korean Continuous Speech Recognition)

    • The Journal of the Acoustical Society of Korea (한국음향학회지)
    • Vol. 20, No. 2
    • pp.35-43
    • 2001
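  • Manually writing the pronunciation sequences needed for speech recognition or speech synthesis demands expert linguistic knowledge of phonological change from the transcriber, as well as considerable time and effort, and consistency is hard to maintain. Moreover, Korean phonological changes apply differently in four places: inside a single morpheme, at the morpheme boundaries of a compound word, at the morpheme boundaries inside an eojeol (word phrase) built from several morphemes, and at the boundaries between constituent eojeols when several eojeols are joined into one. To address these problems, this paper proposes a pronunciation generation system that automatically converts text strings into pronunciation sequences on the basis of morphophonological analysis. The system applies, in multiple stages, phonemic change rules and allophonic rules defined through an analysis of frequently occurring Korean phonological changes, and generates all possible pronunciation sequences. The stability of the system was verified with a list of representative eojeols covering each phonological change rule, and the usefulness of the resulting pronunciation dictionary and training transcriptions was evaluated through recognition experiments. A pronunciation dictionary reflecting the phonological changes between headwords gave a word recognition rate about 5-6% higher, and using the generated pronunciation sequences for training also improved the results.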
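
The multi-stage rule application described in this abstract can be illustrated with a minimal Python sketch. It assumes a toy romanized phone inventory; the two rules, the `rewrite` and `generate_variants` helpers, and the sample words are hypothetical illustrations, not the authors' rule set or implementation.

```python
from typing import List, Set, Tuple

Phones = Tuple[str, ...]
# A rule is (pattern, replacement, optional). All rules below are toy assumptions.
Rule = Tuple[Phones, Phones, bool]


def rewrite(phones: Phones, pattern: Phones, replacement: Phones) -> Phones:
    """Replace every non-overlapping occurrence of pattern, scanning left to right."""
    if not pattern:
        return phones
    out: List[str] = []
    i, n = 0, len(pattern)
    while i < len(phones):
        if phones[i:i + n] == pattern:
            out.extend(replacement)
            i += n
        else:
            out.append(phones[i])
            i += 1
    return tuple(out)


def generate_variants(phones: Phones, stages: List[List[Rule]]) -> Set[Phones]:
    """Apply each stage in order; an optional rule keeps both its input and its output."""
    variants: Set[Phones] = {phones}
    for stage in stages:
        for pattern, replacement, optional in stage:
            next_variants: Set[Phones] = set()
            for v in variants:
                next_variants.add(rewrite(v, pattern, replacement))
                if optional:
                    next_variants.add(v)
            variants = next_variants
    return variants


# Hypothetical stages: phonemic change rules first, then allophonic rules (left empty).
STAGES: List[List[Rule]] = [
    [
        (("k", "m"), ("ng", "m"), False),  # obligatory nasalization (toy rule)
        (("n", "h"), ("n",), True),        # optional h-deletion (toy rule)
    ],
    [],
]

if __name__ == "__main__":
    for word in ("k u k m u l", "c eo n h w a"):
        for variant in sorted(generate_variants(tuple(word.split()), STAGES)):
            print(word, "->", " ".join(variant))
```

Keeping the variants in a set means that an optional rule which does not match a given input adds no duplicate entry, so the output size reflects only pronunciations that are actually distinct.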

통신언어의 모음변이와 음성학적 유사성 (Vowel Variation in PC Communication Language and Phonetic Similarity)

  • 지윤주;김일규
    • Phonetics and Speech Sciences (말소리와 음성과학)
    • Vol. 7, No. 1
    • pp.133-138
    • 2015
  • This study seeks to explain how people can readily understand PC communication language that they have never seen or heard before. To answer this question, we focus on the vowel variation through which new variants are created for PC communication, and hypothesize a phonetic constraint requiring the vowel of the variant to be maximally similar, phonetically, to the vowel of the original word. A corpus analysis of the dictionary of PC communication language supports the hypothesis: 90% of the variants collected from the dictionary conform to the postulated constraint.

Age and gender differences in the spectral characteristics of Korean sibilants

  • Kong, Eun Jong;Kang, Jieun
    • Phonetics and Speech Sciences (말소리와 음성과학)
    • Vol. 13, No. 1
    • pp.37-44
    • 2021
  • While recent acoustic studies have reported associations of fronted sibilants (fricatives /s s⁎/ and affricates /tɕ tɕ⁎/) with gender in Seoul Korean, there have not been any studies examining the relationship of the variants with adult speakers' ages. The current study analyzes sibilant productions from 39 adult speakers born between 1942 and 2008 (19 females) in terms of spectral peak frequencies (SPFs) in frication, an acoustic index of place of articulation (POA). The results indicate some phonetic contexts where higher sibilant SPFs, i.e., fronter POAs, are associated with younger adults and those fronted variants are realized in a gender-differentiated manner -- tense affricates and word-initial tense fricatives before /i/ in the females' productions, and word-medial tense fricatives before /a/ in the males' productions. The findings confirm that the distributions of the fronted sibilants are accounted for not only by the speakers' gender but also by their ages, indicating that the fronted variants are innovative forms of realizing sibilants in Seoul Korean. In addition, the current results convincingly show that the fronted sibilant variants are not mere reflections of individuals' physiological differences since they are not observed across all of the examined phonetic contexts.

발음 변이의 발음사전 포함 결정 조건을 통한 발음사전 최적화 (Pronunciation Lexicon Optimization with Applying Variant Selection Criteria)

  • 전재훈;정민화
    • Proceedings of the KSPS (대한음성학회) Conference
    • KSPS 2006 Fall Conference Proceedings
    • pp.24-27
    • 2006
  • This paper describes how a domain-dependent pronunciation lexicon is generated and optimized for Korean large vocabulary continuous speech recognition (LVCSR). At the lexicon level, pronunciation variation is usually modeled by adding pronunciation variants to the lexicon. We propose two criteria for selecting appropriate pronunciation variants for the lexicon: (i) likelihood and (ii) frequency. Our experiment is conducted in three steps. First, the variants are generated with knowledge-based rules. Second, we generate domain-dependent lexica that include various numbers of pronunciation variants based on the proposed criteria. Finally, the WERs and RTFs are examined for each lexicon. In the experiment, a 0.72% WER reduction is obtained by introducing the variant-pruning criteria. Furthermore, the RTF does not deteriorate even though the average number of variants is higher than in the lexica used for comparison.
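
One way to picture variant selection by likelihood and frequency is the Python sketch below. The abstract does not say how the two criteria are combined, so the `Variant` fields, the thresholds, the "both criteria must pass" policy, and the fallback to the most likely variant are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Variant:
    phones: str        # space-separated phone symbols
    likelihood: float  # hypothetical rule-derived score in [0, 1]
    count: int         # hypothetical number of occurrences in domain data


def prune_lexicon(
    lexicon: Dict[str, List[Variant]],
    min_likelihood: float = 0.1,   # assumed threshold
    min_rel_freq: float = 0.05,    # assumed threshold
) -> Dict[str, List[Variant]]:
    """Keep variants that pass both criteria; never leave a word without a variant."""
    pruned: Dict[str, List[Variant]] = {}
    for word, variants in lexicon.items():
        if not variants:
            pruned[word] = []
            continue
        total = sum(v.count for v in variants)
        kept = [
            v for v in variants
            if v.likelihood >= min_likelihood
            and (total == 0 or v.count / total >= min_rel_freq)
        ]
        if not kept:
            # Fallback (assumption): retain the single most likely variant.
            kept = [max(variants, key=lambda v: v.likelihood)]
        pruned[word] = kept
    return pruned


if __name__ == "__main__":
    toy = {
        "word": [
            Variant("a", 0.90, 90),
            Variant("b", 0.30, 8),
            Variant("c", 0.05, 2),
        ]
    }
    for word, variants in prune_lexicon(toy).items():
        print(word, [v.phones for v in variants])
```

Raising either threshold shrinks the lexicon, which is the trade-off between coverage and confusability that a WER and RTF comparison across lexica would measure.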

음소변동규칙의 적합도 조정을 통한 연속음성인식 성능향상 (Improving the Performance of the Continuous Speech Recognition by Estimating Likelihoods of the Phonetic Rules)

  • 나민수;정민화
    • Proceedings of the KSPS (대한음성학회) Conference
    • KSPS 2006 Fall Conference Proceedings
    • pp.80-83
    • 2006
  • The purpose of this paper is to build a pronunciation lexicon in which the likelihoods of the phonetic rules are estimated from observed phonetic realizations, and thereby to improve the performance of continuous speech recognition (CSR) using that lexicon. In the baseline system, the phonetic rules and their application probabilities are defined from knowledge of Korean phonology and experimental tuning. The advantage of this approach is that the phonetic rules are easy to implement and give stable results on general domains. A possible drawback is that it is hard to reflect the characteristics of phonetic realizations in a specific domain. To make the system reflect phonetic realizations, the likelihoods of the phonetic rules are re-estimated from statistics of the realized phonemes obtained with a forced-alignment method. In our experiment, we generate new lexica that include pronunciation variants created by the re-estimated phonetic rules, and their performance is tested with 12-Gaussian-mixture HMMs and back-off bigrams. The proposed method reduced the WER by 0.42%.
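
The re-estimation step can be sketched in Python as a relative-frequency estimate smoothed toward the knowledge-based prior. This is an illustration only: the forced alignment itself is not performed here, the `(rule, applied)` observation format, the rule names, and the `prior_weight` smoothing are assumptions, and the abstract does not state the exact estimator the authors used.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple


def reestimate_rule_probs(
    observations: Iterable[Tuple[str, bool]],
    prior: Dict[str, float],
    prior_weight: float = 1.0,  # assumed smoothing strength, in pseudo-observations
) -> Dict[str, float]:
    """Blend each rule's prior probability with its observed application rate.

    Each observation is (rule_name, applied): one context where the rule could
    apply, and whether the aligned realization shows that it did apply.
    """
    applicable: Counter = Counter()
    applied: Counter = Counter()
    for rule, was_applied in observations:
        applicable[rule] += 1
        if was_applied:
            applied[rule] += 1
    return {
        rule: (applied[rule] + prior_weight * p) / (applicable[rule] + prior_weight)
        for rule, p in prior.items()
    }


if __name__ == "__main__":
    # Hypothetical rule names and priors.
    prior = {"h_deletion": 0.5, "coda_insertion": 0.2}
    observed = [
        ("h_deletion", True),
        ("h_deletion", True),
        ("h_deletion", True),
        ("h_deletion", False),
    ]
    for rule, prob in reestimate_rule_probs(observed, prior).items():
        print(f"{rule}: {prob:.3f}")
```

A rule with no observations in the domain data keeps its prior value, so the knowledge-based baseline is only overridden where alignment evidence exists.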

Parallel sound change between segmental and suprasegmental properties: An individual level observation

  • Lee, Hyunjung
    • Phonetics and Speech Sciences (말소리와 음성과학)
    • Vol. 8, No. 4
    • pp.23-29
    • 2016
  • The present study tested whether individual speakers of Kyungsang Korean who show a large sound change in segments (i.e., vowels and fricatives) also show innovative patterns of change in suprasegmental properties (i.e., lexical pitch accents). An acoustic analysis first confirmed group-level differences in distinguishing /ɨ-ʌ/ and /s-s'/, two contrasts whose phonemic status differs from that in Seoul Korean. Younger speakers showed more innovative segmental change than older speakers, and even within the younger generation, female speakers produced more innovative phonetic variants than male speakers. At the individual level within the younger group, speakers with a large acoustic distinction in vowels and fricatives also showed acoustically less distinct accent patterns, indicating a pattern of innovative sound change that is consistent across segmental and suprasegmental properties. The group and individual observations suggest that linguistic innovators introduce new phonetic variants with a consistent degree of change across segmental and suprasegmental properties.

서울말 어간말 자음의 음성 실현 (The Phonetic Realization of Syllable Codas in Korean)

  • 강은지;이호영;김주원
    • MALSORI, Journal of the KSPS (대한음성학회지:말소리)
    • 2004
  • Although Standard Korean is based on Seoul Korean, the phonetic realization of syllable codas in Seoul Korean has not been satisfactorily investigated. This paper studies how Seoul speakers pronounce syllable codas in certain phonetic contexts and which pronunciation they prefer among the variants. The realization of a syllable coda differs from word to word and from generation to generation, and the coda of a given word is realized differently depending on the following vowel. Based on these results, we also discuss how the Principles of Standard Korean Pronunciation should be revised in the future.

Considering Dynamic Non-Segmental Phonetics

  • Fujino, Yoshinari
    • Proceedings of the KSPS (대한음성학회) Conference
    • KSPS July 2000 Conference Proceedings
    • pp.312-320
    • 2000
  • This presentation explores the possibilities of non-segmental phonetics, which is usually ignored in phonetics education. In pedagogical phonetics, especially ESL/EFL-oriented phonetics, speech sounds tend to be classified under two headings: 1) 'pronunciation', which deals with segments, and 2) 'prosody' or 'suprasegmentals', which deals with non-segmental elements such as stress and intonation. However, speech involves more dynamic processing. It is non-linear and multi-dimensional despite the linear sequence of symbols in phonetic/phonological transcriptions. No word, whether spoken in isolation or cut out from continuous speech, is without pitch or voice quality in addition to its segmental characteristics. This shows that the dichotomy of pronunciation and prosody is merely a useful convention, and that there is room to consider dynamic non-segmental phonetics. We examine examples of non-segmental phonetic investigation, some of them analyses conducted within the framework of Firthian Prosodic Analysis, especially of the relation between vowel variants and foot types, and consider what kind of auditory phonetic training is required to understand the impressionistic transcriptions that lie behind non-segmental phonetics.

발음열 자동 변환을 이용한 한국어 음운 변화 규칙의 통계적 분석 (Statistical Analysis of Korean Phonological Rules Using a Automatic Phonetic Transcription)

  • 이경님;정민화
    • Proceedings of the KSPS (대한음성학회) Conference
    • KSPS November 2002 Conference Proceedings
    • pp.81-85
    • 2002
  • We present a statistical analysis of Korean phonological variations using automatically generated phonetic transcriptions. We have constructed a system that automatically generates Korean pronunciation variants by applying rules that model obligatory and optional phonemic changes and allophonic changes. These rules are derived from knowledge-based morphophonological analysis and the government's standard pronunciation rules. The system is optimized for continuous speech recognition: it generates phonetic transcriptions for training and constructs a pronunciation dictionary for recognition. In this paper, we describe Korean phonological variations by analyzing the statistics of phonemic change rule applications over the 60,000 sentences in the Samsung PBS (Phonetic Balanced Sentence) Speech DB. Our results show that the most frequent obligatory phonemic variations are, in order, liaison, tensification, aspiration, and obstruent nasalization, and that the most frequent optional phonemic variations are, in order, initial-consonant h-deletion, insertion of a final consonant with the same place of articulation as the following consonant, and deletion of a final consonant with the same place of articulation as the following consonant. These statistics can be used to improve the performance of speech recognition systems.

이상은(李商隱) 시(詩) 구주(舊注) 중에 나타난 시어(詩語)의 음의관계(音義關係) 연구(硏究) (A Phonetic and Semantic Analysis on the Annotations of Li ShangYin (李商隱)'s Poetry)

  • 염재웅
    • Cross-Cultural Studies (비교문화연구)
    • 2018
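  • Li Shangyin (李商隱), a representative poet of the Late Tang (晩唐) period, left some 590 poems. This paper explores the various sound-meaning relations (音義關係) and features of his poetic diction (詩語) through the annotations (注釋) that scholars of successive eras wrote on his poetry. We identified 12 key examples that explain the sound-meaning relations of poetic words and 5 key examples that explain the features and prosody (韻律) of poetic words. Analysis of the first group shows that the examples fall into two types: those in which the annotation on Li Shangyin's diction agrees with the sound-meaning relation in ancient Chinese, and those in which it does not. For a detailed analysis of each type, this study applied the level and oblique tone patterns (平仄) of poetic meter (詩律).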