• 제목/요약/키워드: traditional phonetics

검색결과 14건 처리시간 0.022초

우리말 소리갈(國語音聲學)에 대한 연구 - 주시경, 김두봉, 최현배, 이극로를 중심으로 - (A Historical Review of Sorigal (Korean Phonetics) in the early 20th Century)

  • 이숙희;고도흥
    • 음성과학
    • /
    • 제7권4호
    • /
    • pp.149-167
    • /
    • 2000
  • The purpose of this paper is to review the contribution made by some phoneticians including Si-gyung Chu, Du-bong Kim, Hyun-bae Choi, and Geuk-ro Lee in the early 20th century. It is known that the period can be characterized as the coexistence of traditional phonetics and modem phonetics. Si-gyung Chu well recognized the physical nature of speech sounds from the physiological point of view. Although Du-bong Kim adapted Chu's approach in some ways, he made some more detailed modifications in explaining the vocal organs. Hyun-bae Choi tried to explain the consonants and vowels systematically based on Western theories of phonetics. Finally, Geuk-ro Lee made the most significant contribution by introducing the experimental phonetics.

  • PDF

Mieko Han의 한국어 음성학 연구 (Mieko Han and her Works on Korean Phonetics)

  • 고도흥
    • 음성과학
    • /
    • 제1권
    • /
    • pp.213-223
    • /
    • 1997
  • This paper deals with a general review of Mieko S. Han, who made a significant contribution to the studies of Korean phonetics during the 1960' s and early 1970' s. As both a single and joint author, Dr. Han published important papers in both quantity and quality, which have been cited among Korean phoneticians until today. Before Dr. M. Han' s work, professor of USC in the department of East Asian Languages & Cultures, there were only a few phonetics-related publications in Korea, most of which are papers or books based on non-experimental traditional approach. It is known that there was coexistence between traditionalism and structuralism in the field of Korean linguistics. It was, however, fortunate that we had two important phoneticians (M. Han and Chin-W Kim) abroad at that time. Mieko Han' s concern was to investigate experimental characteristics of the system of Korean vowels and consonants using a Spectrograph, which was the single most important tool for analysing phonetic data at that time. Dr. Han conducted her experimental studies on Korean phonetics, mostly funded by the Office of Naval Research, in terms of duration, fundamental frequency, Voice Onset Time (VOT), intensity, and so on. This paper aims to re-appreciate Dr. Han's specific contribution to the study of Korean phonetics since she played an important role as a pioneer of early Korean phonetics. Further, it is highly recommended that Dr. Han's works can be extremely useful for a graduate student, who seriously would like to specialize in Korean phonetics in the first step.

  • PDF

한국어 대용량발화말뭉치의 단모음분석 (Monophthong Analysis on a Large-scale Speech Corpus of Read-Style Korean)

  • 윤태진;강윤정
    • 말소리와 음성과학
    • /
    • 제6권3호
    • /
    • pp.139-145
    • /
    • 2014
  • The paper describes methods of conducting vowel analysis from a large-scale corpus with the aids of forced alignment and optimal formant ceiling methods. 'Read Style Corpus of Standard Korean' is used for building the forced alignment system and a subset of the corpus for the processing and extraction of features for vowel analysis based on optimal formant ceiling. The results of the vowel analysis are reliable and comparable to the results obtained using traditional analytical methods. The findings indicate that the methods adopted for the analysis can be extended and be used for more fine-grained analysis without time-consuming manual labeling without losing accuracy and reliability.

심층신경망을 이용한 조음 예측 모형 개발 (Development of articulatory estimation model using deep neural network)

  • 유희조;양형원;강재구;조영선;황성하;홍연정;조예진;김서현;남호성
    • 말소리와 음성과학
    • /
    • 제8권3호
    • /
    • pp.31-38
    • /
    • 2016
  • Speech inversion (acoustic-to-articulatory mapping) is not a trivial problem, despite the importance, due to the highly non-linear and non-unique nature. This study aimed to investigate the performance of Deep Neural Network (DNN) compared to that of traditional Artificial Neural Network (ANN) to address the problem. The Wisconsin X-ray Microbeam Database was employed and the acoustic signal and articulatory pellet information were the input and output in the models. Results showed that the performance of ANN deteriorated as the number of hidden layers increased. In contrast, DNN showed lower and more stable RMS even up to 10 deep hidden layers, suggesting that DNN is capable of learning acoustic-articulatory inversion mapping more efficiently than ANN.

The acoustic realization of the Korean sibilant fricative contrast in Seoul and Daegu

  • Holliday, Jeffrey J.
    • 말소리와 음성과학
    • /
    • 제4권1호
    • /
    • pp.67-74
    • /
    • 2012
  • The neutralization of /$s^h$/ and /$s^*$/ in Gyeongsang dialects is a culturally salient stereotype that has received relatively little attention in the phonetic literature. The current study is a more extensive acoustic comparison of the sibilant fricative productions of Seoul and Gyeongsang dialect speakers. The data presented here suggest that, at least for young Seoul and Daegu speakers, there are few inter-dialectal differences in sibilant fricative production. These conclusions are supported by the output of mixed effects logistic regression models that used aspiration duration, spectral mean of the frication noise, and H1-H2 of the following vowel to predict fricative type in each dialect. The clearest dialect difference was that Daegu speakers' /$s^h$/ and /$s^*$/ productions had overall shorter aspiration durations than those of Seoul speakers, suggesting the opposite of the traditional "/$s^*$/ produced as [$s^h$]" stereotype of Gyeongsang dialects. Further work is needed to investigate whether /$s^h/-/s^*$/ neutralization in Daegu is perceptual rather than acoustic in nature.

음성을 이용한 사상체질 분류 알고리즘 (Automated Speech Analysis Applied to Sasang Constitution Classification)

  • 강재환;유종향;이혜정;김종열
    • 말소리와 음성과학
    • /
    • 제1권3호
    • /
    • pp.155-163
    • /
    • 2009
  • This paper introduces an automatic voice classification system for the diagnosis of individual constitution based on Sasang Constitutional Medicine (SCM) in Traditional Korean Medicine (TKM). For the developing of this algorithm, we used the voices of 473 speakers and extracted a total of 144 speech features from the speech data consisting of five sustained vowels and one sentence. The classification system, based on a rule-based algorithm that is derived from a non parametric statistical method, presents binary negative decisions. In conclusion, 55.7% of the speech data were diagnosed by this system, of which 72.8% were correct negative decisions.

  • PDF

Word-boundary and rate effects on upper and lower lip movements in the articulation of the bilabial stop /p/ in Korean

  • Son, Minjung
    • 말소리와 음성과학
    • /
    • 제10권1호
    • /
    • pp.23-31
    • /
    • 2018
  • In this study, we examined how the upper and lower lips articulate to produce labial /p/. Using electromagnetic midsagittal articulography, we collected flesh-point tracking movement data from eight native speakers of Seoul Korean (five females and three males). Individual articulatory movements in /p/ were examined in terms of minimum vertical upper lip position, maximum vertical lower lip position, and corresponding vertical upper lip position aligned with maximum vertical lower lip position. Using linear mixed-effect models, we tested two factors (word boundary [across-word vs. within-word] and speech rate [comfortable vs. fast]) and their interaction, considering subjects as random effects. The results are summarized as follows. First, maximum lower lip position varied with different word boundaries and speech rates, but no interaction was detected. In particular, maximum lower lip position was lower (e.g., less constricted or more reduced) in fast rate condition and across-word boundary condition. Second, minimum lower lip position, as well as lower lip position, measured at the time of maximum lower lip position only varied with different word boundaries, showing that they were consistently lower in across-word condition. We provide further empirical evidence of lower lip movement sensitive to both different word boundaries (e.g., linguistic factor) and speech rates (e.g., paralinguistic factor); this supports the traditional idea that the lower lip is an actively moving articulator. The sensitivity of upper lip movement is also observed with different word boundaries; this counters the traditional idea that the upper lip is the target area, which presupposes immobility. Taken together, the lip aperture gesture is a good indicator that takes into account upper and lower lip vertical movements, compared to the traditional approach that distinguishes a movable articulator from target place. Respective of different speech rates, the results of the present study patterned with cross-linguistic lenition-related allophonic variation, which is known to be more sensitive to fast rate.

입술 트릴의 방법에 따른 음향학적 및 전기성문파형검사 측정치 비교 (A comparison of acoustic & electroglottographic measures according to voiced lip trill methods)

  • 이승진;이광용;임재열;최홍식
    • 말소리와 음성과학
    • /
    • 제9권4호
    • /
    • pp.107-114
    • /
    • 2017
  • The purpose of the current study was to compare selected acoustic and electroglottographic measures (closed quotient, pitch, and loudness) among vowel phonation, traditional voiced lip trill ($VLT_T$), modified voiced lip trill methods ($VLT_M$). A total of 21 participants without voice complaints produced 4-second long samples using each phonation method. Results indicated that mean closed quotient of $VLT_M$ was higher than that of vowel phonation and $VLT_T$, while its range and standard deviation measures were higher than those of vowel phonation. Mean, range, standard deviation, maximum of pitch measures of $VLT_M$ were higher than those of vowel phonation. Lastly, mean and maximum loudness of the $VLT_M$ were higher than $VLT_T$. In conclusion, the current data indicate the possibility to use the $VLT_M$ as a training method for singing or a strategy to facilitate generalization effect of voice therapy. Current results also reflect the necessity for further study pertaining to the long-term effect of the $VLT_M$ training method. Clinical implications are discussed.

Neutralization of Vowels /ɨ/ and /u/ after a Labial Consonant in Korean: A Cross-generational Study

  • Kang, Hyunsook
    • 말소리와 음성과학
    • /
    • 제6권1호
    • /
    • pp.3-10
    • /
    • 2014
  • This study investigated whether Korean vowels, /ɨ/ and /u/, are distinctively perceived after a labial consonant given the fact that native and Sino-Korean nouns showed only vowel /u/ after a labial consonant while this pattern was massively broken by the recent introduction of loanwords. For this purpose, a perception experiment was conducted with $V_1C_1V_2$ sequences in which different vowels /a, i, u/ and consonants /p, t, k/ occurred in $V_1$ and $C_1$ before the target $V_2$, /ɨ/ and /u/. The data was produced by six speakers each from two different age groups, Age20 and Age40/50 in the read speech style. The results showed that consonant /p/ attracted significantly more responses of /u/ from /VCɨ/ sequences and significantly less responses of /u/ from /VCu/ sequence than the other consonants did in both age groups. Furthermore, Age20 group showed significantly less percentage of /u/ responses than Age40 group when the preceding consonant was /p/ regardless of the target vowel. We suggest therefore that unlike the traditional belief of labial assimilation, there is neutralization after a labial consonant in which vowels /ɨ/ and /u/ are often realized as any sound between two vowels, /ɨ/ and /u/. That is, this vowel change is not categorial but it rather produces an ambiguous stimulus which attracts different responses from different listeners. Ambiguous stimulus was produced due to coarticulatory efforts in speech production and perceptual compensation. We also argue that there is generational difference such that Age40/50 group speakers showed stronger tendency to produce /u/ after a labial consonant regardless of whether the target vowel was /ɨ/ or /u/.

음성 신호 분류에 따른 장애 음성의 변동률 분석, 비선형 동적 분석, 캡스트럼 분석의 유용성 (The Utility of Perturbation, Non-linear dynamic, and Cepstrum measures of dysphonia according to Signal Typing)

  • 최성희;최철희
    • 말소리와 음성과학
    • /
    • 제6권3호
    • /
    • pp.63-72
    • /
    • 2014
  • The current study assessed the utility of acoustic analyses the most commonly used in routine clinical voice assessment including perturbation, nonlinear dynamic analysis, and Spectral/Cepstrum analysis based on signal typing of dysphonic voices and investigated their applicability of clinical acoustic analysis methods. A total of 70 dysphonic voice samples were classified with signal typing using narrowband spectrogram. Traditional parameters of %jitter, %shimmer, and signal-to-noise ratio were calculated for the signals using TF32 and correlation dimension(D2) of nonlinear dynamic parameter and spectral/cepstral measures including mean CPP, CPP_sd, CPPf0, CPPf0_sd, L/H ratio, and L/H ratio_sd were also calculated with ADSV(Analysis of Dysphonia in Speech and VoiceTM). Auditory perceptual analysis was performed by two blinded speech-language pathologists with GRBAS. The results showed that nearly periodic Type 1 signals were all functional dysphonia and Type 4 signals were comprised of neurogenic and organic voice disorders. Only Type 1 voice signals were reliable for perturbation analysis in this study. Significant signal typing-related differences were found in all acoustic and auditory-perceptual measures. SNR, CPP, L/H ratio values for Type 4 were significantly lower than those of other voice signals and significant higher %jitter, %shimmer were observed in Type 4 voice signals(p<.001). Additionally, with increase of signal type, D2 values significantly increased and more complex and nonlinear patterns were represented. Nevertheless, voice signals with highly noise component associated with breathiness were not able to obtain D2. In particular, CPP, was highly sensitive with voice quality 'G', 'R', 'B' than any other acoustic measures. Thus, Spectral and cepstral analyses may be applied for more severe dysphonic voices such as Type 4 signals and CPP can be more accurate and predictive acoustic marker in measuring voice quality and severity in dysphonia.