• 제목/요약/키워드: 4 Formant Frequency

검색결과 72건 처리시간 0.022초

소프라노 1인의 모음곡 발성 시 제 1 포먼트의 변화양상 (The First Formant Characteristics in Vocalize of One Soprano)

  • 송윤경;진성민
    • 대한후두음성언어의학회지
    • /
    • 제16권1호
    • /
    • pp.10-14
    • /
    • 2005
  • Background and Objectives : Vowels are characterized on the basis of formant patterns. The first formant(F1) is determined by high-low placement of the tongue, and the second formant (F2) by front-back placement of the tongue. The fundamental frequency(F0) of a soprano often exceed the normal frequency of the first formant. And the vocal intensity is boosted when F0 is high and a harmonic coincides with a formant. This is called a formant tuning. Experienced singers thus learned how to tune their formants over a resonable range by lowering the tongue to maximize their vocal intensity. So, the current study aimed to identify the formant tuning in one experienced soprano by comparing the first formants of vowel [i] in three different voice production : speech, ascending scale, and vocalize. Materials and Method : All voices recordings of vowel [i] in speech, ascending scale (from F4 note to A4 note), and vocalize(:Ridente la calam") were made with digital audio tape-corder in a sound treated room. And the captured data were analyzed by the long term average(LTA) power spectrum using the FFT algorithm of the Computerized Speech Lab(CSL, Kay elementrics, Model, 4300B). Results : Although the first formant of vowel [i] in speech was 238Hz, those of ascending scale [i] were 377Hz, 405Hz, 453Hz respectively in F4(349z), G4(392Hz), A4(440Hz) note, and 722Hz, 820Hz, 918Hz respectively in F5 (698Hz), G5(784Hz), A5(880Hz) note. In vocalize, first formants of [i] were 380Hz, 398Hz, 453Hz respectively in F4, G4, A4 note, and 720Hz, 821Hz, 890Hz respectively in F5, G5, A5 note. Conclusion : These results showed that the first formant of ascending scale and vocalize sustained higher frequency than fundamental frequency in high pitch. This finding implicates that the formant tuning of vowel [i] in ascending scale was also noted in vocalize.

  • PDF

포만트 공간에서의 주파수 변환을 이용한 이중 언어 음성 변환 연구 (Bilingual Voice Conversion Using Frequency Warping on Formant Space)

  • 채의근;윤영선;정진만;은성배
    • 말소리와 음성과학
    • /
    • 제6권4호
    • /
    • pp.133-139
    • /
    • 2014
  • This paper describes several approaches to transform a speaker's individuality to another's individuality using frequency warping between bilingual formant frequencies on different language environments. The proposed methods are simple and intuitive voice conversion algorithms that do not use training data between different languages. The approaches find the warping function from source speaker's frequency to target speaker's frequency on formant space. The formant space comprises four representative monophthongs for each language. The warping functions can be represented by piecewise linear equations, inverse matrix. The used features are pure frequency components including magnitudes, phases, and line spectral frequencies (LSF). The experiments show that the LSF-based voice conversion methods give better performance than other methods.

한국어 비음의 음향학적 구분을 위한 장구간 스펙트럼(LTAS) 분석 (Long Term Average Spectral Analysis for Acoustical Discrimination of Korean Nasal Consonants)

  • 최순애;성철재
    • 대한음성학회지:말소리
    • /
    • 제60호
    • /
    • pp.67-84
    • /
    • 2006
  • The purpose of this study is to find some acoustic parameters on frequency domain to distinguish the Korean nasals, $/m,\;n,\;{\eta}/$ from each other. The new parameters are devised on the basis of LTAS (Long Term Average Spectrum). The maximum peak amplitude and the relevant formant frequency are measured in low and high frequency range, respectively. The frequency of spectral valley and its energy level are also obtained in the specific frequency range of the spectrum. Spectral slope, total energy value in specific frequency range, statistical distribution of spectral energy like centroid, skewness, and kurtosis are suggested as new parameters as well. The parameters that show statistically significant differences across nasals are summerized as follows. 1) in syllable initial positions: the total energy value from 1,500 to 2,200 Hz(zeroENG); 2) in syllable final positions: the peak amplitude of the first formant(peak1_a), the formant frequency with maximum peak amplitude from 4,000 to 8,000 Hz(peak2_f), the maximum peak amplitude of the formant frequency from 4,000 to 8,000 Hz(peak2_a), and the total energy value from 1,500 to 2,200 Hz(zeroENG).

  • PDF

후두위치의 변화에 따른 Singer's Formant와 성대접촉률의 변화 연구 (Analysis of Singer's Formant & Close Quotient During Change of the Larynx Position)

  • 남도현;최성희;최재남;전석필;최홍식
    • 대한후두음성언어의학회지
    • /
    • 제15권2호
    • /
    • pp.98-111
    • /
    • 2004
  • Background and Objectives : The purpose of this study is to analyze the difference of Fundamental Frequency(Hz), Closed Quotient(Qx ; %), Intensity(dB), Vocal tract length and width(cm), formant frequency(Hz), level of formant frequency(dB) depending on the larynx position. Materials and Methods : One professional male singer(career : 28 years) produced sustained vowel /a/,/e/,/i/,/o/,/u/ in two larynx position (higher, lower) with Dr. Speech and video fluoroscopy was used to quantify the vocal tract morphology. Results : In lower larynx position, CQ is increased 9.8% and Intensity is increased about 10% and level of Formant Frequency is increased. And also Vocal tract length is longer 2.4cm, Vocal tract width(Anterior width : 0.4cm, lateral width : 0.2cm) is wider than in higher larynx position. Conclusions : Singer's formant has a prominent spectrum envelope peak near 2400-2600Hz by clustering of F3, F4 and F5 near 3400Hz in lower larynx position.

  • PDF

실이에서 여자 음성 /ㅏ/, /ㅣ/, /ㅜ/의 포먼트 주파수 변화 (Formant frequency changes of female voice /a/, /i/, /u/ in real ear)

  • 허승덕;강희라
    • 말소리와 음성과학
    • /
    • 제9권1호
    • /
    • pp.49-53
    • /
    • 2017
  • Formant frequencies depend on the position of tongue, the shape of lips, and larynx. In the auditory system, the external ear canal is an open-end resonator, which can modify the voice characteristics. This study investigates the effect of the real ear on formant frequencies. Fifteen subjects ranging from 22 to 30 years of age participated in the study. This study employed three corner vowels: the low central vowel /a/, the high front vowel /i/, and the high back vowel /u/. For this study, the voice of a well-educated undergraduate who majored in speech-language pathology, was recorded with a high performance condenser microphone placed in the upper pinna and in the ear canal. Paired t-test showed that there were significant difference in the formant frequencies of F1, F2, F3, and F4 between the free field and the real ear. For /a/, all formant frequencies decreased significantly in the real ear. For /i/, F2 increased and F3 and F4 decreased. For /u/, F1 and F2 increased, but F3 and F4 decreased. It seems that these voice modifications in the real ear contribute to interpreting voice quality and understanding speech, timbre, and individual characteristics, which are influenced by the shape of the outer ear and external ear canal in such a way that formant frequencies become centralized in the vowel space.

정상 및 심질환 소아의 청진음 분석에 관한 연구 (A Study on Stethoscope Signal Analysis for Normal and Heart-diseased Children)

  • 김동준
    • 전기학회논문지
    • /
    • 제66권4호
    • /
    • pp.715-720
    • /
    • 2017
  • This study tries to analyze morphology and formant frequencies of linear prediction spectra of stethoscope sounds for heart diseased children. For this object, heart diseased stethoscope sounds were collected in the pediatrics of an university hospital. The collected signals were preprocessed and analyzed by the Burg algorithm, a kind of linear prediction analysis. The linear prediction spectra and the formant frequencies of the spectra for the stethoscope sounds for the normal and the diseased children are estimated and compared. The spectra showed outstanding differences in morphology and formant frequencies between the normal and the diseased children. Normal children showed relatively low frequency of F1(the first formant) and small negative slope from F1. VSD children revealed stiff slope change around F1 to F3. Spectra of ASD children is similar with the normal case, but have negative values of F3. F1-F2 difference of the functional murmur children were relatively large.

Praat을 이용한 아동 포먼트 자동 세팅 스크립트 구현 (The implementation of children's automated formant setting by Praat scripting)

  • 박지연;성철재
    • 말소리와 음성과학
    • /
    • 제10권4호
    • /
    • pp.1-10
    • /
    • 2018
  • This study introduces an automated Praat script allowing optimal formant analysis for children's vowels. Using Burg's algorithm in Praat, formants can be extracted by setting the maximum formant value and the number of formants. The optimal formant setting was determined by identifying the two conditions, F1 and F2, with minimum standard deviations. When applying the optimal formant setting determined by the script, the results of normality tests were not significant among all vowels except /e/ for the maximum formant value, and among the vowels /a/, /e/, /i/, /o/, /u/ and /ʌ/ for the number of formants. This indicates that when analyzing the formants of children's vowel sounds, the unilateral application of a parameter setting (the maximum formant value and the number of formants) to all vowels is problematic. The performance of the optimal formant setting script was evaluated along with 3 different algorithm in order to determine whether it properly extracts formants for children's vowels. To this end, Korean monophghongs of 6-year-old children were collected and the Praat scripts were applied to the data. Resultant Formant plots and statistical analysis showed that optimum_script and qtone_script, which links to the perceptual unit, performed very well in formant extraction compared to the remaining 2 scripts.

음성으로부터 감성인식 요소분석 (Analyzing the element of emotion recognition from speech)

  • 심귀보;박창현
    • 한국지능시스템학회논문지
    • /
    • 제11권6호
    • /
    • pp.510-515
    • /
    • 2001
  • 일반적으로 음성신호로부터 사람의 감정을 인식할 수 있는 요소는(1)대화의 내용에 사용한 단어, (2)톤 (tore), (3)음성신호의 피치(Pitch), (4)포만트 주파수(Formant Frequencey)그리고 (5)말의 빠르기(Speech Speed)(6)음질(Voice Quality)등이다. 사람의 경우는주파수 같은 분석요소 보다 톤과 단어 빠르기, 음질로 감정을 받아들이게 되는것이 자연스러운 방법이므로 당연히 후자의 요소들이 감정을 분류하는데 중요한 인자로쓰일 수있다. 그리고, 종래는 주로 후자의 효소들을 이용하였는데, 기계로써 구현하기 위해서는 포만트 주파수를 사용할 수있게 되는것이 도움이 된다. 그러므로, 본 연구는 음성 신호로부터 피치와 포만트, 그리고 말의 빠르기 등을 이용하여 감성인식시스템을 구현하는것을 목표로 연구를 진행하고 있으며, 그 1단계 연구로서 본 논문에서는 화가 나서 내뱉는 말을 기반으로 하여 화난 감정의 독특한 특성을 찾아내었다.

  • PDF

벅아이 코퍼스에서의 젊은 성인 남성의 모음 포먼트 분석 (An Analysis of the Vowel Formants of the Young Males in the Buckeye Corpus)

  • 윤규철;노혜욱
    • 말소리와 음성과학
    • /
    • 제4권2호
    • /
    • pp.41-49
    • /
    • 2012
  • The purpose of this paper is to extract the vowel formants of the ten young male speakers from the Buckeye Corpus of Conversational Speech [1] and to analyze them in comparison to earlier works in terms of various phonetic factors that are expected to affect the realization of the formant distribution. The first two formant frequency values were automatically extracted with a Praat script along with such factors as the place of articulation, the content versus function word information, syllabic stress information, the location in a word, location in utterance, speech rate of three consecutive words, and the word frequency in the corpus. The results indicated that the formant patterns from the corpus were very different from those of earlier works although the overall pattern was similar and that the factors were strongly responsible for the realization of the two formants. The purpose of this paper is to extract the vowel formants of the ten young male speakers from the Buckeye Corpus of Conversational Speech [1] and to analyze them in comparison to earlier works in terms of various phonetic factors that are expected to affect the realization of the formant distribution. The first two formant frequency values were automatically extracted with a Praat script along with such factors as the place of articulation, the content versus function word information, the syllabic stress information, the location in a word, the location in an utterance, the speech rate of the three consecutive words, and the word frequency in the corpus. The result indicated that the formant patterns from the corpus were very different from those of earlier works although the overall pattern was similar and that the factors were strongly responsible for the realization of the two formants.

성인 포먼트 측정에서의 최적 세팅 구현: Praat software와 관련하여 (The implementation of Korean adult's optimal formant setting by Praat scripting)

  • 박지연;성철재
    • 말소리와 음성과학
    • /
    • 제11권4호
    • /
    • pp.97-108
    • /
    • 2019
  • 한국인 성인을 대상으로 최적의 포먼트 분석이 가능하도록 자동화된 프랏 스크립트를 구현하였다. 최적의 포먼트 분석이란 프랏에서 포먼트 분석 시 설정하는 2가지 세팅 파라미터(최대 포먼트, 포먼트 개수)를 조합하여 측정된 제1, 제2 포먼트의 편차합이 최소일 때를 가리킨다. 포먼트 분석의 신뢰성을 높이기 위해서는 성별이나 모음의 종류에 따라 LPC 차수를 다르게 설정해야 하는데 프랏 매뉴얼에서는 최대 포먼트 설정 값으로 남성 5,000 Hz, 여성 5,500 Hz, 측정개수는 5개를 권고한다. 그러나 이렇게 권고된 포먼트 세팅 설정이 한국어 모음에 대해서도 타당한지 검증이 필요하다. 본 연구에서 구현한 4가지 스크립트를 적용한 결과, 각 모음별 포먼트 산점도로 확인하였을 때 특히 여성의 경우 스크립트에 따라 측정된 포먼트 변이의 폭이 두드러지는 차이를 보였다. 포먼트 산점도와 통계 결과를 통해 linear_script와 qtone_script가 포먼트 측정에서 더 신뢰성이 높은 것을 알 수 있었다. Linear_script, qtone_script에서 최적의 세팅으로 설정된 최대 포먼트와 포먼트 개수의 데이터 경향성을 살펴보면, 전설 모음 [이, 에]의 경우 권고 설정보다 최대 포먼트 값은 높게, 포먼트 개수의 값은 적게 설정되었다. 반면 후설모음 [오, 우]의 경우, 권고 설정보다 최대 포먼트 값은 낮게, 포먼트 개수의 값은 많게 설정되는 것을 확인할 수 있었다.