• Title/Abstract/Keywords: Fundamental frequency (F0)

136 search results (processing time: 0.02 s)

The First Formant Characteristics in Vocalize of One Soprano

  • 송윤경;진성민
    • 대한후두음성언어의학회지 / Vol. 16, No. 1 / pp. 10-14 / 2005
  • Background and Objectives: Vowels are characterized by their formant patterns. The first formant (F1) is determined by the high-low placement of the tongue, and the second formant (F2) by its front-back placement. The fundamental frequency (F0) of a soprano often exceeds the normal frequency of the first formant, and vocal intensity is boosted when F0 is high and a harmonic coincides with a formant. This is called formant tuning. Experienced singers thus learn to tune their formants over a reasonable range by lowering the tongue to maximize vocal intensity. The current study aimed to identify formant tuning in one experienced soprano by comparing the first formants of the vowel [i] in three types of voice production: speech, ascending scale, and vocalize. Materials and Method: All recordings of the vowel [i] in speech, ascending scale (from F4 to A4), and vocalize ("Ridente la calma") were made with a digital audio tape recorder in a sound-treated room. The captured data were analyzed by the long-term average (LTA) power spectrum using the FFT algorithm of the Computerized Speech Lab (CSL, Kay Elemetrics, Model 4300B). Results: Although the first formant of the vowel [i] in speech was 238 Hz, in the ascending scale it was 377 Hz, 405 Hz, and 453 Hz at F4 (349 Hz), G4 (392 Hz), and A4 (440 Hz), respectively, and 722 Hz, 820 Hz, and 918 Hz at F5 (698 Hz), G5 (784 Hz), and A5 (880 Hz), respectively. In vocalize, the first formants of [i] were 380 Hz, 398 Hz, and 453 Hz at F4, G4, and A4, respectively, and 720 Hz, 821 Hz, and 890 Hz at F5, G5, and A5, respectively. Conclusion: These results show that, in both the ascending scale and vocalize, the first formant stayed above the fundamental frequency at high pitch. This finding implies that the formant tuning of the vowel [i] observed in the ascending scale also occurs in vocalize.

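
The comparison reported in this abstract can be checked with a few lines of arithmetic. The sketch below only restates the F0 and F1 values quoted in the abstract (in Hz); the variable and function names are illustrative and not taken from the paper.

```python
# F0 of each sung note and the measured F1 of [i], copied from the abstract (Hz).
NOTE_F0 = {"F4": 349, "G4": 392, "A4": 440, "F5": 698, "G5": 784, "A5": 880}
F1_SCALE = {"F4": 377, "G4": 405, "A4": 453, "F5": 722, "G5": 820, "A5": 918}
F1_VOCALIZE = {"F4": 380, "G4": 398, "A4": 453, "F5": 720, "G5": 821, "A5": 890}
F1_SPEECH = 238  # F1 of spoken [i]


def f1_margin(f1_by_note):
    """Return {note: F1 - F0} in Hz for every sung note."""
    return {note: f1_by_note[note] - NOTE_F0[note] for note in NOTE_F0}


scale_margin = f1_margin(F1_SCALE)
vocalize_margin = f1_margin(F1_VOCALIZE)

for note in NOTE_F0:
    print(f"{note}: scale {scale_margin[note]:+d} Hz, "
          f"vocalize {vocalize_margin[note]:+d} Hz")

# Every sung F0 lies above the spoken F1, yet the sung F1 stays above F0.
print(all(f0 > F1_SPEECH for f0 in NOTE_F0.values()))
print(all(m > 0 for m in scale_margin.values()))
print(all(m > 0 for m in vocalize_margin.values()))
```

All margins are positive, which is the pattern the abstract describes as formant tuning.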

A Basic Study on the Conversion of Sound into Color Image using both Pitch and Energy

  • Kim, Sung-Ill
    • International Journal of Fuzzy Logic and Intelligent Systems / Vol. 12, No. 2 / pp. 101-107 / 2012
  • This study describes a proposed method of converting an input sound signal into a color image by emulating human synesthetic skills, which make it possible to associate a sound source with a specific color image. As a first step of sound-to-image conversion, features such as fundamental frequency (F0) and energy are extracted from an input sound source. A musical scale and an octave are then calculated from the F0 signals, so that scale, energy, and octave can be converted into the three elements of the HSI model, namely hue, saturation, and intensity, respectively. Finally, a color image in the BMP file format is created as the output of the HSI-to-RGB conversion. We built a basic system based on the proposed method using standard C programming. The simulation results revealed that the output color images in the BMP file format created from input sound sources have diverse hues corresponding to the changes in the F0 signals, where the hue elements have different intensities depending on the octave, with a minimum frequency of 20 Hz. Furthermore, the output images also have various levels of chroma (or saturation), which is converted directly from the energy.
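
The pipeline described above (F0 to scale and octave, then to HSI, then to RGB) might be sketched as follows. This is not the authors' C implementation: the 12-semitone hue mapping, the 10-octave intensity range, and all function names are assumptions made for illustration. Only the 20 Hz lower bound and the scale/energy/octave to hue/saturation/intensity assignment come from the abstract, and the HSI-to-RGB step uses the standard sector formulas.

```python
import math

F_MIN = 20.0       # lowest frequency, as stated in the abstract
NUM_OCTAVES = 10   # assumption: 20 Hz up to 20480 Hz


def f0_to_scale_octave(f0):
    """Split F0 into (semitone index 0-11, octave index) counted from F_MIN."""
    semitones = round(12 * math.log2(f0 / F_MIN))
    return semitones % 12, semitones // 12


def hsi_to_rgb(h_deg, s, i):
    """Standard HSI-to-RGB conversion; returns 8-bit (R, G, B)."""
    sector, h = divmod(h_deg % 360, 120)
    hr = math.radians(h)
    a = i * (1 - s)
    b = i * (1 + s * math.cos(hr) / math.cos(math.radians(60) - hr))
    c = 3 * i - (a + b)
    # Channel order depends on which 120-degree sector the hue falls in.
    r, g, bl = [(b, c, a), (a, b, c), (c, a, b)][int(sector)]
    return tuple(min(255, max(0, round(255 * v))) for v in (r, g, bl))


def sound_to_rgb(f0, energy):
    """Map one frame to a color; energy is assumed normalised to [0, 1]."""
    scale, octave = f0_to_scale_octave(f0)
    hue = 360.0 * scale / 12                           # scale  -> hue
    saturation = min(1.0, max(0.0, energy))            # energy -> saturation
    intensity = min(1.0, (octave + 1) / NUM_OCTAVES)   # octave -> intensity
    return hsi_to_rgb(hue, saturation, intensity)


print(sound_to_rgb(440.0, 0.8))
```

Under these assumptions, two tones an octave apart share a hue and differ only in intensity, which matches the behaviour the abstract reports.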

Core Loss Analysis of Non-oriented Electrical Steel Under Magnetic Induction Including Higher Harmonics

  • Cho, Chuhyun;Son, Derac;Cho, Youk
    • Journal of Magnetics / Vol. 6, No. 2 / pp. 66-69 / 2001
  • The actual magnetic induction waveform of cores in electrical machines is not sinusoidal, i.e., higher harmonics are always included. The core loss in actual electrical machines therefore differs from the core loss measured by the standard testing method, which requires a sinusoidal magnetic induction waveform. Core loss analysis under higher harmonic induction is thus important in electric machine design. In this work we measured the core loss when a hysteresis loop has only one period of an ac minor loop of higher harmonic frequency, depending on the position of the ac minor loop relative to the fundamental harmonic frequency. From this experiment, the core loss $P_c(B_0, f_0, B_h, nf_0)$ under a higher harmonic magnetic induction $B_h$ could be expressed as a linear combination of the core loss at the fundamental harmonic frequency, $P_c(B_0, f_0)$, the core loss of an ac minor loop in the zero induction region of the major hysteresis loop, $P_{cL}(B_h, nf_0)$, and the core loss of an ac minor loop in the high induction region of the major hysteresis loop, $P_{cH}(B_h, nf_0)$, i.e., $P_c(B_0, f_0, B_h, nf_0) = P_c(B_0, f_0) + (n-1)\left[k_1(B_0)\,P_{cL}(B_h, nf_0) + (1 - k_1(B_0))\,P_{cH}(B_h, nf_0)\right]$. This formula will be useful for electrical machine designers and is an effective method for predicting core loss including higher harmonic induction.

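
The linear combination stated in this abstract can be written as a small function. This is a sketch only: the function and argument names are hypothetical, and the numbers in the usage line are made-up placeholders rather than measurements from the paper.

```python
def core_loss_with_harmonic(p_c, p_cl, p_ch, n, k1):
    """Core loss under an n-th harmonic, following the abstract's formula.

    p_c  : core loss at the fundamental, P_c(B_0, f_0)
    p_cl : minor-loop loss in the zero induction region, P_cL(B_h, n*f_0)
    p_ch : minor-loop loss in the high induction region, P_cH(B_h, n*f_0)
    n    : harmonic order
    k1   : weighting factor k_1(B_0), assumed here to lie in [0, 1]
    """
    return p_c + (n - 1) * (k1 * p_cl + (1 - k1) * p_ch)


# Placeholder values purely for illustration (arbitrary units).
total = core_loss_with_harmonic(p_c=1.0, p_cl=0.1, p_ch=0.2, n=3, k1=0.5)
print(total)
```

With n = 1 the harmonic term vanishes and the function returns the fundamental core loss unchanged.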

F0 as a primary cue for signaling word-initial stops of Seoul Korean

  • 변희경
    • 말소리와 음성과학 / Vol. 8, No. 1 / pp. 25-36 / 2016
  • Previous studies showed that the voice onset times (VOT) of aspirated and lenis stops have merged, and that post-stop fundamental frequency (F0) has emerged as a primary cue for distinguishing the two stops in the speech of the younger generation and of females. The purpose of this study is to demonstrate that the VOT merger in aspirated and lenis stops occurs after the F0 difference between the two stops has stabilized. In other words, unless post-stop F0, which is a redundant feature, is fully developed, the VOT merger is unlikely to happen. Females acquired a stable F0 difference in stops earlier than males. The VOT merger could therefore happen, and as a result, females could take the lead in the change from VOT to F0 in initial stops. This study also shows that speakers who have acquired F0 as a primary cue make full use of F0 to distinguish lenis stops from the two other stop types (aspirated and fortis).

A Study of Fundamental Frequency for Focused Word Spotting in Spoken Korean

  • 권순일;박지형;박능수
    • 정보처리학회논문지B / Vol. 15B, No. 6 / pp. 595-602 / 2008
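  • The focused word of each sentence helps in recognizing spoken utterances and understanding their meaning. As part of an effort to find a way to spot focused words in speech signals, we experimentally analyzed the mean and variance of the fundamental frequency, as well as the mean energy, of focused words and of the other words within a sentence. In experiments on speech data from 100 spoken Korean sentences, focused words mostly showed either a relatively higher mean fundamental frequency or a relatively higher variance of the fundamental frequency than the other words. These results not only reveal prosodic characteristics of spoken Korean sentences but should also help in extracting keywords using natural language processing.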
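
The per-word statistics analyzed in this study (mean and variance of F0) are simple to compute once word boundaries and an F0 track are available. The sketch below assumes both are already given; the romanized words and F0 values are hypothetical illustration data, not taken from the paper.

```python
from statistics import mean, pvariance


def word_f0_stats(f0_by_word):
    """Map each word to (mean F0, F0 variance) over its voiced frames (Hz)."""
    return {word: (mean(f0), pvariance(f0)) for word, f0 in f0_by_word.items()}


# Hypothetical F0 samples (Hz) for three words of one sentence.
sentence = {
    "naeil": [180.0, 185.0, 182.0],
    "seoul": [230.0, 260.0, 240.0],
    "gayo": [170.0, 165.0, 160.0],
}

stats = word_f0_stats(sentence)
highest_mean = max(stats, key=lambda w: stats[w][0])
highest_var = max(stats, key=lambda w: stats[w][1])
print(highest_mean, highest_var)
```

In this toy input the same word has both the highest mean and the highest variance; the abstract reports that real focused words mostly show one or the other.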

A Study of Peak Finding Algorithms for the Autocorrelation Function of Speech Signal

  • So, Shin-Ae;Lee, Kang-Hee;You, Kwang-Bock;Lim, Ha-Young;Park, Ji Su
    • 한국컴퓨터정보학회논문지 / Vol. 21, No. 12 / pp. 131-137 / 2016
  • In this paper, peak finding algorithms for the autocorrelation function (ACF), which is widely used for detecting the pitch of voiced signals, are proposed. It is well known that estimating the fundamental frequency (F0) of a speech signal is both very important and quite difficult. The proposed algorithms are implemented using characteristics of the ACF, such as its monotonic increase. The proposed algorithms may therefore be able to estimate the fundamental frequency reliably and correctly as long as the autocorrelation function of the speech signal is accurate. Since the proposed algorithms may reduce computational complexity, they can be applied to real-time processing. Speech data composed of Korean words expressing emotion are used to evaluate their performance. The pitches are measured to compare the performance of the proposed algorithms.
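
For orientation, a plain baseline ACF pitch estimator is sketched below. It is not the paper's proposed peak finding algorithm, whose details the abstract does not give: it simply takes the largest local maximum of the ACF within an assumed F0 search range of 60 to 500 Hz. The function names and the synthetic test signal are illustrative.

```python
import math


def autocorrelation(x, max_lag):
    """Biased autocorrelation of x for lags 0..max_lag."""
    n = len(x)
    return [sum(x[t] * x[t + lag] for t in range(n - lag))
            for lag in range(max_lag + 1)]


def estimate_f0(x, sample_rate, f_min=60.0, f_max=500.0):
    """Return F0 in Hz from the highest ACF peak in the lag search range."""
    lag_lo = max(1, int(sample_rate / f_max))
    lag_hi = int(sample_rate / f_min)
    acf = autocorrelation(x, lag_hi + 1)
    # Keep only local maxima so a sloping edge of the range is not chosen.
    peaks = [lag for lag in range(lag_lo, lag_hi + 1)
             if acf[lag - 1] < acf[lag] >= acf[lag + 1]]
    if not peaks:
        return None  # no periodicity found, e.g. an unvoiced frame
    best = max(peaks, key=lambda lag: acf[lag])
    return sample_rate / best


# Synthetic voiced frame: 200 Hz fundamental plus a weaker second harmonic.
SR = 8000
frame = [math.sin(2 * math.pi * 200 * t / SR)
         + 0.5 * math.sin(2 * math.pi * 400 * t / SR)
         for t in range(800)]
f0_estimate = estimate_f0(frame, SR)
print(f0_estimate)
```

The direct summation costs O(N x max_lag) per frame, which is the kind of cost a dedicated peak finding strategy would aim to keep manageable for real-time use.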

A Study on Acoustical Properties of Soprano's Singing

  • 임동철;문소연;이행세
    • 한국음향학회지 / Vol. 19, No. 5 / pp. 60-64 / 2000
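  • This paper studies how the formants of Korean monophthongs change with F0 (fundamental frequency) when a soprano produces them in a singing voice. Unlike other voice parts, a soprano's formants are generally known to be strongly affected by F0 during singing. A study of singing voice therefore requires a formant analysis of each vowel at F0 values across the full range a soprano can produce. Based on such an analysis, the characteristics of singing voice can be modeled as patterns in order to build singing evaluation systems or singing synthesis systems. For five professional sopranos, the formants of the five sung vowels 'a, e, i, o, u' were analyzed at pitches from A3 (220.0 Hz) to A5 (880.0 Hz). The formants of the same five vowels in ordinary conversation were also analyzed and compared with the sung case. The results show that the F2/F1 graphs of 'a, e, i' become almost straight lines at F0 values of B4 (493.8 Hz) and above. B4 is where the changing voice begins, which indicates that a singer's change in timbre is closely related to the change in formant pattern. In addition, at A5 the F1 and F2 values of 'a, e, i, o, u' nearly coincide; that is, vowels sung in the highest register become difficult to distinguish from one another. This paper proposes that, when building a singing evaluation system or a singing synthesis system, the F1 and F2 of 'a, o, u' from B4 to A5 be specified as slopes with respect to F0. Such a specification should reduce the effort and cost needed to build systems related to singing voice.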

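
The note frequencies quoted in this abstract (A3 = 220.0 Hz, B4 = 493.8 Hz, A5 = 880.0 Hz) are consistent with twelve-tone equal temperament tuned to A4 = 440 Hz. The sketch below assumes that tuning; the function name is illustrative and only natural notes (no sharps or flats) are handled.

```python
# Semitone offset of each natural note within an octave starting at C.
NOTE_OFFSETS = {"C": 0, "D": 2, "E": 4, "F": 5, "G": 7, "A": 9, "B": 11}


def note_to_hz(name, a4=440.0):
    """Convert a natural note name such as 'B4' to its frequency in Hz."""
    letter, octave = name[0], int(name[1:])
    midi = 12 * (octave + 1) + NOTE_OFFSETS[letter]   # A4 -> MIDI note 69
    return a4 * 2 ** ((midi - 69) / 12)


for note in ("A3", "B4", "A5"):
    print(note, round(note_to_hz(note), 1))
```

The span from A3 to A5 is exactly two octaves, and B4 lies two semitones above A4.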

The relationship between fluency levels and suprasegmentals according to the sentence types in the English read speech by Korean middle school English learners

  • 김화영
    • 말소리와 음성과학 / Vol. 14, No. 3 / pp. 51-66 / 2022
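  • The purpose of this study is to identify which suprasegmental features affect how closely Korean learners of English approximate native English speakers when reading English sentences aloud, in order to aid English pronunciation teaching. Middle school English learners were chosen as participants, and the materials consisted of sentences of various types (declarative, interrogative, imperative, exclamatory) and syllable counts. The suprasegmental features analyzed in these utterances were speech rate, pause frequency, pause duration, F0 range, and rhythm, and the acoustic measurements were subjected to mean comparison, correlation analysis, and regression analysis. The results showed that speech rate, pause frequency, pause duration, and F0 range affect fluency level ratings. In the regression of fluency level on all suprasegmental features, the features affecting fluency level were speech rate and F0 range. Rhythm was not statistically significant in relation to fluency level. English pronunciation teaching should therefore train learners to increase their speech rate and widen their F0 range. Training learners to reduce the number and duration of pauses also helps improve fluency. When analyzed by sentence type, exclamatory sentences showed a faster speech rate, lower pause frequency, shorter pause duration, and higher rhythm values than the other sentence types.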

Separation of Voiced Sounds and Unvoiced Sounds for Corpus-based Korean Text-To-Speech

  • 홍문기;신지영;강선미
    • 음성과학 / Vol. 10, No. 2 / pp. 7-25 / 2003
  • Predicting the right prosodic elements is a key factor in improving the quality of synthesized speech. Prosodic elements include break, pitch, duration, and loudness. Pitch, which is realized by the fundamental frequency (F0), is the most important element for the quality of synthesized speech. However, the previous method for predicting F0 has some problems. If voiced and unvoiced sounds are not correctly classified, the result is wrong pitch prediction, wrong triphone unit selection when synthesizing the voiced and unvoiced sounds, and clicking or vibrating sounds. Such artifacts are common at transitions from a voiced sound to an unvoiced sound or from an unvoiced sound to a voiced sound. This problem is not resolved by grammar-based methods, and it strongly affects the synthesized sound. Therefore, to reliably obtain correct pitch values, this paper proposes a new model for predicting and classifying voiced and unvoiced sounds using the CART tool.


Acoustic Characteristics of Vowels in Korean Distant-Talking Speech

  • 이숙향;김선희
    • 대한음성학회지:말소리 / Vol. 55 / pp. 61-76 / 2005
  • This paper aims to analyze the acoustic characteristics of vowels produced in a distant-talking environment. The analysis was performed using a statistical method. The influence of gender and speaker on the variation was also examined. The speech data used in this study consist of 500 distant-talking words and 500 normal words from 10 speakers (5 males and 5 females). The acoustic features selected for the analysis were the duration, the formants (F1 and F2), the fundamental frequency, and the total energy. The results showed that the duration, F0, F1, and the total energy increased in distant-talking speech compared to normal speech; female speakers showed a greater increase in all features except the total energy and the fundamental frequency. In addition, speaker differences were observed.
