• Title/Summary/Keyword: 4 Formant Frequency

Search Result 72, Processing Time 0.028 seconds

Influence of Sexual Desire Caused by Watching Phonography on Human Body (음란물 시청으로 야기된 성욕이 인체에 미치는 영향)

  • Kim, Bong Hyun;Cho, Dong Uk;Kim, Hee Dae;Lee, Bum Joo;Park, Young;Jeong, Yeon Man
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.42 no.4
    • /
    • pp.831-837
    • /
    • 2017
  • The development of various electronic media such as the Internet and smart phones, each kinds of media informations has been accompanied by the fact that various types of media information are provided from one media, and on the other hand, various dysfunctions including smart phone addiction are also caused by a very large social problem. Especially, one of the biggest dysfunctions is the social crime problem such as sex crime caused by increased sexual desire according to watch the phonography, and even if it is not a social crime, watching the phonography has influenced bad mental and physical on human body. In this paper, we try to analyze what kind of change occurs in the voice in order to investigate what kind of bad influence it has on the human body after watching the phonography. In other words, the voice in the human body is the place where the human body signal is most expressed with the face. Therefore, the purpose of this study is to investigate the effects on the organs of the human body by comparing the change of voice before and after watching phonography. Experimental results showed that the stress hormone was increased by the inability to resolve sexual desire after watching the phonography, which resulted in an increase in the bandwidth of the 3rd formant frequency.

Phoneme Segmentation in Consideration of Speech feature in Korean Speech Recognition (한국어 음성인식에서 음성의 특성을 고려한 음소 경계 검출)

  • 서영완;송점동;이정현
    • Journal of Internet Computing and Services
    • /
    • v.2 no.1
    • /
    • pp.31-38
    • /
    • 2001
  • Speech database built of phonemes is significant in the studies of speech recognition, speech synthesis and analysis, Phoneme, consist of voiced sounds and unvoiced ones, Though there are many feature differences in voiced and unvoiced sounds, the traditional algorithms for detecting the boundary between phonemes do not reflect on them and determine the boundary between phonemes by comparing parameters of current frame with those of previous frame in time domain, In this paper, we propose the assort algorithm, which is based on a block and reflecting upon the feature differences between voiced and unvoiced sounds for phoneme segmentation, The assort algorithm uses the distance measure based upon MFCC(Mel-Frequency Cepstrum Coefficient) as a comparing spectrum measure, and uses the energy, zero crossing rate, spectral energy ratio, the formant frequency to separate voiced sounds from unvoiced sounds, N, the result of out experiment, the proposed system showed about 79 percents precision subject to the 3 or 4 syllables isolated words, and improved about 8 percents in the precision over the existing phonemes segmentation system.

  • PDF

Relationship between roar sound characteristics and body size of Steller sea lion

  • Park, Tae-Geon;Iida, Kohji;Mukai, Tohru
    • Journal of the Korean Society of Fisheries and Ocean Technology
    • /
    • v.46 no.4
    • /
    • pp.458-465
    • /
    • 2010
  • Hundreds of Steller sea lions, Eumetopias jubatus, migrate from Sakhalin and the northern Kuril Islands to Hokkaido every winter. During this migration, they may use their roaring sounds to navigate and to maintain their groups. We recorded the roars of wild Steller sea lions that had landed on reefs on the west coast of Hokkaido, and those of captive sea lions, while making video recordings. A total of 300 roars of wild sea lions and 870 roars of captive sea lions were sampled. The fundamental frequency ($F_0$), formant frequency ($F_1$), pulse repetition rate (PRR), and duration of syllables (T) were analyzed using a sonagraph. $F_0$, $F_1$, and PRR of the roars emitted by captive sea lions increased in the order male, female, and juvenile. By contrast, the $F_1$ of wild males was lower than that of females, while the $F_0$ and PRR of wild males and females did not differ statistically. Moreover, the $F_0$ and $F_1$ frequencies for captive sea lions were higher than those of wild sea lions, while PRR in captive sea lions was lower than in wild sea lions. Since there was a linear relationship between body length and the $F_0$ and $F_1$ frequencies in captive sea lions, the body length distribution of wild sea lions could be estimated from the $F_0$ and $F_1$ frequency distribution using a regression equation. These results roughly agree with the body length distribution derived from photographic geometry. As the volume of the oral cavity and the length of the vocal cords are generally proportional to body length, sampled roars can provide useful information about a population, such as the body length distribution and sex ratio.

Spectral Modeling of Haegeum Using Cepstral Analysis (캡스트럼 분석을 이용한 해금의 스펙트럼 모델링)

  • Hong, Yeon-Woo;Kang, Myeong-Su;Cho, Sang-Jin;Kim, Jong-Myon;Lee, Jung-Chul;Chong, Ui-Pil
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.4
    • /
    • pp.243-250
    • /
    • 2010
  • This paper proposes a spectral modeling of Korean traditional instrument, Haegeum, using cepstral analysis to naturally describe Haegeum sounds varying with time. To get a precise result of cepstral analysis, we set the frame size to 3 periods of input signal and more cepstral coefficients are used to extract formants. The performance is enhanced by flexibly controlling the cutoff frequency of bandpass filter depending on the resonances in the synthesis process of sinusoidal components and the deleting peaks remained in the residual signal. To detect the change of pitch, we divide the input frames into silence, attack, and sustain region and determine which region the current frame is involved in. Then, the proposed method readjusts the frame size according to the fundamental frequency in the case of the current frame is in attack region and corrects the extraction errors of the fundamental frequency for the frames in sustain region. With these processes, the synthesized sounds are much more similar to the originals. The evaluation result through the listening test by a Haegeum player says that the synthesized sounds are almost similar to originals (96~100 % similar to the original sounds).

A SPEECH-PHONETIC STUDY ON THE PRONUNCIATION OF THE OPENBITE PATIENTS (개교환자의 발성에 관한 언어 음성학적 연구)

  • Kim, Ki-Dal;Yang, Won Sik
    • The korean journal of orthodontics
    • /
    • v.21 no.2 s.34
    • /
    • pp.287-307
    • /
    • 1991
  • This study aimed at examining speech defects of openbite patients, which were analized in terms of formant frequency for vowels and word pronunciation length for consonants. In addition, the upper and lower lip (perioral m.) activity was tested by the EMG. The tongue force was measured by the strain gauge, and the speech discrimination test was carried out. One experimental group and one control group were used for this study and they were respectively composed of six female openbite patients and six normal-occlusion females. Eight monophthongs, two fricatives and two affricatives were chosen for speech analysis. Speeches of the above-mentioned groups were recorded and then analized by the ILS/PC-1 software. Four hundred most frequently used monosyllables were also chosen for discrimination score. Openbite patients showed the following characteristics: 1. Abnormality in case of /a/, $/\varepsilon/$, /e/, /i/ $F_2$ and /e/, /a/ $F_1$. 2. Significantly elongated length in their pronunciation of /h/ and $/C^h/$ and somewhat elongated length also in their pronunciation of /s/ and /c/. 3. Significant upper lip activity according to the EMG test during pronunciation of the bilabial consonants. 4. Relatively weak tongue force according to the strain gauge measurement. 5. According to the speech discrimination test, high rate of misarticulation in case of (a) initial /p/ /s'/ and /ts'/, (b) /a/,$/\varepsilon/$,/e/,/je/,/o/, $/\phi/$,/jo/,/u/,/we/, and /i/ (c) final (equation omitted).

  • PDF

Classification of Diphthongs using Acoustic Phonetic Parameters (음향음성학 파라메터를 이용한 이중모음의 분류)

  • Lee, Suk-Myung;Choi, Jeung-Yoon
    • The Journal of the Acoustical Society of Korea
    • /
    • v.32 no.2
    • /
    • pp.167-173
    • /
    • 2013
  • This work examines classification of diphthongs, as part of a distinctive feature-based speech recognition system. Acoustic measurements related to the vocal tract and the voice source are examined, and analysis of variance (ANOVA) results show that vowel duration, energy trajectory, and formant variation are significant. A balanced error rate of 17.8% is obtained for 2-way diphthong classification on the TIMIT database, and error rates of 32.9%, 29.9%, and 20.2% are obtained for /aw/, /ay/, and /oy/, for 4-way classification, respectively. Adding the acoustic features to widely used Mel-frequency cepstral coefficients also improves classification.

THE EFFECT OF LINGUAL FRENECTOMY ON PHONATION & TONGUE MOVEMENT (설소대성형술이 발음 및 혀의 운동에 미치는 영향에 관한 연구)

  • Hwang, Sun-Yong;Lee, Sang-Chull;Ryu, Dong-Mok
    • Maxillofacial Plastic and Reconstructive Surgery
    • /
    • v.14 no.1_2
    • /
    • pp.40-53
    • /
    • 1992
  • This sutdy aimed at examining the effect of lingual frenectomy on phonation & tongue movement. Almost the patient visiting to department of oral & maxillofacial surgery for the treatment of tongue tie always complain the speech problem. Many operation was performed according to this problem. But the objective evaluation of the speech change have been deficient. The experimental group was 25 adult males. Fourteen Korean consonants & after Korean vowels was combined and seventy sound was made for speech analysis. Before & after lingual frenectomy, the speech of the above mentioned group was recorded and then analysed by the Speech Workstation computer software. And before & after operation, the lingual frenum & tongue protrusion amount vas measured. The results were as follows : 1. The pre-operative length of lingual frenum was inverse proportion with the pre-operative length of the protrusive tongue. 2. The average difference between pre & post-operative length of the protrusive tongue was about 23 mm. 3. In the comparison of consonant continuing time change, fricative consonant(r, s, h) was increased post-operatively. 4. In the comparison of the vowel frequency formant change, the "i"and "u" sound vas reliably changed. 5. There was no reliable speech changes on the other sounds.

  • PDF

Acoustic characteristics of speech-language pathologists related to their subjective vocal fatigue (언어재활사의 주관적 음성피로도와 관련된 음향적 특성)

  • Jeon, Hyewon;Kim, Jiyoun;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.14 no.3
    • /
    • pp.87-101
    • /
    • 2022
  • In addition to administering a questionnaire (J-survey), which questions individuals on subjective vocal fatigue, voice samples were collected before and after speech-language pathology sessions from 50 female speech-language pathologists in their 20s and 30s in the Daejeon and Chungnam areas. We identified significant differences in Korean Vocal Fatigue Index scores between the fatigue and non-fatigue groups, with the most prominent differences in sections one and two. Regarding acoustic phonetic characteristics, both groups showed a pattern in which low-frequency band energy was relatively low, and high-frequency band energy was increased after the treatment sessions. This trend was well reflected in the low-to-high ratio of vowels, slope LTAS, energy in the third formant, and energy in the 4,000-8,000 Hz range. A difference between the groups was observed only in the vowel energy of the low-frequency band (0-4,000 Hz) before treatment, with the non-fatigue group having a higher value than the fatigue group. This characteristic could be interpreted as a result of voice abuse and higher muscle tonus caused by long-term voice work. The perturbation parameter and shimmer local was lowered in the non-fatigue group after treatment, and the noise-to-harmonics ratio (NHR) was lowered in both groups following treatment. The decrease in NHR and the fall of shimmer local could be attributed to vocal cord hypertension, but it could be concluded that the effective voice use of speech-language pathologists also contributed to this effect, especially in the non-fatigue group. In the case of the non-fatigue group, the rhamonics-to-noise ratio increased significantly after treatment, indicating that the harmonic structure was more stable after treatment.

The Change of the Length of Vocal Tract in Singers according to the Phonation at Different Levels of Pitch (성악인에서 발성 시 음의 높낮이에 따른 성도 길이의 변화)

  • Ban, Jae-Ho;Kim, Chang-Gyu;Lee, Sang-Hyuk;Lee, Kyung-Chul;Jin, Sung-Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.17 no.1
    • /
    • pp.14-16
    • /
    • 2006
  • Background and Objectives: The purpose of this study is to investigate the change of vocal tract length according to the level of the pitch by the singers. Materials and Methods: Fifteen tenors were asked to produce successive /a/ sound in G4(382Hz) for the head register, C3(131Hz) for the chest register and usual speaking sound. The control group consisted of 15 males of an similar age who are not professional singers. The length of vocal tract was calculated by applying the formula of Fn=(2n-1) c/4L(F : formant frequency, c : the speed of sound in the vocal tract(350m/sec), L : length of vocal tract, $n=1,2,3,4,{\ldots}{\infty}$). Results: In singer's group, there showed no significant statistical difference of length among head and chest register and usual speaking sound. However in the control group, there showed statistically significant difference of length. Comparison of the absolute difference in the length of vocal tract by changing level of pitch in phonation, between the control group and the singers group. Changing from G4 phonation to C3 phonation and C3 phonation to usual speaking sound showed statistically difference of vocal tract length was less in the singers group than the control group. Conclusion: The change of vocal tract length, in either speaking or singing, was less in singers than the control group. We could assume that the singers maintain their larynx position constantly throughout the pitch range when phonation.

  • PDF

A Study on Acoustical Properties of Soprano′s Singing (소프라노의 성악 발성에 대한 음향학적 특징 연구)

  • 임동철;문소연;이행세
    • The Journal of the Acoustical Society of Korea
    • /
    • v.19 no.5
    • /
    • pp.60-64
    • /
    • 2000
  • This paper studies the relation between the Fundamental Frequency (F0) and the formants of simple vowels in the Korean language sung by sopranos. It is hewn that, in soprano singing, the F0 of a vowel affects its formants. For this reason the formants of simple vowels sung by sopranos must be considered in all over the soprano singing range. We recorded the five simple vowel sounds /a/, /e/, /i/, /o/, and /u/ sung by five professional sopranos from A3 (220.0Hz) to A5 (880.0Hz) in the major scale and compared the formants of the sung vowels with those of spoken vowels. We observed that F1 and F2 of sung vowels were stable in low F0 (lower than B4) but in high F0 (higher than B4), F1 and F2 lost their stabilities. In the case of /a/, /o/, and /u/, the slope of the F1-F2 graph was about 2.6, and those of the F0-F2 and F0-Fl graphs were 2.2-2.5 and 0.7-1.0, respectively. And as the F0 increases, the F1 and F2 of sung vowels /a/, /e/, /i/, /o/, and /u/ were almost the same. At A5, the Fl and F2 of five sung vowels had the same values. This results suggest that the relation between the F0 and the formants be used to synthesize soprano's singing vowels.

  • PDF