• 제목/요약/키워드: Formant Frequency

검색결과 183건 처리시간 0.03초

Characteristics of Cow´s Voices in Time and Frequency domains for Recognition

  • Ikeda, Yoshio;Ishii, Y.
    • Agricultural and Biosystems Engineering
    • /
    • 제2권1호
    • /
    • pp.15-23
    • /
    • 2001
  • On the assumption that the voices of the cows are produced by the linear prediction filter, we characterized the cows’voices. The order of this filter was determined by examining the voice characteristics both in time and frequency domains. The proposed order of the linear prediction filter is 15 for modeling voice production of the cow. The characteristics of the amplitude envelope of the voice signal was investigated by analyzing the sequence of the short time variance both in time and frequency domains, and the new parameters were defined. One of the coefficients o the linear prediction filter generating the voice signal, the fundamental frequency, the slope of the straight line regressed from the log-log spectra of the short time variance and the coefficients of the linear prediction filter generating the sequence of the short time variance of the voice signal can differentiate the two cows.

  • PDF

구개상의 두께에 따른 한국어 자음의 발음 변화에 관한 컴퓨터 분석 - 치조음, 경구개음- (A COMPUTER ANALYSIS ON THE KOREAN CONSONANT SOUND DISTORTION IN RELATION TO THE PALATAL PLATE THICKNESS -Dentoalveolar and hard palatal consonant-)

  • 우이형;최대균;최부병;박남수
    • 대한치과보철학회지
    • /
    • 제25권1호
    • /
    • pp.71-94
    • /
    • 1987
  • This study was carried out to investigate the sound distortion following the alternation of the palatal plate thickness. For this study, 2 healthy male subjects (24-year-old) were selected. Born in Seoul, they both spoke Seoul dialect. First, their sounds of /na(나)/, /da(다)/, /1a(라)/, /ja(자)/, /cha(차)/, /ta(타)/, without inserting plates were recorded, and then the sounds with palatal plates of different thickness were recorded, successively. The plate was fabricated in 3 types, each palatal thickness being 1.0mm, 2.5mm, dentoalveolar portion 2.5mm, other residual portion was 1.0mm, successively. Each type plates named B, C, D-type, in succession. Series of analysis were administered through Computer(16 bit) to analyze the sound distortions. These experiments were analyzed by the LPC (without weighting, pre-weighting, post-weighting) of the consonants, vowels portion, formant frequency of the vowels and word duration of the consonants. The findings led to the following conclusions: 1. There was no correlation of the distortion rate on the 2 informants. 2. Generally, vowels were not affected by the palatal plate thickness in the formant analysis, however, more distortion was detected in the LPC analysis, especially C, D-type plates. 3. Consonants distortion was more evident in the C, D-type plate. 4. The second formant was most disturbed and reduced in the all consonants with insertion of the palatal plate, especially C, D-type plate. 5. Word duration was shortened in the plate inserted(except /ja/, /cha/), especially C, D-type. 6. It was found that dentoalveolar, hard palatal sounds were severely distorted in plate inserted, and they were mainly affected by the dentoalveolar portion thickness. 7. There was correlation between palatal thickness and consonants quality.

  • PDF

C8H10O2H4 섭취량에 의한 조음기관의 변화 요소 분석 연구 (A Study on the Change Parameter Analysis of Articulator by Intake the C8H10O2H4)

  • 김봉현;조동욱
    • 한국통신학회논문지
    • /
    • 제36권1B호
    • /
    • pp.93-100
    • /
    • 2011
  • 현대사회에서 일에 대한 스트레스, 여가 생활, 기호 식품에 대한 매력 등이 증가되면서 커피를 즐겨 마시는 사람들이 늘고 있다. 커피는 품종, 가공방법, 원두의 혼합여부 및 인위적인 향의 첨가여부 등에 따라 다양한 종류로 현대인의 입맛을 사로잡고 있다. 그러나 대부분의 커피에는 $C_8H_{10}O_2N_4$ 함유되어 있고 이로 인해 인체의 여러부위에 영향을 미치고 있다. 이러한 $C_8H_{10}O_2N_4$이 커피의 주성분인 카페인이다. 따라서 본 논문에서는 블랙커피 한 잔에 함유되어 있는 $C_8H_{10}O_2N_4$ 250mg에 대해 섭취량이 증가할 때마다 조음기관에 미치는 영향을 분석하였다. 이를 위해 20대 남성 30명을 대상으로 $C_8H_{10}O_2N_4$ 섭취량을 점차 늘라면서 이에 대해 반응하는 조음기관의 변화를 음성 분석 요소인 Jitter, Formant 및 Spectrum을 적용하여 결과값을 추출하고 이를 분석하는 연구를 수행하였다.

MBE 부호화용 스펙트럼 V-UV 구간 검출에 관한 연구 (On a Detection of V-UV Segments of Speech Spectrum for the MBE Coding)

  • 김을제
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1992년도 학술논문발표회 논문집 제11권 1호
    • /
    • pp.43-48
    • /
    • 1992
  • In the area of speech vocoder systems, the MBE vocoder allows the high quality and low bit rate. In the MBE parameters detection, the dicision methods of V/UV region proposed until now are dependent highly to the other parameters, fundamental frequency and formant information. In this paper, thus, we propose a new V/UV detection method that uses a zero-crossing rate of flatten harmonices spectrum. This method can reduce the influences of the other parameters for the V/UV regions detection.

  • PDF

신경망 기반의 동적 파라미터들을 이용한 음성 경계 추출 (A Voice Boundary Detection Method Using Dynamic Parameters Based On Neural Network)

  • 마창수;김계영;최형일
    • 한국정보과학회:학술대회논문집
    • /
    • 한국정보과학회 2002년도 가을 학술발표논문집 Vol.29 No.2 (2)
    • /
    • pp.616-618
    • /
    • 2002
  • 본 논문에서는 음성인식 성능을 높이기 위한 기본적 단계인 음성과 비음성 부분의 경계를 추출하는 음성 경계 추출 방법을 제안한다. 음성경계 추출을 위한 특징들로는 시간영역 분할 파라미터인 ZCR, MA를 사용하고 주파수 영역 분할 파라미터로 주파수 대역 파워 에너지 (Frequency band power energy), 포만트 계수 (Formant coefficient)를 사용하였고 각 파라미터들을 이용하여 음성 경계를 결정할 때 경험에 의해 임계치를 결정하는 단점을 보안하기 위해서 신경망을 이용한다. 신경망의 가중치와 임계치들은 지도 학습을 통해 최적화 되고, 학습을 통해 구성된 망을 음성과 비음성의 경계치 구분에 사용한다.

  • PDF

Text-Independent Speaker Identification System Based On Vowel And Incremental Learning Neural Networks

  • Heo, Kwang-Seung;Lee, Dong-Wook;Sim, Kwee-Bo
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2003년도 ICCAS
    • /
    • pp.1042-1045
    • /
    • 2003
  • In this paper, we propose the speaker identification system that uses vowel that has speaker's characteristic. System is divided to speech feature extraction part and speaker identification part. Speech feature extraction part extracts speaker's feature. Voiced speech has the characteristic that divides speakers. For vowel extraction, formants are used in voiced speech through frequency analysis. Vowel-a that different formants is extracted in text. Pitch, formant, intensity, log area ratio, LP coefficients, cepstral coefficients are used by method to draw characteristic. The cpestral coefficients that show the best performance in speaker identification among several methods are used. Speaker identification part distinguishes speaker using Neural Network. 12 order cepstral coefficients are used learning input data. Neural Network's structure is MLP and learning algorithm is BP (Backpropagation). Hidden nodes and output nodes are incremented. The nodes in the incremental learning neural network are interconnected via weighted links and each node in a layer is generally connected to each node in the succeeding layer leaving the output node to provide output for the network. Though the vowel extract and incremental learning, the proposed system uses low learning data and reduces learning time and improves identification rate.

  • PDF

통계적 방법과 인지실험을 통한 한국어 초성파열음의 음소단위 분석에 관한 연구 (A Study on the Phoneme Based Analysis of Korean Initial Plosives Using Statistical Method and Perception Tests)

  • 조철우;이우선;이규호;김종안;임광일;이태원
    • 한국음향학회지
    • /
    • 제8권5호
    • /
    • pp.78-85
    • /
    • 1989
  • 본 논문에서는 한국어의 규칙합성에 관한 연구중 파열음의 함성 파라미터를 추정하기 위하여 사용한 통계적 방법과 인지실험에 의한 방법에 관하여 기술하고 있다. 합성기로는 직렬 포만트 합성기를 구성하여 사용하였고 통제적 분석에 사용된 음성시료는 9개의 초성 파열음과 8개의 모음으로 구성되는 72개의 독립 CV형태를 갖는 단음절을 만일 화자를 통하여 수집하였다. 음성의 분석은 시간 및 주파수 평면에서 파라미터의 변화를 중심으로 행하였으며, 인지실험을 통한 파라미터 추정방법을 통하여 독립적으로 포만트 파라미터의 변화에 관하여 조사하였다.

  • PDF

MRI에 의한 모음의 성도 단면적 측정 및 면적 변이에 따른 합성 연구 (Measurement of the vocal tract area of vowels By MRI and their synthesis by area variation)

  • 양병곤
    • 음성과학
    • /
    • 제4권1호
    • /
    • pp.19-34
    • /
    • 1998
  • The author collected and compared midsagittal, coronal, coronal oblique, and transversal images of Korean monophthongs /a, i, e, o, u, i, v/ produced by a healthy male speaker using 1.5 T MR, VISION. Area was measured by computer software after tracing the cross-section at different points along the tract. Results showed that the width of the oral and pharyngeal cavities varied compensatorily from each other on the midsagittal dimension. Formant frequency values estimated from the area functions of the seven vowels showed a strong correlation (r=0.978) with those analyzed from the spoken vowels. Moreover, almost all of 35 students who listened to the synthesized vowels from area data perceived the synthesized vowels as equivalent to the spoken ones. Movement of constriction points of vowel /u/ with wider lip opening sounded /i/ and led to slight changes in vowel quality. Jaw and tongue movement led to major volume variation with an anatomical limitation. Each comer vowel varied systematically from a somewhat constant volume of the average area. Thus, the author proposed that any simulation studies related to vocal tract area variation should reflect its constant volume. The results may be helpful to verify exact measurement of the vocal tract area through vowel synthesis and a simulation study before having any operation of the vocal tract.

  • PDF

후두 전적출술후 MR영상을 이용한 음성재활환자의 발성기전에 관한 연구 (Mechanism of Vowel Phonation in T-E Shunt Patient using MR Imaging after Total Laryngectomy)

  • 박병래
    • 대한방사선기술학회지:방사선기술과학
    • /
    • 제20권1호
    • /
    • pp.21-27
    • /
    • 1997
  • Total laryngectomy has become an usual treatment for any advanced carcinoma of the laynx, but most patients who have undergone total laryngectomy have shown permanant disability in voice production. I compared the first three formant frequencies estimated from MRI to those measured directly from speech data of the T-E patients and the normal. It was to estimate the accuracy of MRI and to compare the vocal tract shape of the normal to T-E patients. The obtained results were as follows : 1. The middle sagittle section of the MRI represents vocal tract well during pnonation. The vocal tract shape of the T-E shunt patients are lack of pharyngeal space and superior space of the glottis. 2. The length of the normal subject's vocal tract is 17 cm. For the T-E shunt patients, the length from lip to shunt opening is 17.5 cm in case 1, and 18.5 cm in case 2. That of the true resonante chamber is 13 cm and 13.5 cm for each case respectively. 3. T-E shunt patients phonated strained voice. The intensity of the higher formant frequency decreased especially in /o/, /u/. 4. The vocal tract is shortened during the phonation by T-E shunt patients. In case of /e/ and /i/, front cavities are constricted while back cavities are shortened. 5. The pseudoglottis of the T-E shunt patients is located at $14{\sim}15\;cm$ below from lips.

  • PDF

비중격 성형술 및 하비잡개 절제술 후 비개존도 측정을 위한 Nasometer와 제1포만트 측정의 유용성 (Significance of Nasometer and First Formant for Nasal Patency After Septoplasty and Turbinoplasty)

  • 진성민;강현국;이경철;박상욱;이성채;이용배
    • 대한후두음성언어의학회지
    • /
    • 제8권2호
    • /
    • pp.161-165
    • /
    • 1997
  • Background : The rhinomanometry and acoustic rhinometry can assess e nasal passage dynamically and statically Recently, analytic methods such as nasometer and sound spectrogram are gaining wide attention to evaluate the nasality objectively. Objectives : firstly to determine if ere was a relationship between the new methods and nasal airway resistance, and secondly to establish if the measurement of nasalance and sound spectrum could be used as an alternative to rhinomanometry and acoustic rhinometry. Materials and Methods : Thirty two patients who underwent either septoplasty and turbinectomy for nasal obstruction were studied. And their ages ranged form 15 to 45 years, with an average of 26.1 years. The rhinomanometry, nasometer, sound spectrogram were performed at preoperative and postoperative 4 weeks day. Results : After operation, subjective symptoms and rhinomanometric results were significantly improved but nasalance and slope of nana, mama and mamma passage had not meningful change. The significnat changes were noted in nasalance and first nasal formant frequency of nasal consonant of velum(angang). Conclusion : Nasometer and sound spectrogram had a limitation for the measure of nasal patency.

  • PDF