• 제목/요약/키워드: Speech Tone

검색결과 200건 처리시간 0.024초

Audiogram in Response to Stimulation Delivered to Fluid Applied to the External Meatus

  • Geal-Dor, Miriam;Chordekar, Shai;Adelman, Cahtia;Kaufmann-Yehezkely, Michal;Sohmer, Haim
    • Journal of Audiology & Otology
    • /
    • 제24권2호
    • /
    • pp.79-84
    • /
    • 2020
  • Background and Objectives: Hearing can be elicited in response to vibratory stimuli delivered to fluid in the external auditory meatus. To obtain a complete audiogram in subjects with normal hearing in response to pure tone vibratory stimuli delivered to fluid applied to the external meatus. Subjects and Methods: Pure tone vibratory stimuli in the audiometric range from 0.25 to 6.0 kHz were delivered to fluid applied to the external meatus of eight participants with normal hearing (15 dB or better) using a rod attached to a standard clinical bone vibrator. The fluid thresholds obtained were compared to the air conduction (AC), bone conduction (BC; mastoid), and soft tissue conduction (STC; neck) thresholds in the same subjects. Results: Fluid stimulation thresholds were obtained at every frequency in each subject. The fluid and STC (neck) audiograms sloped down at higher frequencies, while the AC and BC audiograms were flat. It is likely that the fluid stimulation audiograms did not involve AC mechanisms or even, possibly, osseous BC mechanisms. Conclusions: The thresholds elicited in response to the fluid in the meatus likely reflect a form of STC and may result from excitation of the inner ear by the vibrations induced in the fluid. The sloping fluid audiograms may reflect transmission pathways that are less effective at higher frequencies.

Audiogram in Response to Stimulation Delivered to Fluid Applied to the External Meatus

  • Geal-Dor, Miriam;Chordekar, Shai;Adelman, Cahtia;Kaufmann-Yehezkely, Michal;Sohmer, Haim
    • 대한청각학회지
    • /
    • 제24권2호
    • /
    • pp.79-84
    • /
    • 2020
  • Background and Objectives: Hearing can be elicited in response to vibratory stimuli delivered to fluid in the external auditory meatus. To obtain a complete audiogram in subjects with normal hearing in response to pure tone vibratory stimuli delivered to fluid applied to the external meatus. Subjects and Methods: Pure tone vibratory stimuli in the audiometric range from 0.25 to 6.0 kHz were delivered to fluid applied to the external meatus of eight participants with normal hearing (15 dB or better) using a rod attached to a standard clinical bone vibrator. The fluid thresholds obtained were compared to the air conduction (AC), bone conduction (BC; mastoid), and soft tissue conduction (STC; neck) thresholds in the same subjects. Results: Fluid stimulation thresholds were obtained at every frequency in each subject. The fluid and STC (neck) audiograms sloped down at higher frequencies, while the AC and BC audiograms were flat. It is likely that the fluid stimulation audiograms did not involve AC mechanisms or even, possibly, osseous BC mechanisms. Conclusions: The thresholds elicited in response to the fluid in the meatus likely reflect a form of STC and may result from excitation of the inner ear by the vibrations induced in the fluid. The sloping fluid audiograms may reflect transmission pathways that are less effective at higher frequencies.

한국어 원거리 음성의 운율적 특성 (Prosodic Characteristics of Korean Distant Speech)

  • 김선희;김종진;이숙향
    • 한국음향학회지
    • /
    • 제25권3호
    • /
    • pp.137-143
    • /
    • 2006
  • 본 논문의 목적은 한국어 원거리 음성의 운율적 특성을 규명하는 것으로, 36개의 2음절어를 4명의 화자 (여성 화자 2명, 남성 화자 2명)가 원거리 환경과 일반환경에서 발화한 총 288개의 2음절어를 분석대상으로 하였다. 실험 결과 지속시간과 에너지의 경우는 일반 음성에 비하여 원거리 음성의 첫음절에 대한 둘째음절의 비율이 유의미하게 큰 것으로 나타났다. F0 대역폭의 경우에도 원거리 음성에서의 대역폭이 평이 음성에 비해 큰 값을 보였다. 억양 패턴에 있어서는 원거리 음성의 경우에 둘째음절에 'HL%'의 복합 경계성조가 실현되거나 첫음절에 'L+H' 성조가 실현되기도 하였으며 이 두 가지가 한 단어에 모두 실현되는 경우도 있었다.

성악인과 일반인 발성의 전기성문검사 및 공기역학적 검사에 대한 연구 (Comparative Evaluation of Electroglottography and Aerodynamic Study in Trained Singers and Untrained Controls under Different Two Pitch)

  • 안성윤;김한수;김영호;송기재;최성희;이성은;최홍식
    • 음성과학
    • /
    • 제10권2호
    • /
    • pp.111-128
    • /
    • 2003
  • Aerodynamic study is valuable information about the vocal efficiency in translating airflow to acoustic signal. The purpose of this study was to investigate the differences between trained singers and untrained controls under different two pitch by simultaneous using the airway interruption method and electroglottography (EGG). Under singing a Korean lied 'Gene', 20 (Male 10, Female 10) trained singers were studied on two one-octave different tone. Mean flow rate (MFR) , subglottic pressure (Psub) and intensity were measured with aerodynamic test using the Phonatory function analyzer (Nagashima Ltd. Model PS 77H, Tokyo, Japan). Closed quotients (Qx), jitter and shimmer were also investigated by electroglottography using Lx speech studio (Laryngograph Ltd, London, UK). These data were compared with those of normal controls. MFR and Psub were increased on high pitch tone in all subject groups. Statistically significant increasing of Qx and intensity were observed in male trained singers on high pitch tone (Qx;p = .025, intensity;p < .001). Beacasue of increasing of Qx and intensity, vocal efficiency was also significantly increased in male singers (p < .001). The trained singers' phonation was more efficient than untrained singers. The result means that the trained singers can increase the loudness with little changing of mean flow rate, subglottic pressure but more increasing of glottic closed quotients.

  • PDF

음성분석에 의한 체질진단에 관한 연구 (Pilot Study on the Classification for Sasangin by the Voice Analysis)

  • 이의주;송광빈;최환수;유정희;곽창규;손은혜;고병희
    • 대한한의학회지
    • /
    • 제26권1호
    • /
    • pp.93-102
    • /
    • 2005
  • Objective : This research was conducted to evaluate the method of sasangin classification by voice analysis, The 2 pilot tests were thus designed to solve the following problems: 'What are the conditions at classification for sasangin by the voice analysis?' and 'What are the important variances of /a/ parameter?'. Methods: 122 volunteers Were examined to make a diagnosis of sasangin by QSCC II and they were disease-free and healthy, First, they said /a/ three times for 2 seconds in their usual voice, Second, they said /a/ for 2 seconds by the different ways of high tone, mid tone, and low tone. The sounds were collected by a recording program (cooledit 2000) through a Sony microphone (ecm-26l). We analyzed the voices by maltlab, the simulation tool. Results: There were no differences and were correlations when one said /a/ three times for 2 seconds in the usual voice. There were some things to correlate when one said /a/ three times for 2 seconds by the different ways of high speech, usual speech, and low speech. Others were nothing to correlate. We evaluated the value of sasangin classification method by only /a/ voice analysis. The hit ratio was average $66.3\%\;:\;soyangin\;67.9\%,\;taeumin\;68.0\%,\;soeumin\;63.9\%$. Conclusion: We must set up the conditions to use the method of sasangin classification by voice analysis. The value of sasangin classification method by only fa! voice analysis was a hit ratio of $66.3\%$.

  • PDF

음성합성시스템을 위한 음색제어규칙 연구 (A Study on Voice Color Control Rules for Speech Synthesis System)

  • 김진영;엄기완
    • 음성과학
    • /
    • 제2권
    • /
    • pp.25-44
    • /
    • 1997
  • When listening the various speech synthesis systems developed and being used in our country, we find that though the quality of these systems has improved, they lack naturalness. Moreover, since the voice color of these systems are limited to only one recorded speech DB, it is necessary to record another speech DB to create different voice colors. 'Voice Color' is an abstract concept that characterizes voice personality. So speech synthesis systems need a voice color control function to create various voices. The aim of this study is to examine several factors of voice color control rules for the text-to-speech system which makes natural and various voice types for the sounding of synthetic speech. In order to find such rules from natural speech, glottal source parameters and frequency characteristics of the vocal tract for several voice colors have been studied. In this paper voice colors were catalogued as: deep, sonorous, thick, soft, harsh, high tone, shrill, and weak. For the voice source model, the LF-model was used and for the frequency characteristics of vocal tract, the formant frequencies, bandwidths, and amplitudes were used. These acoustic parameters were tested through multiple regression analysis to achieve the general relation between these parameters and voice colors.

  • PDF

운율경계정보를 이용한 HMM기반 한국어 TTS 자연성 향상 연구 (Improvement of Naturalness for a HMM-based Korean TTS using the prosodic boundary information)

  • 임기정;이정철
    • 한국컴퓨터정보학회논문지
    • /
    • 제17권9호
    • /
    • pp.75-84
    • /
    • 2012
  • HMM 기반 음성합성시스템은 성능향상을 위해 일반적으로 대용량 음성 DB로부터 생성된 문맥의존 tri-phone을 이용한다. 그리고 대용량 DB의 경량화를 위해서 문맥의존정보를 이용하여 결정트리 방식으로 발화특성이 유사한 문맥의존음소들을 군집화한다. 군집화에 사용하는 문맥의존정보는 음소열 뿐만 아니라 운율정보도 포함하는데 이는 합성음의 자연성이 끊어 읽기, 억양패턴, 음의 장단과 같은 운율에 의해 크게 좌우되기 때문이다. 그러나 복잡한 운율정보를 사용할 경우 훈련과정에 포함되지 않은 문맥의존음소는 하나의 대표값으로 평활화되며 이로 인해 합성음의 자연성이 크게 저하된다. 본 논문에서는 합성음의 자연성을 향상시키기 위해 복잡한 운율정보 대신 억양 변화를 상승, 평탄, 하강으로 구분함으로써 운율정보표현을 간소화시킨 운율경계정보를 포함하는 문맥의존정보에 대한 문맥질의, 그리고 해당 질의의 패턴을 정의하는 방법을 제안하였다. 본 논문에서 제안하는 세 가지 운율경계정보를 포함한 문맥의존정보를 이용하여 합성음을 생성하고 MOS평가를 수행한 결과 운율경계정보를 이용한 HMM기반 한국어 TTS 합성음의 자연성이 향상됨을 확인하였다.

음성의 감성요소 추출을 통한 감성 인식 시스템 (The Emotion Recognition System through The Extraction of Emotional Components from Speech)

  • 박창현;심귀보
    • 제어로봇시스템학회논문지
    • /
    • 제10권9호
    • /
    • pp.763-770
    • /
    • 2004
  • The important issue of emotion recognition from speech is a feature extracting and pattern classification. Features should involve essential information for classifying the emotions. Feature selection is needed to decompose the components of speech and analyze the relation between features and emotions. Specially, a pitch of speech components includes much information for emotion. Accordingly, this paper searches the relation of emotion to features such as the sound loudness, pitch, etc. and classifies the emotions by using the statistic of the collecting data. This paper deals with the method of recognizing emotion from the sound. The most important emotional component of sound is a tone. Also, the inference ability of a brain takes part in the emotion recognition. This paper finds empirically the emotional components from the speech and experiment on the emotion recognition. This paper also proposes the recognition method using these emotional components and the transition probability.

한국어 억양구의 경계톤 (The Boundary Tones in Korean Intonational Phrases)

  • 한선희;오미라
    • 음성과학
    • /
    • 제5권2호
    • /
    • pp.109-129
    • /
    • 1999
  • A study of boundary tones, which are realized at the final syllable of an Intonational Phrase, is important in that sentential meaning is often differentiated solely by the use of different boundary tones in Korean. The purposes of this paper are three-fold: Firstly, it aims at finding out the different characteristics of boundary tones between designed corpus and natural speech. Secondly, it is to show that gender and dialectal differences are crucial factors in determining different realizations of boundary tones. Finally, this study is to provide a basis for better speech synthesis and speech recognition through the analysis of the morphemes where boundary tones are realized. This study has shown that nine different kinds of boundary tones are realized based on the contextual, gender and dialectal differences. In addition to the boundary tones suggested in Jun (1993), three more boundary toes are introduced: L-%,H-%,LHLH%.

  • PDF

남성 성악가의 Passaggio시 음성변화연구 (Analysis of Voice Parameters Variation during Passaggio of the Trained Male Singers)

  • 남도현;안철민;최성희;홍진희;이성은;최홍식
    • 음성과학
    • /
    • 제9권4호
    • /
    • pp.15-25
    • /
    • 2002
  • It's not easy to produce very high tones during singing for not only untrained ordinary people but also even trained singers. To get high singing tones from the low tones, some trained singers used to use a distinguished singing technique, Passaggio (vocal register transition). The purpose of this study is to compare several voice parameters variation between when to sing with using the passaggio technique and to sing without using it. We selected 18 male singers (tenor 8, baritone 10), who had more than 7 years of experience and were well trained in passaggio technique. Simultaneous measurements of fundamental frequency (F0), mean flow rate (MFR), intensity (I), and subglottal pressure (Psub) were performed using the phonatory function analyzer (Nagashima). For the tenor, target tones /a/ were presented: 1) easy phonation: $B_{2}$, 2) high tone without passaggio: F$#_{3}$ 3) high tone with passaggio: F$#_{3}$. For the baritone, target tones /a/ were presented: 1) easy phonation: G$#_{3}$, 2) high tone without passaggio: D$#_{3}$, 3) high tone with passaggio: D$#_{3}$. F0 of the target tones between non-passaggio group and passaggio group was almost the same in both tenor and baritone groups. Intensity of the non-passaggio and passaggio vocalization was much louder than that of easy phonation and pasaggio was louder than non-passaggio vocalization (especially statistically significant in baritone singers). MFR of the passaggio vocalization was greater than non-passaggio vocalization in both tenor and baritone group, but statistically significant only in baritone. Psub of the passaggio vocalization was greater than that of the non-passaggio vocalization in both tenor and baritone group, but statistically not significant in tenor.

  • PDF