• Title/Summary/Keyword: vocal tract

Search Result 172, Processing Time 0.024 seconds

Hunminjeongeum Phonetics (II): Phonetic and Phoniatric Consideration for Explanation of Designs of Initial and Final Consonant Letters (훈민정음 음성학(II): 초성, 종성(닿소리) 제자해에 대한 음성언어의학적 고찰)

  • Choi, Hong-Shik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.33 no.2
    • /
    • pp.83-88
    • /
    • 2022
  • Hunminjeongeum had 17 initial consonant letters. Among them, five consonant letters, those are ㄱ (牙音, molar sound letter), ㄴ (舌音, lingual sound letter), ㅁ(脣音, labial sound letter), ㅅ (齒音, dental sound letter), ㅇ (喉音, guttural sound letter), were served as chief consonants. There was no argument that consonant letters were made by symbolizing the shape of vocal organs during phonation of them. It could be phoniatrically explained that all of five chief consonants were morphologically symbolized from left lateral view of vocal tract during articulation. Although 'ㄱ' was known as molar sound, it was not modeled the shape of molar tooth but modeled the shape of tongue at molar teeth bearing area. The same principle applies to 'ㅅ', and it was represented the shape of upper surface of anterior tongue instead of incisor teeth. 'ㄴ' was a lingual sound and directly shaped the shape of tongue. 'ㄷ' was made by addition of a stroke 'ㅡ' meaning hard palate above 'ㄴ'. 'ㅁ' was represented the shape of lateral view of anterior mouth. 'ㅇ' was looked like shaping left lateral view of laryngopharyngeal space.

Effect of Music Training on Categorical Perception of Speech and Music

  • L., Yashaswini;Maruthy, Sandeep
    • Korean Journal of Audiology
    • /
    • v.24 no.3
    • /
    • pp.140-148
    • /
    • 2020
  • Background and Objectives: The aim of this study is to evaluate the effect of music training on the characteristics of auditory perception of speech and music. The perception of speech and music stimuli was assessed across their respective stimulus continuum and the resultant plots were compared between musicians and non-musicians. Subjects and Methods: Thirty musicians with formal music training and twenty-seven non-musicians participated in the study (age: 20 to 30 years). They were assessed for identification of consonant-vowel syllables (/da/ to /ga/), vowels (/u/ to /a/), vocal music note (/ri/ to /ga/), and instrumental music note (/ri/ to /ga/) across their respective stimulus continuum. The continua contained 15 tokens with equal step size between any adjacent tokens. The resultant identification scores were plotted against each token and were analyzed for presence of categorical boundary. If the categorical boundary was found, the plots were analyzed by six parameters of categorical perception; for the point of 50% crossover, lower edge of categorical boundary, upper edge of categorical boundary, phoneme boundary width, slope, and intercepts. Results: Overall, the results showed that both speech and music are perceived differently in musicians and non-musicians. In musicians, both speech and music are categorically perceived, while in non-musicians, only speech is perceived categorically. Conclusions: The findings of the present study indicate that music is perceived categorically by musicians, even if the stimulus is devoid of vocal tract features. The findings support that the categorical perception is strongly influenced by training and results are discussed in light of notions of motor theory of speech perception.

Tube phonation in water for patients with hyperfunctional voice disorders: The effect of tube diameter and water immersion depth on bubble height and maximum phonation time (과기능적 음성장애 환자의 물저항발성: 튜브 직경과 물 깊이가 물거품 높이 및 최대발성지속시간에 미치는 영향)

  • Min Gyeong Kim;Seong Hee Choi;Jong-In Youn
    • Phonetics and Speech Sciences
    • /
    • v.15 no.2
    • /
    • pp.31-40
    • /
    • 2023
  • Tube phonation in water has been widely used for voice training among semi-occluded vocal tract (SOVT) exercises in which the patient bubbles with phonation keeping the tube submerged in water. This study aims to investigate the effect of tube diameter and water depth on bubble height and maximum phonation time (MPT) for patients with hyperfunctional voice disorders. Seventeen patients with hyperfunctional voice disorders were asked to bubble with sustained /u/ at the different inner diameters of tube (5, 7, and 10 mm), water depth (4, 7, and 10 cm). A water resistance phonation biofeedback system using a water height sensor was used for recording bubble height and MPT. The bubble height was significantly changed by the tube diameter while MPT was significantly changed with the tube diameter and water depth. Although the wider tube presented significantly lower bubble height for a given depth, relatively consistent bubble height was maintained. Depending on the water depth, the bubble height did not significantly differ for a given tube diameter. In addtion, MPT significantly decreased with water depth and a wider tube led significantly shorter MPT. A water level-driven water resistance biofeedback system provided useful information on bubble characteristics and vocal fold vibration depending on tube diameter and water depth. It can be useful to monitor the breath support during water resistance phonation for patients with hyperfunctional voice disorders.

How to Use EVT Figures for Actor Voice Training II (배우 음성 훈련을 위한 EVT 구조연습 활용방안 II)

  • Lee, Young-Su
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.2
    • /
    • pp.647-664
    • /
    • 2022
  • This study explores the possibility that the figure of the Estill Voice Training model, which is based on speech science, can contribute to the expansion of vocal expertise in the acting art where an actor creates a character. The purpose of this study is to examine the usage plan. The training model through the fluidity and structural functionality of the voice production organ is differentiated from the existing voice training that focuses only on the results of sound due to its ambiguous abstraction. Developing the voluntary coordination ability of the occipital region and vocal tract, such as False Vocal Folds, Cricoid Cartilage, Velum, AES, and Anchoring, has scientific efficiency that makes it easier to produce artistic target sounds, and it is a technical skill that can creatively overcome the functional limitations faced by actors. It can be used as a methodology. The Estill model Figure, which is a principle training for harmony and coordination between the elements of voice production, has a practical value that can be used as an alternative training model for the voice education of actors in Korea, where images and abstractions are the mainstream.

Detection of Human Papillomavirus in Laryngeal Squamous Cell Carcinomas (후두편평세포암종에서 인유두종 바이러스의 검출)

  • 김완수;박성용;마현웅;도남용;김용기;이도용;나한조
    • Korean Journal of Bronchoesophagology
    • /
    • v.4 no.2
    • /
    • pp.197-204
    • /
    • 1998
  • Human papillomavirus(HPV) is epitheliotrophic virus invading the anogenital tract and the upper aerodigestive tract HRV produces a diversity of benign and maljgnant tumors. In this study, the author determined the frequency of association of human papillomavirus(HPV) and laryngeal carcinomas and investigated the significance of HRV infection of different subtypes in the tumorigenesis of laryngeal carcinoma. Laryngeal squamous cell cancinomas from 34 patients who did not have preexisting papillomas by clinical history were retrieved from formalin-fixed, paraffin-embedded blocks and analyzed for HPV. Nineteen cases were tumors of the true vocal folds, 11 were supraglottic and 4 were transglottic. HPV detection was dane using polymerase chain reaction amplification with HPV L$_1$consensus primer. HPV type was determined by the same method using HPV-6, 11 and 16,-18 type-specific E6 primers. The results were as follows : 1) HPV DNA was detected in 7 cases among the 34 patients(20.6%). According to the type of HPV DNA HPV-11 was detected in 3 cases, HPV-16 was detected in 2 cases and HPV-6 and HPV-18 were detected in 1 case, respectively. 2) These 7 HPV-positive patients were advanced cancinoma cases. From these results, we concluded that HPV was thought to be the etiological factor of laryngeal squamous cell carcinomas.

  • PDF

Superiorly Based Flap Tracheostomy (Superiorly based flap을 이용한 기관절개술)

  • 정필상;이정구;정필섭;김영훈
    • Korean Journal of Bronchoesophagology
    • /
    • v.1 no.1
    • /
    • pp.129-135
    • /
    • 1995
  • The superiorly based flap tracheostomy(SBFT) has been advocated as an new technique of tracheostomy to manage a wide variety of causes of upper airway obstruction. This technique has particular applicability in patients who require long term tracheostomy such as in bilateral vocal cord paralysis and severe obstructive sleep apnea. SBFT has numerous advantages such as shortening of the gap between the skin and trachea : construction of a self-sustaining tract ; circumferential mucocutaneous junction to reduce infection, granulation tissue, bleeding, and stenosis of the tract : avoidance of the laryngotracheal damage : easy placement of a tracheostomal stent to promote speech, coughing and swallowing. Most of all, this technique can reduces the suprastomal buckling by the support of the superiorly based tracheal flap, and thus prevents the stenosis of suprastomal airway. The disadvantage of SBFT is more time-consuming procedure than the conventional tracheostomy, A retrospective analysis of 8 patients undergoing SBFT between June, 1994 and March, 1995 in Dankook University Hospital was performed to present the surgical technique and com-plication rates. The average duration of follow up was 11 months. The complications were consisted of a wound infection and a sternal granulation. The other complications including wound dehiscence, tracheitis, pneumonia, tracheal granulation, sternal narrowing and subglottic stenosis were not experienced.

  • PDF

Robust Speech Hash Function

  • Chen, Ning;Wan, Wanggen
    • ETRI Journal
    • /
    • v.32 no.2
    • /
    • pp.345-347
    • /
    • 2010
  • In this letter, we present a new speech hash function based on the non-negative matrix factorization (NMF) of linear prediction coefficients (LPCs). First, linear prediction analysis is applied to the speech to obtain its LPCs, which represent the frequency shaping attributes of the vocal tract. Then, the NMF is performed on the LPCs to capture the speech's local feature, which is then used for hash vector generation. Experimental results demonstrate the effectiveness of the proposed hash function in terms of discrimination and robustness against various types of content preserving signal processing manipulations.

Voice Source Estimation Using Robust Sequential SVD (견실 순차 특이치분해를 이용한 음원추정)

  • 홍성훈
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1993.06a
    • /
    • pp.75-79
    • /
    • 1993
  • 본 논문에서는 변화가 심한 음원파형을 추정하는 새로운 순차처리 알고리듬을 제안한다. 먼저, 1) 기존의 순차처리 분석법중 대표적인 분석법인 RLS(recursive least square)의 문제점들을 검토하고, 2) 이를 개선하기 위해서 관측행렬(observation matrix)을 최적차수의 SVD(reduced-rank singular value decomposition)로 재구성하고, 3) 이에 견실개념(robustness concept)을 적용해서 최적의 성도변수(vocal tract parameter)를 찾아내고 역필터를 적용해서 음원(voice source)을 효과적으로 구분해낸다. 본 논문에서 제안된 방법으로 음원을 추정할 경우, 변화가 심한 음원파형을 잘 추정할 수 있으며, 음원의 특성을 구분해낸 성도 파라미터도 효과적으로 추정할 수 있다. 본 연구내용은 음성합성에서 자연성 개선 및 개인성 구현을 위해서 필수적이며, 다양한 형태의 음성을 표현하기 위해 사용되어질 수 있다. 또한, 음성코딩, 화자인식, 음성인식에서도 사용되어질 수 있다.

  • PDF

Korean Broadcast News Transcription Using Morpheme-based Recognition Units

  • Kwon, Oh-Wook;Alex Waibel
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.1E
    • /
    • pp.3-11
    • /
    • 2002
  • Broadcast news transcription is one of the hardest tasks in speech recognition because broadcast speech signals have much variability in speech quality, channel and background conditions. We developed a Korean broadcast news speech recognizer. We used a morpheme-based dictionary and a language model to reduce the out-of·vocabulary (OOV) rate. We concatenated the original morpheme pairs of short length or high frequency in order to reduce insertion and deletion errors due to short morphemes. We used a lexicon with multiple pronunciations to reflect inter-morpheme pronunciation variations without severe modification of the search tree. By using the merged morpheme as recognition units, we achieved the OOV rate of 1.7% comparable to European languages with 64k vocabulary. We implemented a hidden Markov model-based recognizer with vocal tract length normalization and online speaker adaptation by maximum likelihood linear regression. Experimental results showed that the recognizer yielded 21.8% morpheme error rate for anchor speech and 31.6% for mostly noisy reporter speech.

On a Performance Evaluation of the Pitch Alteration Techniques of speech waveform coding (피치 변경법의 성능평가)

  • Kim, Hong;Bae, Seong-Gyun;Jo, Wang-Rae;Bae, Myung-Jin
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06c
    • /
    • pp.103-106
    • /
    • 1994
  • Generally we are used to apply waveform coding method obtaining the high quality synthesized speech. But we have to solve the problems, memory capacity and pitch alteration, for applying the waveform coding method to speech synthesis by rule. The former problem is conquered by improving the integrated semiconductor technology, but the latter problem remains. In this paper, we compare the methods that have proposed for pitch alteration in our laboratory until now. These methods are not change properties of vocal tract formants and only altered the pitch halving method, 1.14% for cepstrum analysis method, and 2.36% for hamonics compensated with the phase method.

  • PDF