• Title/Abstract/Keywords: vocal tract

173 results (processing time: 0.025 s)

Solitary Papilloma of the Right Upper Lobe Bronchus: Report of a Case and Review of the Literature

  • 배수동
    • Journal of Chest Surgery
    • /
    • Vol. 5, No. 2
    • /
    • pp.135-140
    • /
    • 1972
  • Papilloma of the upper respiratory tract, particularly of the larynx and vocal cords, is a relatively common disease. However, solitary papilloma of the bronchus is an extremely rare condition, and only a handful of cases have been reported in the literature. The patient was a 39-year-old housewife who had suffered from productive cough and occasional hemoptysis for the past year. Chest X-ray showed complete atelectasis of the right upper lobe. Bronchography revealed a hemispherical protruding mass in the right main bronchus with complete occlusion of the upper lobe bronchus. Bronchoscopy showed a whitish, friable mass in the lumen of the right main bronchus, and biopsy of the mass was reported as benign papilloma. Right upper lobectomy was performed together with wedge resection of the portion of the right main bronchus containing the tumor. The cut edges of the bronchus were stitched together with interrupted fine Dacron sutures. During this procedure, the right main bronchus was gently clamped with a non-crushing Satinsky-type clamp. The patient had an uneventful recovery from surgery and was discharged without symptoms. The patient is doing well three months after the operation.

A Rare Case of Acute Obstructive Laryngitis in a Cat with Severe Respiratory Distress

  • Hyeona Bae;Dongbin Lee;DoHyeon Yu
    • 한국임상수의학회지
    • /
    • Vol. 40, No. 2
    • /
    • pp.124-129
    • /
    • 2023
  • A 5-year-old neutered male domestic short-haired cat presented with a 2-day history of acute dyspnea characterized by open-mouth breathing and stridor. Direct visualization by laryngoscopy revealed diffuse laryngeal swelling and severe bilateral thickening of the vocal folds; the upper respiratory tract was thus obstructed by severe edema. Cytology of a fine-needle aspirate from the larynx showed neutrophil infiltration, and diagnostic imaging identified no discrete mass such as a polyp or neoplasia. The cat was diagnosed with acute obstructive laryngitis, and a tracheostomy tube was placed immediately. After 17 days of treatment with steroids, doxycycline, and azithromycin, the laryngeal swelling gradually improved, and there was no recurrence of laryngitis or respiratory obstruction. A feline upper respiratory polymerase chain reaction panel revealed Mycoplasma felis infection; however, it could not be determined whether the organism was pathogenic or opportunistic. Herein, we report a case of obstructive laryngitis in a cat. When respiratory obstruction due to acute laryngitis is identified, a good prognosis can be expected with rapid and appropriate treatment.

Comparison of Korean Speech De-identification Performance of Speech De-identification Model and Broadcast Voice Modulation

  • 김승민;박대얼;최대선
    • 스마트미디어저널
    • /
    • Vol. 12, No. 2
    • /
    • pp.56-65
    • /
    • 2023
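  • Broadcasts such as news and investigative programs modulate the voices of informants to protect their identities. The most widely used modulation method adjusts the pitch, but a voice close to the original can easily be restored simply by readjusting the pitch. Broadcast voice modulation therefore cannot properly protect a speaker's identity and is vulnerable from a security standpoint, so a new voice modulation method is needed to replace it. In this paper, we use the Lightweight speech de-identification model, whose de-identification performance was verified in the Voice Privacy Challenge, as the comparison model, and we conduct experiments and evaluations comparing its de-identification performance with that of pitch-based broadcast voice modulation. Of the six modulation methods of the Lightweight speech de-identification model, we used the three with good de-identification performance, McAdams, Resampling, and Vocal Tract Length Normalization (VTLN), and we conducted a human test and an EER (Equal Error Rate) test to compare de-identification performance on Korean speech. In both the human test and the EER test, the VTLN modulation method showed higher de-identification performance than broadcast modulation. Consequently, the modulation methods of the Lightweight model provide sufficient de-identification performance for Korean speech and could replace the insecure broadcast voice modulation.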
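
The abstract above reports an EER (Equal Error Rate) test but does not describe how it was computed. As a minimal sketch, assuming a speaker-verification system that outputs similarity scores where higher means "same speaker", the EER is the operating point at which the false-acceptance and false-rejection rates are equal. The function name, the score lists, and the threshold sweep below are illustrative assumptions, not the authors' evaluation code.

```python
def equal_error_rate(genuine, impostor):
    """Approximate the EER by sweeping every observed score as a threshold.

    genuine:  scores for same-speaker trials (hypothetical data)
    impostor: scores for different-speaker trials (hypothetical data)
    Returns (eer, threshold) at the point where FAR and FRR are closest.
    """
    best_gap = float("inf")
    best = (None, None)
    for t in sorted(set(genuine) | set(impostor)):
        # False-acceptance rate: impostor trials accepted at threshold t.
        far = sum(s >= t for s in impostor) / len(impostor)
        # False-rejection rate: genuine trials rejected at threshold t.
        frr = sum(s < t for s in genuine) / len(genuine)
        gap = abs(far - frr)
        if gap < best_gap:
            best_gap = gap
            best = ((far + frr) / 2, t)
    return best


if __name__ == "__main__":
    genuine = [0.9, 0.8, 0.7, 0.4]
    impostor = [0.6, 0.3, 0.2, 0.1]
    print(equal_error_rate(genuine, impostor))
```

In a de-identification setting, a higher EER for the verification system generally indicates that the modulated voice is harder to link back to the speaker.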

Intonation Conversion Using the Other Speaker's Excitation Signal

  • 이기영;최창석;최갑석;이현수
    • 한국음향학회지
    • /
    • Vol. 14, No. 4
    • /
    • pp.21-28
    • /
    • 1995
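  • As a basic study toward converting original speech into speech with a desired intonation, this paper proposes an intonation conversion method that uses another speaker's excitation signal. The method uses the other speaker's excitation signal as the intonation information: it extracts the vocal tract spectrum of the original signal that is matched by DTW to the other speaker's vocal tract spectrum, multiplies it by the spectrum of the excitation signal, and applies the inverse short-time Fourier transform to synthesize intonation-converted speech. To evaluate the synthesized speech, intonation conversion experiments were performed on Korean monophthongs and sentences uttered by 30 male speakers; fundamental frequency contours, spectrograms, and distortion measures were compared, and an MOS test was conducted. The results confirmed that the proposed method can convert arbitrary speech to the intonation of another speaker's speech.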
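
The DTW matching step mentioned in the abstract can be sketched as follows. This is a generic dynamic time warping alignment, assuming scalar frame features and an absolute-difference distance for brevity; in the paper's setting each frame would be a vocal tract spectrum and `dist` a spectral distance. The function name and the toy sequences are hypothetical.

```python
def dtw_path(a, b, dist=lambda x, y: abs(x - y)):
    """Align sequences a and b; return (total_cost, path of index pairs)."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j - 1],  # match
                                 cost[i - 1][j],      # a advances alone
                                 cost[i][j - 1])      # b advances alone
    # Backtrack from the end to recover which frames were paired.
    i, j = n, m
    path = [(i - 1, j - 1)]
    while (i, j) != (1, 1):
        _, i, j = min((cost[i - 1][j - 1], i - 1, j - 1),
                      (cost[i - 1][j], i - 1, j),
                      (cost[i][j - 1], i, j - 1))
        path.append((i - 1, j - 1))
    path.reverse()
    return cost[n][m], path


if __name__ == "__main__":
    print(dtw_path([1, 2, 3], [1, 2, 2, 3]))
```

The returned path tells which original-speaker frame to pair with each frame of the other speaker before the spectra are combined.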

The Acoustic Changes of Voice after Uvulopalatopharyngoplasty

  • 홍기환;김성완;윤희완;조윤성;문승현;이상헌
    • 음성과학
    • /
    • Vol. 8, No. 2
    • /
    • pp.23-37
    • /
    • 2001
  • The primary sound produced by vibration of the vocal folds reaches the velopharyngeal isthmus and is directed both nasally and orally. The proportion of each component is determined by the anatomical and functional status of the soft palate. Oral sounds comprise oral vowels and consonants, shaped by the status of the vocal tract, tongue, palate, and lips. Nasal sounds comprise nasal consonants and nasal vowels and are further modified by the status of the nasal airway, so anatomical abnormalities in the nasal cavity influence nasal sound. The measurement of nasal sounds in speech has relied on subjective scoring by listeners. Nasal sounds are described in terms of nasality and nasalization. Generally, nasality has been assessed perceptually when evaluating the effects of maxillofacial procedures for cleft palate, sleep apnea, snoring, and nasal disorders, whereas nasalization is considered an acoustic phenomenon. Snoring and sleep apnea are typical disorders caused by redundant velopharyngeal tissue. Sleep apnea is defined as a cessation of breathing for at least 10 seconds during sleep. Several medical and surgical methods for treating sleep apnea have been attempted. Uvulopalatopharyngoplasty (UPPP) involves removal of 1.0 to 3.0 cm of soft palate tissue together with redundant oropharyngeal mucosa and lateral tissue from the anterior and sometimes posterior faucial pillars. The procedure results in a shortened soft palate, and a possible risk following surgery is velopharyngeal malfunction due to the shortened palate. Few researchers have systematically studied the effects of this surgery on speech production. Some changes in voice quality, such as resonance (nasality), articulation, and phonation, have been reported. In view of these conflicting reports, some uncertainty remains about speech status in patients following surgery for snoring and sleep apnea. The study was conducted in two phases: 1) acoustic analysis of oral and nasal sounds, and 2) evaluation of nasality.

Emotion Recognition Based on Frequency Analysis of Speech Signal

  • Sim, Kwee-Bo;Park, Chang-Hyun;Lee, Dong-Wook;Joo, Young-Hoon
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • Vol. 2, No. 2
    • /
    • pp.122-126
    • /
    • 2002
  • In this study, we identify features of three emotions (happiness, anger, and surprise) as fundamental research for emotion recognition. An emotional speech signal has several elements, namely voice quality, pitch, formants, speech rate, and so on. Until now, most researchers have used the change of pitch, the short-time average power envelope, or Mel-based speech power coefficients. Pitch is a very efficient and informative feature, so we used it in this study. Because pitch is very sensitive to subtle emotion, it changes easily whenever a speaker is in a different emotional state. We can therefore observe whether the pitch changes steeply, changes with a gentle slope, or does not change. In addition, this paper extracts formant features from emotional speech signals. For each vowel, the formants lie at similar positions without large differences. Based on this fact, in the happiness case we extract features of laughter and, to simplify the task, treat laughter separately. We also find such features for anger and surprise.

On a Pitch Alteration Technique in the V/UV Spectrum for High Quality Speech Synthesis Technique

  • 조왕래;배명진;김동성
    • 한국음향학회지
    • /
    • Vol. 15, No. 6
    • /
    • pp.99-103
    • /
    • 1996
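  • Waveform coding is a coding method that simply preserves the shape of the speech waveform by removing its redundant components. In speech synthesis, waveform coding is mainly applied to high-quality synthesis by analysis. However, because this coding method does not separate the signal into excitation source and vocal tract filter parameters during analysis, it is difficult to apply to synthesis by rule. This paper proposes a new pitch alteration method that adjusts the pitch by modifying the spectral axis of the voiced spectrum only, in the spectral domain. The method operates in the frequency domain; even with a 50% pitch change, the spectral distortion rate was 2.7% or less, and it has the advantage that amplitudes connect naturally between frames because the phase characteristics are compensated in the time domain.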

Voice Personality Transformation Using a Multiple Response Classification and Regression Tree

  • 이기승
    • 한국음향학회지
    • /
    • Vol. 23, No. 3
    • /
    • pp.253-261
    • /
    • 2004
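  • This paper proposes a new voice personality transformation technique that converts the speaker-dependent feature parameters of a speech signal. The proposed method takes as its conversion targets the cepstrum vector, which reflects the characteristics of the vocal tract transfer function, and the pitch value, which reflects the characteristics of the excitation signal, and it uses a multiple response classification and regression tree as the conversion technique. A multiple response classification and regression tree is a multidimensional extension of the conventional classification and regression tree, that is, a tree whose response values are vectors. We evaluated the proposed technique against the conventional codebook mapping method and quantitatively analyzed tree complexity and conversion performance while varying the observations fed to the tree. In voice personality transformation experiments with four speakers, the proposed method showed objectively better performance than conventional codebook mapping, and listening tests also showed that the converted speech was similar to the target speaker's voice.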

Efficient Tracking of Speech Formant Using Closed Phase WRLS-VFF-VT Algorithm

  • Lee, Kyo-Sik;Park, Kyu-Sik
    • The Journal of the Acoustical Society of Korea
    • /
    • Vol. 19, No. 2E
    • /
    • pp.8-13
    • /
    • 2000
  • In this paper, we present an adaptive formant tracking algorithm for speech using the closed phase WRLS-VFF-VT method. The pitch-synchronous closed phase method is known to give more accurate estimates of the vocal tract parameters than the pitch-asynchronous method. However, the use of pitch-synchronous closed phase analysis has been limited by the difficulty of accurately isolating the closed phase region in successive periods of speech. We have therefore implemented the pitch-synchronous closed phase WRLS-VFF-VT algorithm for speech analysis, especially for formant tracking. The proposed algorithm with the variable threshold (VT) provides superior performance at phone boundaries and at voiced/unvoiced transitions. The proposed method is compared experimentally with other methods, such as the two-channel CPC method, using synthetic waveforms and real speech data. The experimental results show that block data processing techniques, such as the two-channel CPC, gave reasonable estimates of the formants/antiformants. However, the data windows used by these methods included the effects of the periodic excitation pulses, which affected the accuracy of the estimated formants. In contrast, the proposed WRLS-VFF-VT method, which eliminates the influence of the pulse excitation by using input estimation as part of the algorithm, gave very accurate formant/bandwidth estimates and good spectral matching.
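
The abstract does not say how formant and bandwidth values are read off the estimated vocal tract parameters. A common approach, shown here only as a hedged sketch and not as the WRLS-VFF-VT algorithm itself, is to take each complex pole of the estimated all-pole filter and convert its angle to a frequency and its radius to a 3 dB bandwidth. The function name, the example pole, and the 8 kHz sampling rate are assumptions for illustration.

```python
import cmath
import math


def pole_to_formant(pole, fs):
    """Convert one complex pole of an all-pole model to (frequency, bandwidth) in Hz.

    pole: complex pole inside the unit circle (hypothetical example value)
    fs:   sampling rate in Hz (assumed)
    """
    # The pole angle maps linearly onto frequency up to fs / 2.
    freq = abs(cmath.phase(pole)) * fs / (2 * math.pi)
    # The pole radius sets the decay rate, hence the resonance bandwidth.
    bandwidth = -math.log(abs(pole)) * fs / math.pi
    return freq, bandwidth


if __name__ == "__main__":
    pole = cmath.rect(0.95, math.pi / 4)  # radius 0.95, angle 45 degrees
    print(pole_to_formant(pole, fs=8000))
```

Poles closer to the unit circle give narrower bandwidths, which is why accurate pole estimates matter for the formant/bandwidth results the abstract describes.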

A Link between Perceived and Produced Vowel Spaces of Korean Learners of English

  • 양병곤
    • 말소리와 음성과학
    • /
    • Vol. 6, No. 3
    • /
    • pp.81-89
    • /
    • 2014
  • Korean learners of English tend to have difficulty perceiving and producing English vowels. The purpose of this study is to examine the link between the perceived and produced vowel spaces of Korean learners of English. Sixteen Korean male and female participants listened to two sets of synthetic English vowels presented on a computer monitor and rated their naturalness. The same participants produced English vowels in a carrier sentence with high and low pitch variation in a clear speaking mode. The author compared the perceived and produced vowel spaces in terms of the pitch and gender variables. First, the perceived vowel spaces did not differ significantly for either variable; the Korean learners perceived the vowels similarly and differentiated neither the tense-lax vowel pairs nor the low vowels. Secondly, the produced vowel spaces of the male and female groups showed a 25% difference, which may have come from physiological differences in vocal tract length. Thirdly, the comparison of the perceived and produced vowel spaces revealed that although the vowel space patterns of the Korean male and female learners appeared similar, which may suggest a relative link between perception and production, statistical differences existed in some vowels because of the acoustical properties of the synthetic vowels, which may suggest an independent link. The author concluded that any comparison between the perceived and produced vowel spaces of nonnative speakers should be made cautiously. Further studies would be desirable to examine how Koreans perceive different sets of synthetic vowels.
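
The abstract attributes part of the male-female difference to vocal tract length. As a rough illustration of why length matters, and not a reconstruction of the author's analysis, the textbook uniform-tube model (closed at the glottis, open at the lips) places resonances at odd multiples of c / (4L), so formants scale inversely with tract length. The tube lengths and the speed of sound below are illustrative assumptions.

```python
def tube_formants(length_cm, n=3, c_cm_s=35000.0):
    """Resonances (Hz) of a uniform tube closed at one end and open at the other.

    length_cm: assumed vocal tract length in centimetres
    n:         number of resonances to return
    c_cm_s:    assumed speed of sound in cm/s
    """
    # Quarter-wavelength resonator: F_k = (2k - 1) * c / (4 * L)
    return [(2 * k - 1) * c_cm_s / (4 * length_cm) for k in range(1, n + 1)]


if __name__ == "__main__":
    print(tube_formants(17.5))  # longer tract, lower resonances
    print(tube_formants(14.0))  # shorter tract, higher resonances
```

Real vowels depart from the uniform tube, so these numbers only indicate the direction and rough size of the length effect.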