Search | Korea Science

A Study on Voice Color Control Rules for Speech Synthesis System (음성합성시스템을 위한 음색제어규칙 연구)

Kim, Jin-Young;Eom, Ki-Wan
- Speech Sciences
- /
- v.2
- /
- pp.25-44
- /
- 1997
When listening the various speech synthesis systems developed and being used in our country, we find that though the quality of these systems has improved, they lack naturalness. Moreover, since the voice color of these systems are limited to only one recorded speech DB, it is necessary to record another speech DB to create different voice colors. 'Voice Color' is an abstract concept that characterizes voice personality. So speech synthesis systems need a voice color control function to create various voices. The aim of this study is to examine several factors of voice color control rules for the text-to-speech system which makes natural and various voice types for the sounding of synthetic speech. In order to find such rules from natural speech, glottal source parameters and frequency characteristics of the vocal tract for several voice colors have been studied. In this paper voice colors were catalogued as: deep, sonorous, thick, soft, harsh, high tone, shrill, and weak. For the voice source model, the LF-model was used and for the frequency characteristics of vocal tract, the formant frequencies, bandwidths, and amplitudes were used. These acoustic parameters were tested through multiple regression analysis to achieve the general relation between these parameters and voice colors.
PDF

A study of phonological regression in 2-6 years of Korean children (서울-경기 지역 2-6세 아동의 발달기적 음운변동에 관한 연구 - 자음을 중심으로 -)

Kim Young-Tae
- MALSORI
- /
- no.21_24
- /
- pp.3-24
- /
- 1992
This study was designed to investigate the changes of phonological processes in normal Korean children aged from 2- to 6-years. Forty eight children who lived in Seoul or Kyung-Ki do were tested with a picture articulation test and their articulation errors including omissions, additions and substitutions were coded into phonological processes. Those phonological processes were discussed in several ways: syllable structure, place, manner, assimilation, tenseness, and aspiration of sounds. Data were analyzed by two ways: (1) number of subjects who showed each process and (2) percentage of occurrence of each process. Analyses in omission-addition processes demonstrated that postvocalic omission occurred most frequently, followed by velar-, alveolar-, and glottal omission. Analyses in substitution processes showed that fronting (palatal and velar), backing (alveolar), and alveolization occurred most frequently in terms of the place of sounds. In terms of assimilation, alveolar-, stopping, and aspiration assimilation occurred frequently. Analyses by the tenseness and aspiration showed similar occurrences among the 4 processes, with slightly higher occurrences in tensing and aspiration than lanxing and deaspiration. All of the processes decreased by age. The numbers of the processes showed by more than half of the children or exceeded 10％ of occurrence were 20 in 2-years of age, 10 in 3-years of age, 1 in 4-years of age, and none in ages of 5 and 6.
PDF

Phonological Characteristics of Early Vocabulary in Young Children with Cleft Palate (구개열 아동의 초기 어휘에 나타난 음운 특성 연구)

Ha, Seunghee
- Phonetics and Speech Sciences
- /
- v.6 no.2
- /
- pp.65-71
- /
- 2014
The purpose of this study was to investigate whether young children with cleft palate differ from those of noncleft typically developing children in terms of expressive vocabulary size, phonological characteristics and lexical selectivity. A total of 12 children with cleft palate and 12 noncleft children who were matched by age and gender participated in the study. The groups were compared by size of expressive vocabulary reported on Korean version of MacArthur-Bates Communicative Development Inventories and the number of different words, consonant inventory, the percentage of words beginning with obstruents and vowels, nasal, and glottal sounds, and the percentage of words which do not include obstruents in a language sample. Also, correlation analysis were performed to examine the relationship between measures on size of expressive vocabulary and phonological characteristics. The results showed that expressive vocabulary size and consonant inventory for children with cleft palate produced significantly smaller than those for noncleft children. Children with cleft palate produced significantly more words beginning with vowel or which do not include obstruents, and fewer words beginning with obstruents than noncleft children. The two groups showed different results on significant correlations between measures on size of expressive vocabulary and phonological characteristics indicating that children with cleft palate show different lexical selectivity from their noncleft peers. The results suggest that children with cleft palate aged 18-30 months demonstrate a slower rate of lexical and phonological development compared with their noncleft peers and they develop lexical selectivity reflecting cleft palate speech. The results will have a clinical implication on speech-language intervention for young children with cleft palates.
https://doi.org/10.13064/KSSS.2014.6.2.065 인용 PDF KSCI

Quasi-periodic waveform analysis for diplophonia (이중음성에 대한 음성파형분석)

홍기환;김미정;정상술
- Proceedings of the KOR-BRONCHOESO Conference
- /
- 1993.05a
- /
- pp.71-71
- /
- 1993
Diplophonia is produced by the voice of two separate tones and produced through quasi-periodic variations in the vocal cord vibration. Diplophonia is generally regarded as a symptom of laryngeal pathology. The difference in the vibratory frequency between the vocal cords can be seen in a tension imbalance and a difference in the level of the vocal folds under the special condition such as incomplete glottal closure. So authors have experienced 19 cases of patient with diplophonia for the unilateral vocal cord paralysis, intracordal cysts and other mass lesions. And we analysed the diplophonic voice with peak variability and noise level for the quasi-periodic waveforms and spectrograms pre-and postoperatively.
PDF

Arytenoid Adduction as a Surgical Treatment for Hoarseness with Unilateral Vocal Cord Paralysis (편측성대마비환자에 대한 피열연골내전술)

김광문;김영호;홍원표;최홍식
- Proceedings of the KOR-BRONCHOESO Conference
- /
- 1993.05a
- /
- pp.74-74
- /
- 1993
Unilateral vocal cord paralysis is induced by various causes and its effective treatment has been diversely searched out until now. Currently used treatment modalities are intracordal injection of exogenous materials such as Teflon or Silicone, and thyroplasty and so forth. But, with the above mentioned modalities, it has been not satisfactory to obtain a good postoperative results especially in cases when the glottal incompetence is very severe or the level difference between the vocal cords is large. In such cases, vocal cord adduction can be accomplished by anteromedial traction of the muscular process of paralyzed vocal cord via surgical exposure resulting improvement of voice quality. Recently, authors performed arytenoid adduction in 3 cases of unilateral vocal cord paralysis to obtain a better improvement of voice quality, and experienced satisfiable postoperative results.
PDF

Medialization Thyroplasty with Silastic- Decision Making & Practical Points (Silastic을 이용한 내전 갑상성형술-적용 및 술기)

Choi, Hong-Shik
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
- /
- v.18 no.1
- /
- pp.7-10
- /
- 2007
Unilateral vocal fold paralysis resulting in glottal incompetence can cause significant morbidity attributable to impaired speech, swallowing, and ability to protect the airway. The treatment of unilateral vocal cord paralysis has a long history, marked by technical innovations and improvements. These methods typically use endoscopic injection or implants to augment the volume of the affected vocal fold. The first known treatment, reported by Brunnings in 1911, was paraffin injection. The first thyroplasty medializing the paralysed vocal cord was performed by Payr in 1915 ; here, a cartilage door-flap was created from the thyroid ala to obtain better voice quality. In the 1970s, Isshiki systematized and developed the use of the external medialization by Payr. Later he modified his original technique, and achieved safer and better results. Many other methods were introduced for external medialization during the 1980s and 1990s. There has been couple of materials using for medialization laryngoplasty: silicone bloc, cartilage, goretex (polytetrafluoroethylene), titanium, etc. Among them, silicone bloc is the most popularly used material. Type I thyroplasty in combination with arytenoid adduction is a proven technique for medialization of the paralysed vocal fold. In this paper, personal experience for using silicone bloc type I thyroplasty : decision making and practical points, long-term results and complication of the procedure will be discussed.
PDF

Segmentation of the Glottis and Quantitative Measurement of the Vocal Cord Mucosal Morphology in the Laryngoscopic Image (후두 내시경 영상에서의 성문 분할 및 성대 점막 형태의 정량적 평가)

Lee, Seon Min;Oh, Seok;Kim, Young Jae;Woo, Joo Hyun;Kim, Kwang Gi
- Journal of Korea Multimedia Society
- /
- v.25 no.5
- /
- pp.661-669
- /
- 2022
The purpose of this study is to compare and analyze Deep Learning (DL) and Digital Image Processing (DIP) techniques using the results of the glottis segmentation of the two methods followed by the quantification of the asymmetric degree of the vocal cord mucosa. The data consists of 40 normal and abnormal images. The DL model is based on Deeplab V3 architecture, and the Canny edge detector algorithm and morphological operations are used for the DIP technique. According to the segmentation results, the average accuracy of the DL model and the DIP was 97.5% and 94.7% respectively. The quantification results showed high correlation coefficients for both the DL experiment (r=0.8512, p<0.0001) and the DIP experiment (r=0.7784, p<0.0001). In the conclusion, the DL model showed relatively higher segmentation accuracy than the DIP. In this paper, we propose the clinical applicability of this technique applying the segmentation and asymmetric quantification algorithm to the glottal area in the laryngoscopic images.
https://doi.org/10.9717/kmms.2022.25.5.661 인용 PDF KSCI HTML

Characteristics of the General American English exposed in Tourist Business (관광산업 현장에서 표출되는 미국 영어의 특색)

Hong, Kwang-Hee
- Korean Business Review
- /
- v.5
- /
- pp.241-274
- /
- 1992
General American English(=A.E.) has conservative elements as well as progressive elements. A.E. and B.E. are languages which have more similarities than differances. In this paper. I studied the process of English progress before the A.E. had come into being, and the historical background and the cahristics of A.E. coming into being. Considering the differences between A.E. and B.E. from spelling, pronunciation, vocabulary and grammar, I can give the outline as follows. A spelling 1. B.E. : au, ou $${\rightarrow}$$A.E. : a, o 2. B.E. : e $${\rightarrow}$$A.E. : i 3. B.E. : $${\ae}$$ oe $${\rightarrow}$$A.E. : e 4. B.E. : our $${\rightarrow}$$A.E. : or 5. B.E. : re $${\rightarrow}$$A.E. : er B. pronunciation 1. B.E. : [e] $${\rightarrow}$$A.E. : [i], [e], $$[\partial]$$ 2. B.E. : [a] $${\rightarrow}$$A.E. : 3. B.E. : [i(:)] $${\rightarrow}$$A.E. : [ai], $$[\partial]$$, $$[{\varepsilon}]$$ 4. B.E. : $$[{\ae}]$$ $${\rightarrow}$$A.E. : [e], [c] 5. B.E. : [ai] $${\rightarrow}$$A.E. : $$[{\ae}]$$, [e] 6. B.E. : [c] $${\rightarrow}$$A.E. : [e], [a], [o] 7. In case of "Vowel+[t]+Vowel", [t] is pronounced into [d] or [r] 8. In case of "-nt", [t] becomes a mute. 9. [t]+[j, l, m, n, r, u, or, w] $${\rightarrow}$$A.E. : [?] (=glottal stop) 10. B.E. : [w] $${\rightarrow}$$A.E. : [hw] 11. B.E. : [Voiceless consonants], [Voiced consonants] $${\leftarrow}$$A.E. : [Voiced consonants], [Voiceless consonants] C. Vocabulary The historical background and geographical conditions of those days caused lots of new compounds and neologies. D. Grammar Though we use "of" to indicate the possessive case of inanimate object, -s genitive is used in A.E. In the perfect tense, "have" is often omitted and also auxiliary verb "will" is used in any case
PDF

Analysis of Phonatory Aerodynamic & Electroglottography of a Countertenor (Countertenor 1인의 Modal Register와 Falsetto Register에서의 공기역학적 변화 및 전기성문파형의 변화 연구)

Nam, Do-Hyun;Choi, Seong-Hee;Choi, Jae-Nam;Choi, Hong-Shik
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
- /
- v.17 no.1
- /
- pp.43-48
- /
- 2006
Background and Objectives: Countertenors who can produce higher vocal pitch like female classical singer's voice and use both modal and falsetto register. This study was conducted to study phonatory characteristics between modal and falsetto register of the countertenor. Materials and Methods: A male countertenor who had 8 years of experience was examined using a videostroboscopy and his voice was analyzed using aerodynamic measures; fundamental frequency(F0), Mean air flow rate(MFR), intensity(SLP), subglottal air pressure(Psub) with phonatory function analyzer(Nagashima) and acoustic measures; jitter, shimmer, HNR, closed quotient(CQ) using a Electro-glottography(EGG) of Lx. Speech Studio(Laryngoscope, Ltd, UK) and voice range profile of CSL(Kay elemetrics). Results: In the stroboscopy finding, the longitudinal length of vocal folds was increased at the falsetto register and the upper margin of vocal folds vibrated with incomplete closure of true vocal folds. In aerodynamic analysis, intensity was same at the modal and falsetto register. However, MFR, Psub, MPT were higher at the falsetto register. In the electroglottographic analysis, closed quotient(CQ) at the modal register was high and also much higher at the high-pitch falsetto than at the loud falsetto. In the VRP, intensity was similar though F0 was different between modal and falsetto register. Conclusion: It implied that countertenor could produce powerful voice quality by increasing of respiratory pressure and respiratory volume though glottal closure was incomplete. In addition, no change of EGG waveform, similar voice range with alto was observed.
PDF

A New Pitch Detection Method Using The WRLS-VFF-VT Algorithm (WRLS-VFF-VT 알고리듬을 이용한 새로운 피치 검출 방법)

Lee, Kyo-Sik;Park, Kyu-Sik
- The Transactions of the Korea Information Processing Society
- /
- v.5 no.10
- /
- pp.2725-2736
- /
- 1998
In this paper. we present a new pitch determination method for speech analysis. namely VFF(Variable Forgetting Factor) based. by using the WRLS-VFF-VT(Weighted Recursive Least Square-Variable Forgetting Factor-Variable Threshold) algorithm. A proposed method uses VFF to identify the glottal closure points which correspond to the instants of the main excitation pulses for voiced speech. The modified EGG
PDF

Search Result 138, Processing Time 0.021 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)