• Title/Summary/Keyword: voice data

Search Result 1,256, Processing Time 0.028 seconds

Comparison of the Surgical Results in Mutational Dysphonia between Unilateral Shortening of Thyroid Cartilage Method and Bilateral Shortening of Thyroid Cartilage Method in Type III Thyroplasty (변성발성장애의 제3형 갑상연골성형술시 갑상연골익의 편측절제술과 양측절제술과의 치료성적 비교)

  • 최홍식;김세헌;김영호;이익호;김광문
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.7 no.1
    • /
    • pp.61-68
    • /
    • 1996
  • Failure to change from the higher pitched voice of preadolescence to the lower pitched voice of adolescence and adulthood is called "mutational dysphonia" The voice is weak, thin, breathy, hoarse, and mono-pitched. If the voice theraphy was failed, surgery to lower vocal pitch which is refered to thyroplasty type III, is indicated. We compared the post-op acoustic parameters with pre-op data in unilateral antero-posterior shortening of the thyroid cartilage method and bilateral antero-posterior shortening of the thyroid cartilage method each other. Bilateral antero-posterior shortening of the thyroid cartilage method shows significant drop of fundamental frequency and speaking fundamental frequency statistically than unilateral shortening method. There was no significant differences in Jitter, Shimmer, SNR, MFR and other psychoacoustic analysiss parameters between two groups. These data shows that unequal tension of the vocal cord in uilateral antero-posterior shortening of the thyroid cartilage method does not control the pitch effectively so bilatreal shortening method in Type III thyroplasty is recommandable procedure in surgery of the mutational dysphonia.

  • PDF

Analysis of traffic control policies in the voice-date integrated cellular CDMA mobile network (음성 및 데이터가 혼합된 CDMA 셀룰러 망에서의 트래픽 제어 분석 방법)

  • Yoon, Bok-Sik;Lee, Nam-Jun;Lee, Dong-Kie;Lie, Chang-Hoon
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.22 no.4
    • /
    • pp.771-788
    • /
    • 1996
  • A CDMA-based cellular mobile telecommunication system has already been developed and is expected to provide more stable mobile communication services for much more users than traditional analog mobile systems. As a natural course of development, the CDMA mobile system is expected to provide ISDN services in the near future. In this paper, we analyze several traffic control policies for the voice-data integrated traffic in the cellular CDMA system. We first select four admission control policies which take differences in traffic and QOS characteristics between voice and data into account, and then develop modelling and analysis techniques, which can be used directly to analyze the chosen control policies. Our approach is based on so-called threshold model. Numerical computation results obtained under the typical traffic situation are also given. Through these computation results we could tentatively conclude that the cutoff priority policy, which can provide the priority for handoff voice cells while effectively utilizing unused channels, seems to be most effective among the four policies.

  • PDF

A Generalized Subspace Approach for Enhancing Speech Corrupted by Colored Noise Using Voice Activity Detector(VAD) (음성활동영역검색을 사용하는 유색잡음에 오염된 음성의 향상을 위한 일반화 부공간 접근)

  • Son, Kyung-Sik;Kim, Hyun-Tae
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.17 no.8
    • /
    • pp.1769-1776
    • /
    • 2013
  • In this paper, we proposed the modified YL(Yi and Loizou) algorithm, using a VAD(voice activity detector) for enhancing speech corrupted by colored noise. The performance of the proposed algorithm has been compared to the YL algorithm and LS(Lee and Son, etc.) algorithm by computer simulation. The colored noises used in the experiment were a car noise and multi-talker babble from the AURORA data base and the used voices from the TIMIT data base. It is confirmed that the proposed algorithm shows better performance from SNR(signal to noise ratio) and SSD(speech spectral distortion) viewpoint over the previous two approach.

A Phonetic Study of 'Sasang Constitution' (음성학적으로 본 사상체질)

  • Moon Seung-Jae;Tak Ji-Hyun;Hwang Hyejeong
    • MALSORI
    • /
    • v.55
    • /
    • pp.1-14
    • /
    • 2005
  • Sasang Constitution, one branch of oriental medicine, claims that people can be classified into four different 'constitutions:' Taeyang, Taeum, Soyang, and Soeum. This study investigates whether the classification of the constitutions could be accurately made solely based on people's voice by analyzing the data from 46 different voices whose constitutions were already determined. Seven source-related parameters and four filter-related parameters were phonetically analyzed and the GMM(Gaussian mixture model) was tried on the data. Both the results from phonetic analyses and GMM showed that all the parameters (except one) failed to distinguish the constitutions of the people successfully. And even the single exception, B2 (the bandwidth of the second formant) did not provide us with sufficient reasons to be the source of distinction. This result seems to suggest one of the two conclusions: either the Sasang Constitutions cannot be substantiated with phonetic characteristics of peoples' voices with reliable accuracy, or we need to find yet some other parameters which haven't been conventionally proposed.

  • PDF

Implementation of speech interface for windows 95 (Windows95 환경에서의 음성 인터페이스 구현)

  • 한영원;배건성
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.34S no.5
    • /
    • pp.86-93
    • /
    • 1997
  • With recent development of speech recognition technology and multimedia computer systems, more potential applications of voice will become a reality. In this paper, we implement speech interface on the windows95 environment for practical use fo multimedia computers with voice. Speech interface is made up of three modules, that is, speech input and detection module, speech recognition module, and application module. The speech input and etection module handles th elow-level audio service of win32 API to input speech data on real time. The recognition module processes the incoming speech data, and then recognizes the spoken command. DTW pattern matching method is used for speech recognition. The application module executes the voice command properly on PC. Each module of the speech interface is designed and examined on windows95 environments. Implemented speech interface and experimental results are explained and discussed.

  • PDF

Differentiated message handling and performance evaluation for the NGN call control services (NGN 서비스의 호 처리 차별화 방안 및 성능분석)

  • 정문조;황찬식
    • Journal of the Institute of Electronics Engineers of Korea TC
    • /
    • v.40 no.12
    • /
    • pp.145-154
    • /
    • 2003
  • In this paper we propose service schemes for the control message of voice and data connections served by a Softswitch in NGN (next generation networks). After that we propose a method of evaluating the performance of a Softswitch that provides a limited delay to voice connections. Via numerical experiments, we verify the implication of the proposition in the design of a Softswitch, which simultaneously incorporates voice and data services in the NGN framework.

Voice Recognition Softwares: Their implications to second language teaching, learning, and research

  • Park, Chong-won
    • Speech Sciences
    • /
    • v.7 no.3
    • /
    • pp.69-85
    • /
    • 2000
  • Recently, Computer Assisted Language Learning (CALL) received widely held attention from diverse audiences. However, to the author's knowledge, relatively little attention was paid to the educational implications of voice recognition (VR) softwares in language teaching in general, and teaching and learning pronunciation in particular. This study explores, and extends the applicability of VR softwares toward second language research areas addressing how VR softwares might facilitate interview data entering processes. To aid the readers' understanding in this field, the background of classroom interaction research, and the rationale of why interview data, therefore the role of VR softwares, becomes critical in this realm of inquiry will be discussed. VR softwares' development and a brief report on the features of up-to-date VR softwares will be sketched. Finally, suggestions for future studies investigating the impact of VR softwares on second language learning, teaching, and research will be offered.

  • PDF

CSMA/CD-TDM/SD Adaptive Control Scheme in Bus-type Integrated Date/Voice Local Atrea Networks (버스형 데이터/음성 공용 LAN에서의 CSMA/CD-TDM/SD 적응제어방식)

  • 황병문;최흥문
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.12 no.2
    • /
    • pp.148-159
    • /
    • 1987
  • This paper proposes CSMA/CD-TDM/SD(carrier sense multiple access/collision detection-time division multiplexing/silence detection) control scheme in bus type integrated data/voice local area networks. Simulation results show that this control scheme satisfies the lossless real-time constraints of the voice traffic and improves the data throughput-delay characteristics as compared to those of the CSMA/CD/MPD and the CSMA/CD-TDMA.

  • PDF

An Acoustic Analysis of Vowels for Severe-profound Hearing Impaired Children (최고도이상의 청력손실을 가진 아동의 모음음형대 분석)

  • Huh, Myung-Jin
    • Speech Sciences
    • /
    • v.14 no.2
    • /
    • pp.65-71
    • /
    • 2007
  • The severe-profound hearing impaired children have various disorders in everday communication due to the lack of hearing feedback. Especially, their speech produced unstable voice, omission and distortion of articulation, pitch break, cul-de-sac voice, and so on so that they were difficult to accurately deliver an intended message. This study attempts to analyze the acoustic characteristics of 4 vowel sounds produced by 35 severe-profound hearing impaired children using CSL(Computerized Speech Lab, Model 4300b). The formant data were obtained from the spectrogram and analyzed data by 12 formant filter and auto-correlation among the formants. Results showed that the hearing impaired children's formant values came out very high. They produced the vowels at the mode of hypertension with unstable voice. In order to improve their speech, they would need some adequate auditory feedback.

  • PDF

Diagnosing Vocal Disorders using Cobweb Clustering of the Jitter, Shimmer, and Harmonics-to-Noise Ratio

  • Lee, Keonsoo;Moon, Chanki;Nam, Yunyoung
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.11
    • /
    • pp.5541-5554
    • /
    • 2018
  • A voice is one of the most significant non-verbal elements for communication. Disorders in vocal organs, or habitual muscular setting for articulatory cause vocal disorders. Therefore, by analyzing the vocal disorders, it is possible to predicate vocal diseases. In this paper, a method of predicting vocal disorders using the jitter, shimmer, and harmonics-to-noise ratio (HNR) extracted from vocal records is proposed. In order to extract jitter, shimmer, and HNR, one-second's voice signals are recorded in 44.1khz. In an experiment, 151 voice records are collected. The collected data set is clustered using cobweb clustering method. 21 classes with 12 leaves are resulted from the data set. According to the semantics of jitter, shimmer, and HNR, the class whose centroid has lowest jitter and shimmer, and highest HNR becomes the normal vocal group. The risk of vocal disorders can be predicted by measuring the distance and direction between the centroids.