• Title/Summary/Keyword: speech factors

Search Result 352, Processing Time 0.025 seconds

Patterns of consonant deletion in the word-internal onset position: Evidence from spontaneous Seoul Korean speech

  • Kim, Jungsun;Yun, Weonhee;Kang, Ducksoo
    • Phonetics and Speech Sciences
    • /
    • v.8 no.1
    • /
    • pp.45-51
    • /
    • 2016
  • This study examined the deletion of onset consonant in the word-internal structure in spontaneous Seoul Korean speech. It used the dataset of speakers in their 20s extracted from the Korean Corpus of Spontaneous Speech (Yun et al., 2015). The proportion of deletion of word-internal onset consonants was analyzed using the linear mixed-effects regression model. The factors that promoted the deletion of onsets were primarily the types of consonants and their phonetic contexts. The results showed that onset deletion was more likely to occur for a lenis velar stop [k] than the other consonants, and in the phonetic contexts, when the preceding vowel was a low central vowel [a]. Moreover, some speakers tended to more frequently delete onset consonants (e.g., [k] and [n]) than other speakers, which reflected individual differences. This study implies that word-internal onsets undergo a process of gradient reduction within individuals' articulatory strategies.

A new acoustical parameter for speech intelligibility with regard to early vertical reflections (초기 수직반사음의 역할을 고려한 새로운 명료도 지표)

  • Park, Jong Young;Han, Myung Ho;Jeong, Dae Up;Oh, Yang Ki
    • KIEAE Journal
    • /
    • v.7 no.3
    • /
    • pp.63-70
    • /
    • 2007
  • It is known that early reflections, their energy and delay times after the arrival of direct sound are important factors for speech intelligibility. In this basis, acoustical parameters like D50 and C80 had been proposed and are widely used for assessing the listening condition of rooms. These parameters are focused on the fraction of the early energy to the total, regardless of the spatial characteristics of the early reflections. This means that all the early reflections, arrived in certain time boundary. from front, behind, down and upside have the same impact on speech intelligibility. From the questionable simplicity, the influence of the direction of early reflections on speech intelligibility is examined in this study. A computer simulation speech intelligibility test, conducted for 22 university students, found that the reflection of vertical direction with method of the Paired comparison also the preference of 0.746 degree was visible an increase.

Prediction of Prosodic Boundaries Using Dependency Relation

  • Kim, Yeon-Jun;Oh, Yung-Hwan
    • The Journal of the Acoustical Society of Korea
    • /
    • v.18 no.4E
    • /
    • pp.26-30
    • /
    • 1999
  • This paper introduces a prosodic phrasing method in Korean to improve the naturalness of speech synthesis, especially in text-to-speech conversion. In prosodic phrasing, it is necessary to understand the structure of a sentence through a language processing procedure, such as part-of-speech (POS) tagging and parsing, since syntactic structure correlates better with the prosodic structure of speech than with other factors. In this paper, the prosodic phrasing procedure is treated from two perspectives: dependency parsing and prosodic phrasing using dependency relations. This is appropriate for Ural-Altaic, since a prosodic boundary in speech usually concurs with a governor of dependency relation. From experimental results, using the proposed method achieved 12% improvement in prosody boundary prediction accuracy with a speech corpus consisting 300 sentences uttered by 3 speakers.

  • PDF

Predicting Variables of Speech Intelligibility in Adults with Hearing Impairment: Focusing on Correct Articulation (청각장애 성인의 말명료도 예측 요인: 조음정확도를 중심으로)

  • Sung, Hee-Jung;Choi, Eun-Ah;Yoon, Mi-Sun
    • MALSORI
    • /
    • no.61
    • /
    • pp.1-14
    • /
    • 2007
  • The purpose of this study was to analyze the relationship between segmental correctness and speech intelligibility in adults with hearing impairment. Segmental correctness was measured by percentage of correct vowels(PCV) and percentage of correct consonants(PCC). The results were shown as follows: First, PCV and PCC could predict speech intelligibility with statistical significance. Second, in consonant classes divided by place and manner of articulation, the PCC of plosives and alveolar sounds were significant predicting variables in each group ($R^{2}=50%;\;59%$). According to this study, the importance of segmental correctness on speech intelligibility of adults with hearing impairment was confirmed. Also correctness of plosive sounds in manner and alveolar sounds in place were significant factors to speech intelligibility.

  • PDF

Creation and Assessment of Korean Speech and Noise DB in Car Environments (자동차 환경에서의 노이즈 DB 및 한국어 음성 DB 구축)

  • Lee Kwang-Hyun;Kim Bong-Wan;Lee Yong-Ju
    • MALSORI
    • /
    • no.48
    • /
    • pp.141-153
    • /
    • 2003
  • Researches into robust recognition in noise environments, especially in car environments, are being carried out actively in speech community. In this paper we will report on three types of corpora that SiTEC (Speech Information TEchnology & industry promotion Center) has created for research into speech recognition in car noise environments. The first is the recordings of 900 Korean native speakers, distributed according to gender, age, and region, who uttered application words in car environments. The second is the collections of mixed noise in 3 car types by model while setting up various noise patterns which can be obtained with the car engine on or off, at different driving speed, and in different road conditions with windows open or closed. The third is the recordings of simulated speech by HATS (Head and Torso Simulator) in car environments with the internal and external noise factors added. These three types of recordings were all made through synchronized 8 channel microphones that are fixed in a car. The creation and applications of these corpora will be reported on in detail.

  • PDF

Speech Perception Ability of Schizophrenics - A Comparative Study with Depressives & Normal Control - (정신분열병환자의 언어지각 능력 - 우울증 환자군, 정상인과의 비교 연구 -)

  • Chung, Young-Cho;Lee, Soon Jeong;Lee, Seung-Hwan
    • Korean Journal of Biological Psychiatry
    • /
    • v.9 no.2
    • /
    • pp.112-119
    • /
    • 2002
  • Object:This study was to investigate the difference of speech perception ability in schizophrenic patients, and depression patients in order to explore trait-dependent speech perception ability of each disorder. Methods:The speech perception ability was assessed with masked speech tracking test(MST) in schizophrenic patients(N=31), depression patients(N=25), and normal controls(N=21). The continuous performance test(CPT) and sentence repetition test(SRT) were also used for assessment of attention and working memory. Results:The schizophrenic patients showed significant impaired MST performance, compared with depressive patients and normal controls. The performances of CPT and SRT were also more impaired in schizophrenic patients. The difference of MST performances between two patient group was cancelled out after consideration of differences in CPT & SRT performances. Conclusions:These results imply that schizophrenic patients have the impaired speech perception ability compared with depressive patients and normal controls. But speech perception ability was significantly influenced with CPT and SRT. For evaluation of pure speech perception ability, the more elaborate controlled study that excluded factors such as attention, working memory and intelligence is needed.

  • PDF

Analysis of Lexical Effect on Spoken Word Recognition Test (낱말 인식 검사에 대한 어휘적 특성의 영향 분석)

  • Yoon, Mi-Sun;Yi, Bong-Won
    • Proceedings of the KSPS conference
    • /
    • 2005.04a
    • /
    • pp.77-80
    • /
    • 2005
  • The aim of this paper was to analyze the lexical effects on spoken word recognition of Korean monosyllabic word. The lexical factors chosen in this paper was frequency, density and lexical familiarity of words. Result of the analysis was as follows; frequency was the significant factor to predict spoken word recognition score of monosyllabic word. The other factors were not significant. This result suggest that word frequency should be considered in speech perception test.

  • PDF

A Study on the Objective Evaluation Model of Telephone Transmission Quality (통화품질 객관평가 모델링에 관한 연구)

  • 조재철;박순영;방만원
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.16 no.6
    • /
    • pp.509-516
    • /
    • 1991
  • In this paper, we propose on objective evaluation model of telephone transmission qulity in order to estimate a satisfaction score regarding speech quality in a relephone network. As the degradantion factors of telephone transmission quality, this model takes into account transmission loss, noise, distortion, talker echo and sidetone. A performance index[PI] is introduced for five psychological factors affecting telephone speech qualty, and a Mean Opinion Score(MOS) is estimated from the sum of all Pis. The simulation results indicate theat the MOS obtained from the objective evaluation model is in good agreement with that of subjective evaluation.

  • PDF

Speech Stimuli on the Diagnostic Evaluation of Speech with Cleft Lip and Palate : Clinical Use and Literature Review (구개열 환자 말 평가 시 검사어에 대한 고찰 : 임상현장의 말 평가 어음자료와 문헌적 고찰을 중심으로)

  • Choi, Seong-Hee;Choi, Jae-Nam;Nam, Do-Hyun;Choi, Hong-Shik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.16 no.1
    • /
    • pp.33-48
    • /
    • 2005
  • Differential diagnosis of articulation and resonance problems in the cleft lip and palate speech is required for evaluating various factors contribute to speech problems such as VPI, dental occlusion, palatal fistulae, learning. However, validity of speech stimuli is current issue to evaluate accurately each problem in cleft speech. This study was conducted to investigate speech stimuli using in the clinical setting and review the literatures and articles published 1990 to 2005 for helping develop standardized speech samples. The results were recommendation to evaluate properly velopharyngeal function when conducting a diagnostic evaluation as follows : 1) In identification hypernasality, the speech stimuli should be included low pressure consonants to eliminate effects of nasal emission, compensatory articulation. 2) Speech stimuli should be consist of visual, front sounds to eliminate compensatory articulation and to stimulate easily. 3) Regarding early diagnosis and treatment, speech stimuli need to develop for infants and preschooler. 4) Stimulus length on nasalance scores should be at least 6 syllables. 5) In phonetic context on nasalance scores, /i/ vowel should be take into consideration excluding paragraph. 6) Connected speech stimuli should be developed for evaluating intelligibility and VP function.

  • PDF

Voice Quality Criteria for Heterogenous Network Communication Under Mobile-VoIP Environments

  • Choi, Jae-Hun;Seol, Soon-Uk;Chang, Joon-Hyuk
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.3E
    • /
    • pp.99-108
    • /
    • 2009
  • In this paper, we suggest criteria for objective measurement of speech quality in mobile VoIP (Voice over Internet Protocol) services over wireless mobile internet such as mobile WiMAX networks. This is the case that voice communication service is available under other networks. When mobile VoIP service users in the mobile internet network based on packet call up PSTN and mobile network users, but there have not been relevant quality indexes and quality standards for evaluating speech quality of mobile VoIP. In addition, there are many factors influencing on the speech quality in packet network. Especially, if the degraded speech with packet loss transfers to the other network users through the handover, voice communication quality is significantly deteriorated by the transformation of speech codecs. In this paper, we eventually adopt the Gilbert-Elliot channel model to characterize packet network and assess the voice quality through the objective speech quality method of ITU-T P. 862. 1 MOS-LQO for the various call scenario from mobile VoIP service user to PSTN and mobile network users under various packet loss rates in the transmission channel environments. Our simulation results show that transformation of speech codecs results in the degraded speech quality for different transmission channel environments when mobile VoIP service users call up PSTN and mobile network users.