Search | Korea Science

Implementation of Quad Variable Rates ADPCM Speech CODEC on C6000 DSP considering the Environmental Noise (배경잡음을 고려한 4배 가변 압축률을 갖는 ADPCM의 C6000 DSP 실시간 구현)

Kim Dae-Sung;Han Kyong-ho
- Proceedings of the KIPE Conference / 2002.07a / pp.727-729 / 2002
In this paper, we propose a quad variable-rate ADPCM coding method and its implementation on a C6000 DSP. The method is modified from the standard ITU G.726 ADPCM to improve speech quality in the presence of environmental noise. Four coding rates, 16 Kbps, 24 Kbps, 32 Kbps and 40 Kbps, are used for the speech window samples, and the rate decision threshold is set by the environmental noise level. The objective of the proposed method is to reduce the coding rate while retaining speech quality: under environmental noise, the speech quality is considerably close to that of a 40 Kbps single-rate coder, with a coding rate close to that of a 16 Kbps single-rate coder. The environmental noise level affects the coding rate, and the noise level is calculated for every window of speech samples. At high noise levels, more samples are coded at higher rates to enhance quality; at low noise levels, only the large speech signals are coded at higher rates, and more speech samples are coded at lower rates to reduce the overall coding rate. The influence of noise on the speech signal is considerably high for small signals, and small signals have a higher ZCR (zero crossing rate). The method is simulated on a PC and is to be implemented on a C6000 floating-point DSP board for real-time operation.
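The rate decision described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the window length, the base thresholds, the `NOISE_GAIN` constant and the rule that thresholds shrink as noise grows are all hypothetical choices, made only to reproduce the stated behaviour (more windows coded at higher rates when the noise level is high). The ZCR helper simply implements the usual definition of zero crossing rate.

```python
import math

RATES_KBPS = (16, 24, 32, 40)        # the four rates named in the abstract
BASE_THRESHOLDS = (0.1, 0.3, 0.6)    # hypothetical RMS thresholds (normalized amplitude)
NOISE_GAIN = 10.0                    # hypothetical: how strongly noise lowers the thresholds


def zero_crossing_rate(window):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(window, window[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(window) - 1)


def rms(window):
    """Root-mean-square level of one window of samples."""
    return math.sqrt(sum(x * x for x in window) / len(window))


def choose_rate(window, noise_rms):
    """Pick a coding rate for one window (illustrative rule only).

    Higher noise lowers every threshold, so more windows exceed them
    and are coded at the higher rates.
    """
    scale = 1.0 / (1.0 + NOISE_GAIN * noise_rms)
    level = rms(window)
    index = sum(1 for t in BASE_THRESHOLDS if level > t * scale)
    return RATES_KBPS[index]


window = [0.2, -0.2] * 80                   # toy 160-sample window
print(zero_crossing_rate(window))           # every adjacent pair changes sign
print(choose_rate(window, noise_rms=0.0))   # quiet background
print(choose_rate(window, noise_rms=0.1))   # noisier background, higher rate
```

With these made-up constants the same window is coded at 24 Kbps against a silent background and at 32 Kbps when the noise level rises, which is the qualitative trend the abstract describes.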

A study on speech analysis of person with presbycusis (노인성 난청인의 음성특성에 관한 연구)

Lee, S.M.;Song, C.G.;Woo, H.C.;Lee, Y.M.;Kim, W.K.
- Proceedings of the KOSOMBE Conference / v.1997 no.11 / pp.67-70 / 1997
In this paper, we evaluated the speech characteristics of hearing-impaired persons (HIP) who acquired their hearing loss after youth. It is usually observed that severe hearing impairment degrades not only speech perception but also vocalization, so there is a need for sensitive and quantitative measures for the assessment of the speech of HIP, to serve both diagnostic and prognostic purposes. Seven HIP and 12 normal-hearing persons (NHP) were studied with a pure tone test and a speaking test using a word/sentence table consisting of a vowel (a:), one- and two-syllable words, and a sentence. We analyzed the formant frequency, pitch, sound intensity, and speech duration of HIP and NHP speech. According to the results, in the HIP speech the formant frequency was shifted, the first-formant prominence was reduced, the dynamic range of sound intensity was decreased, and the speech duration was prolonged. In future work, we expect the correlation between the hearing and speech characteristics of HIP to be clarified through analysis of more acoustic parameters and more precise selection of the HIP group.

A STUDY ON SPEECH PROBLEMS IN PATIENTS WITH VELOPHARYNGEAL INCOMPETENCY (연구개(軟口蓋) 인두간(咽頭間) 폐쇄부전(閉鎖不全)(Velopharyngeal Incompetency) 환자(患者)에 있어서 발음(發音) 장애(障碍)에 관한 연구(硏究))

Choi, Jin-Young;Min, Byoung-il
- Maxillofacial Plastic and Reconstructive Surgery / v.14 no.1_2 / pp.22-39 / 1992
The purpose of this study was to evaluate hypernasality, nasal air emission, glottal stop, and articulation disorder in patients with velopharyngeal incompetency (V.P.I.) and to analyze speech improvement after pharyngoplasty. In this study, 61 patients with velopharyngeal incompetency were tested, and in patients who underwent pharyngoplasty, speech problems before pharyngoplasty were compared with those after pharyngoplasty. The results obtained are as follows: 1. There are few speech problems in pronouncing vowel sounds. 2. There are many speech problems in pronouncing pressure sounds and few in non-pressure sounds. 3. Speech problems in patients with cleft palate are influenced not by the anatomical defect but by the severity of velopharyngeal incompetence after palatorrhaphy. 4. Operation methods that decrease velopharyngeal incompetence must be considered for reducing speech problems. 5. Among the 61 cases with V.P.I., 19 cases (31%) showed nasal air emission and 24 cases (39%) showed glottal stop. 6. Pharyngoplasty is of benefit to primary precipitating components such as hypernasality and nasal air emission, but of no benefit to secondary compensating components such as glottal stop. 7. There was no significant difference in speech improvement between pre- and post-pharyngoplasty (p<0.05).

Comparison of Speech Onset Detection Characteristics of Adaptation Algorithms for Cochlear Implant Speech Processor (인공와우 어음처리방식을 위한 적응효과 알고리즘의 음성개시점 검출 특성 비교)

Choi, Sung-Jin;Kim, Jin-Ho;Kim, Kyung-Hwan
- Journal of Biomedical Engineering Research / v.29 no.1 / pp.25-31 / 2008
It is well known that temporal information about input speech, i.e. speech onset, can be better represented in the response signal of the auditory nerve owing to the adaptation effect that occurs at the auditory nerve synapse. In addition, the performance of a cochlear implant speech processor can be improved by the adaptation effect. In this paper, we observed the speech onset emphasis characteristic of the recently proposed adaptation algorithm, analyzed how its performance changes with variation of the parameters, and compared it with transient emphasis spectral maxima (TESM), the typical previous strategy. When observing false peaks, which are generated everywhere except at speech onset, the proposed model generated far fewer false peaks than TESM, and speech onset was more distinguishable under noise.
https://doi.org/10.9718/JBER.2008.29.1.025

A Performance of a Remote Speech Input Unit in Speech Recognition System (음성인식 시스템에서의 원격 음성입력기의 성능평가)

Lee, Gwang-seok
- Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2009.10a / pp.723-726 / 2009
In this research, we simulated an error reduction algorithm for speech signals based on microphone array beamforming in a speech recognition system and analyzed its performance. We also processed the speech signal obtained from the microphone array and the maximum signal-to-noise ratio for each channel, and then compared them with the signal-to-noise ratio of the speech signal. The speech recognition rate improved from 54.2% to 61.4% in case 1, and from 41.2% to 50.5% in case 2, which has the lower signal-to-noise ratio. The average reduction rate is therefore 15.7% in case 1.
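The abstract does not define its reduction rate. One reading that reproduces the quoted 15.7% for case 1 is the relative reduction in recognition error (100% minus the recognition rate). The short Python check below works under that assumption; the function name is made up for illustration.

```python
def error_reduction_rate(rate_before_pct, rate_after_pct):
    """Relative reduction in recognition error, in percent.

    Assumes error = 100 - recognition rate, which is an interpretation
    of the abstract, not a definition taken from the paper.
    """
    error_before = 100.0 - rate_before_pct
    error_after = 100.0 - rate_after_pct
    return 100.0 * (error_before - error_after) / error_before


# Case 1: 54.2% -> 61.4%, i.e. error 45.8% -> 38.6%
print(round(error_reduction_rate(54.2, 61.4), 1))  # 15.7
# Case 2: 41.2% -> 50.5%, i.e. error 58.8% -> 49.5%
print(round(error_reduction_rate(41.2, 50.5), 1))  # 15.8
```

Under the same reading, case 2 gives about 15.8%, a figure the abstract itself does not report.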

Affixation effects on word-final coda deletion in spontaneous Seoul Korean speech

Kim, Jungsun
- Phonetics and Speech Sciences / v.8 no.4 / pp.9-14 / 2016
This study investigated the patterns of coda deletion in spontaneous Seoul Korean speech. More specifically, the current study focused on three factors in promoting coda deletion, namely, word position, consonant type, and morpheme type. The results revealed that, first, coda deletion frequently occurred when affixes were attached to the ends of words, rather than in affixes in word-internal positions or in roots. Second, alveolar consonants [n] and [l] in the coda positions of high-frequency affixes [nɨn] and [lɨl] were most likely to be deleted. Additionally, regarding affix reduction in the word-final position, all subjects seemed to depend on this articulatory strategy to a similar degree. In sum, the current study found that affixes without primary semantic content in spontaneous speech tend to undergo the process of reduction, favoring the occurrence of specific pronunciation variants.
https://doi.org/10.13064/KSSS.2016.8.4.009

Chinese Pronunciation Correction System for Korean learners (한국인을 위한 중국어 발음 교정 시스템)

Kim, Hyo-Sook;Kim, Sun-Ju;Kang, Hyo-Won;Kim, Mu-Jung;Ha, Jin-Young
- Proceedings of the KSPS conference / 2005.04a / pp.45-48 / 2005
This study is about constructing an L2 pronunciation correction system for L1 speakers using speech technology. The Chinese pronunciation system consists of initials, finals and tones. Initials and finals are at the segmental level and tones are at the suprasegmental level, so different methods could be used to assess Korean users' Chinese. The recognition rate of initials is 81.9% and that of finals is 68.7% with the standard acoustic model. Unlike native speech recognition, nonnative speech recognition could be improved by additional modeling using L2 speakers' speech. As a first step toward this task, we analysed nonnative speech and then set a strategy for modeling Korean speakers' speech.

Building an Exceptional Pronunciation Dictionary For Korean Automatic Pronunciation Generator (한국어 자동 발음열 생성을 위한 예외발음사전 구축)

Kim, Sun-Hee
- Speech Sciences / v.10 no.4 / pp.167-177 / 2003
This paper presents a method of building an exceptional pronunciation dictionary for a Korean automatic pronunciation generator. An automatic pronunciation generator is an essential element of a speech recognition system and a TTS (Text-To-Speech) system. It is composed of a set of regular rules and an exceptional pronunciation dictionary. The exceptional pronunciation dictionary is created by extracting words that have exceptional pronunciations from a text corpus, based on the characteristics of such words identified through phonological research and text analysis. Thus, the method contributes to improving the performance of the Korean automatic pronunciation generator as well as that of speech recognition and TTS systems.

Treatment of velopharyngeal insufficiency in a patient with a submucous cleft palate using a speech aid: the more treatment options, the better the treatment results

Park, Yun-Ha;Jo, Hyun-Jun;Hong, In-Seok;Leem, Dae-Ho;Baek, Jin-A;Ko, Seung-O
- Maxillofacial Plastic and Reconstructive Surgery / v.41 / pp.19.1-19.6 / 2019
Background: The submucous cleft palate (SMCP) is a type of cleft palate that may result in velopharyngeal insufficiency (VPI). Palate muscles completely separate the oral and nasal cavities by closing off the velopharynx during functional processes such as speech or swallowing. Hypernasality may arise from anatomical or neurological abnormalities in these functions. Treatment involves a combination of surgical intervention, speech aid, and speech therapy. This case report demonstrates successful treatment of VPI resulting from SMCP without any surgical intervention, solely with a speech aid appliance and speech therapy. Case presentation: A 13-year-old female patient with a speech disorder from velopharyngeal insufficiency caused by a submucous cleft palate visited our OMFS clinic. On intraoral examination, the patient had a short soft palate and a bifid uvula, and the muscles in the palate did not contract properly during oral speech. She had no surgical history such as primary palatoplasty or pharyngoplasty, except for tonsillectomy, and no other medical history. Objective speech assessment using a nasometer was performed. We diagnosed the patient with SMCP. The patient showed a decrease in speech intelligibility resulting from hypernasality. We decided to treat the patient with a speech aid (palatal lift) along with speech therapy. During the 7-month treatment, hypernasality measured by a nasometer decreased and speech intelligibility became normal. Conclusions: Surgery remains the first treatment option for patients with velopharyngeal insufficiency from submucous cleft palate. However, there have been few reports on objective speech evaluation before or after operation. Moreover, there has been no report of non-surgical treatment in recent studies. From this perspective, this report of objective improvement in the speech intelligibility of a VPI patient with SMCP by non-surgical treatment is significant. A speech aid can be considered one of the treatment options for the management of SMCP.
https://doi.org/10.1186/s40902-019-0202-8

Improvement of Rejection Performance using the Lip Image and the PSO-NCM Optimization in Noisy Environment (잡음 환경 하에서의 입술 정보와 PSO-NCM 최적화를 통한 거절 기능 성능 향상)

Kim, Byoung-Don;Choi, Seung-Ho
- Phonetics and Speech Sciences / v.3 no.2 / pp.65-70 / 2011
Recently, audio-visual speech recognition (AVSR) has been studied to cope with noise problems in speech recognition. In this paper, we propose a novel method of deciding the weighting factors for audio-visual information fusion. We adopt particle swarm optimization (PSO) to determine the weighting factors. The AVSR experiments show that PSO-based normalized confidence measures (NCM) improve the rejection performance for misrecognized words by 33%.
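To illustrate how PSO can pick a fusion weight, here is a minimal one-dimensional sketch in Python. It is not the paper's PSO-NCM method: the toy confidence scores, the squared-error objective, the linear fusion rule `w * audio + (1 - w) * visual` and all swarm parameters are assumptions chosen only to show the standard PSO update.

```python
import random

# Toy data (made up): per-word confidences and whether the word was correct (1) or not (0).
AUDIO = [0.9, 0.6, 0.5, 0.2]
VISUAL = [0.7, 0.1, 0.9, 0.4]
LABELS = [1, 0, 1, 0]


def fusion_mse(w):
    """Mean squared error between the fused confidence and the correctness label."""
    fused = [w * a + (1.0 - w) * v for a, v in zip(AUDIO, VISUAL)]
    return sum((f - y) ** 2 for f, y in zip(fused, LABELS)) / len(LABELS)


def pso_minimize(objective, n_particles=20, n_iters=100, seed=0,
                 inertia=0.7, c_personal=1.5, c_global=1.5):
    """Standard global-best PSO over a single weight constrained to [0, 1]."""
    rng = random.Random(seed)
    pos = [rng.random() for _ in range(n_particles)]
    vel = [0.0] * n_particles
    best_pos = pos[:]                              # each particle's best position
    best_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=best_val.__getitem__)
    g_pos, g_val = best_pos[g], best_val[g]        # swarm-wide best
    for _ in range(n_iters):
        for i in range(n_particles):
            vel[i] = (inertia * vel[i]
                      + c_personal * rng.random() * (best_pos[i] - pos[i])
                      + c_global * rng.random() * (g_pos - pos[i]))
            pos[i] = min(1.0, max(0.0, pos[i] + vel[i]))   # clamp to [0, 1]
            val = objective(pos[i])
            if val < best_val[i]:
                best_pos[i], best_val[i] = pos[i], val
                if val < g_val:
                    g_pos, g_val = pos[i], val
    return g_pos


best_w = pso_minimize(fusion_mse)
print(best_w)
```

Because this toy objective is quadratic in `w`, its minimum can also be found in closed form, which makes the sketch easy to check; a real rejection criterion would generally not have that property, which is where a search method such as PSO becomes useful.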

Search Result 5,325, Processing Time 0.033 seconds
