Search | Korea Science

A Study on Speech Separation in Cochannel using Sinusoidal Model (Sinusoidal Model을 이용한 Cochannel상에서의 음성분리에 관한 연구)

Park, Hyun-Gyu;Shin, Joong-In;Park, Sang-Hee
- Proceedings of the KIEE Conference
- /
- 1997.11a
- /
- pp.597-599
- /
- 1997
Cochannel speaker separation is employed when speech from two talkers has been summed into one signal and it is desirable to recover one or both of the speech signals from the composite signal. Cochannel speech occurs in many common situations such as when two AM signals containing speech are transmitted on the same frequency or when two people are speaking simultaneously (e. g., when talking on the telephone). In this paper, the method that separated the speech in such a situation is proposed. Especially, only the voiced sound of few sound states is separated. And the similarity of the signals by the cross correlation between the signals for exactness of original signal and separated signal is proved.
PDF

On a Study of Measurement Method of Utterance Velocity for the Reduction of Transmission Rate in CELP Vocoder. (LSP 파라미터를 이용한 발성측정법)

장경아;배명진
- Proceedings of the IEEK Conference
- /
- 2000.11d
- /
- pp.199-202
- /
- 2000
Speaking Rate has variety depends on the situation and habit of speakers. It has been many studied about speaking rate In speaker recognition. The study of speaking rate in speech recognition is one of considerable matter when It is recognized the speakers and it is measured by many speech data base and complicate estimation for accuracy. In this paper, conventional vocoder process the speech signal when encoding and transmitting without regard to speaking rate so in order to apply the speaking rate for vocoder It should be considered the simpler algorithm and less computation amount than the conventional method of speaking rate used In speech recognition. We proposed the speaking rate algorithm which is used the simple parameter with Line Spectrum Pair (LSP). The proposed peaking rate method is measured by the information of LSP in speech. We measured the variety rate of phenomenon about utterances which have different velocity, respectively. As a result, It has distinct variation rate of phenomenon between utterances uttered fast and slow and the rate is 42.8% higher in case of uttered fast than in case of uttered slow.
PDF

The Noise Effect on Stuttering and Overall Speech Rate: Multi-talker Babble Noise (다화자잡음이 말더듬의 비율과 말속도에 미치는 영향)

Park, Jin;Chung, In-Kie
- Phonetics and Speech Sciences
- /
- v.4 no.2
- /
- pp.121-126
- /
- 2012
This study deals with how stuttering changes in its frequency in a situation where adult participants who stutter are exposed to one type of background noise, that is, multi-talker babble noise. Eight American English-speaking adults who stutter participated in this study. Each of the subjects read aloud sentences under each of three speaking conditions (i.e., typical solo reading (TSR), typical choral reading (TCR), and multi-talker babble noise reading (BNR)). Speech fluency was computed based on a percentage of syllables stuttered (%SS) and speaking rate was also assessed to examine if there was significant change in rates as a measure of vocal change under each of the speaking conditions. The study found that participants read more fluently both during BNR and during TCR than during TSR. The study also found that participants did not show significant changes in speaking rate across the three speaking conditions. Some discussion was provided in relation to the effect of multi-talker babble noise on the frequency of stuttering and its further speculation.
https://doi.org/10.13064/KSSS.2012.4.2.121 인용 PDF

Emotional Speaker Recognition using Emotional Adaptation (감정 적응을 이용한 감정 화자 인식)

Kim, Weon-Goo
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.66 no.7
- /
- pp.1105-1110
- /
- 2017
Speech with various emotions degrades the performance of the speaker recognition system. In this paper, a speaker recognition method using emotional adaptation has been proposed to improve the performance of speaker recognition system using affective speech. For emotional adaptation, emotional speaker model was generated from speaker model without emotion using a small number of training affective speech and speaker adaptation method. Since it is not easy to obtain a sufficient affective speech for training from a speaker, it is very practical to use a small number of affective speeches in a real situation. The proposed method was evaluated using a Korean database containing four emotions. Experimental results show that the proposed method has better performance than conventional methods in speaker verification and speaker recognition.
https://doi.org/10.5370/KIEE.2017.66.7.1105 인용 PDF KSCI

VOICE CONTROL SYSTEM FOR TELEVISION SET USING MASKING MODEL AS A FRONT-END OF SPEECH RECOGNIZER

Usagawa, Tsuyoshi;Iwata, Makoto;Ebata, Masanao
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1994.06a
- /
- pp.991-996
- /
- 1994
Surrounding noise often affects the performance of speech recognition system when it is used in office or home. Especially situation is more serious when colored and nonstational noise such as an sound from television or other audio equipment is introduced. The authors proposed a voice control system for television set using an adaptive noise canceler, and it works well even is sound of television set has comparable level of speech. In this paper, a new front-end of speech recognition is introduced for the voice control system. This font-end utilizes a simplified masking model to reduce the effect of residual noise. According to experimental results, 90% correct recognition is achieved even if the level of television sound is almost 15dB higher than one of speech.
PDF

An Analysis of Science-gifted Elementary Students' Perception of Speech and the Relationship between Their Voluntary Speech and Scientific Creativity (초등과학영재학생의 발표에 대한 인식 및 발표의 자발성과 과학창의성의 관계 분석)

Kim, Minju;Lim, Chaeseong
- Journal of Korean Elementary Science Education
- /
- v.38 no.3
- /
- pp.331-344
- /
- 2019
This study aims to analyse science-gifted elementary students' perception of speech in general school class, school science class, and science-gifted class and the relationship between their voluntary speech and scientific creativity. For this, 39 fifth-graders in the Science-Gifted Education Center at Seoul Metropolitan Office of Education in Korea were asked about their frequency of voluntary speech on each class situation, the reasons for such behavior, and their general opinions about speech. Also, researchers collected the teachers' observation on students' speech in class. To get the scores for students' scientific creativity, four different subjects of tasks were presented. The students' scientific creativity scores were used for correlation analysis with their frequency of speech. The main findings from this study are as follows: First, science-gifted elementary students tended to be passive in science-gifted class compared to general school and school science class. Second, the main reason for the low frequency of students' speech in school classes is that they do not have many opportunities to make presentations. Third, a survey of students' general thoughts on speech showed that more students wanted to make a speech voluntarily in class than the opposite. Fourth, the four different scientific creativity tasks had little correlation. Fifth, the correlations between the frequency of voluntary speech and the scores of scientific creativity were mostly low, with significant results only for plant task. Sixth, the correlations between the frequency of voluntary speech and the two components that make up scientific creativity, originality and usefulness, were also mostly low, but significant results for both were found in plant task, with originality having a higher correlation than usefulness. Based on this results, this study discussed the meanings and implications of students' voluntary speech on elementary science education and creativity education.
https://doi.org/10.15267/keses.2019.38.3.331 인용 PDF KSCI

The effects of Speech Intervention for Speech Naturalness of North Korean Refugees Using Visual and Auditory Feedback (시.청각적 피드백을 이용한 언어중재가 북한이탈주민의 자연스러운 발화에 미치는 효과)

Kim, Tae-Hui;Kim, Soo-Jin
- Phonetics and Speech Sciences
- /
- v.2 no.4
- /
- pp.213-221
- /
- 2010
The number of North Korean refugees entering South Korea is continuously increasing. North Korean speakers show significant differences in vowel and consonant phonetics, length of vowels, and the rhythm and intonation of sentences. The object of this research was to examine the effectiveness of a speech intervention program for North Korean refugees using visual feedback through acoustical analysis for intonation. The subjects were three adults with no speech disabilities who had been in South Korea for less than five years. They had not received any prior treatment for inflection change. The program was set in a discourse situation and used Praat to evaluate intonation and provide visual feedback as demonstrating proper intonation changes through pitch contour. The results after intervention are as follows. First, intonation was significantly improved according to a 5-point subjective evaluation scale. Second, the pitch contour was similar to the contour of standard South Korean pronunciation. The subjects were very satisfied with this initial treatment and showed a high level of motivation. In subsequent study, the development of intervention and the comparison of interventions will be needed as well.
PDF

The Imitating Ability of Speaking Rates in 4-5 year old Children (학령 전기 아동의 말속도 모방능력에 관한 연구)

Sim, Hyun-Sub;Kim, Soo- Jin;Lee, Hee-Ran;Kim, Jung-Mee
- Speech Sciences
- /
- v.5 no.1
- /
- pp.141-149
- /
- 1999
Parental speaking rates reduction is frequently recommended by speech-language pathologists as a way to facilitate the fluency of preschool children who stutter. However, this clinical notion is in need of empirical support. For this reason, Sim & Zebrowski (1995) examined the ability of young children imitating different speaking rates. However, Sim & Zebrwoski's study was not made in a natural context but in the laboratory, so the findings are limited to apply to the clinical situation. The current study aimed to examine the ability of three different speaking rates(baseline, 10% slower, and 24% slower) in a natural situation both with instruction and without instruction. The results show that (1) all children were able to imitate the stimulus speaking rates adequately, (2) instruction about speaking rates for each child influenced the ability to imitate slower speaking rates. These clinical implications of findings in this study are that 4-5 year-old children are able to imitate different speaking rates with instruction and can be candidates for the parental speaking rates reduction program in the stutter therapy.
PDF

A Situation-Based Dialogue Management with Dialogue Examples (대화 예제를 이용한 상황 기반 대화 관리 시스템)

Lee, Cheong-Jae;Jung, Sang-Keun;Lee, Geun-Bae
- MALSORI
- /
- no.56
- /
- pp.185-194
- /
- 2005
In this paper, we present POSSDM (POSTECH Situation-Based Dialogue Manager) for a spoken dialogue system using a new example and situation-based dialogue management technique for effective generation of appropriate system responses. Spoken dialogue system should generate cooperative responses to smoothly control dialogue flow with the users. We introduce a new dialogue management technique incorporating dialogue examples and situation-based rules for EPG (Electronic Program Guide) domain. For the system response inference, we automatically construct and index a dialogue example database from dialogue corpus, and the best dialogue example is retrieved for a proper system response with the query from a dialogue situation including a current user utterance, dialogue act, and discourse history. When dialogue corpus is not enough to cover the domain, we also apply manually constructed situation-based rules mainly for meta-level dialogue management.
PDF

A Situation-Based Dialogue Management with Dialogue Examples (대화 예제를 이용한 상황 기반 대화 관리 시스템)

Lee, Cheon-Jae;Jung, Sang-Keun;Lee, Geun-Bae
- Proceedings of the KSPS conference
- /
- 2005.11a
- /
- pp.113-115
- /
- 2005
In this paper, we present POSSDM (POSTECH Situation-Based Dialogue Manager) for a spoken dialogue system using a new example and situation-based dialogue management techniques for effective generation of appropriate system responses. Spoken dialogue system should generate cooperative responses to smoothly control dialogue flow with the users. We introduce a new dialogue management technique incorporating dialogue examples and situation-based rules for EPG (Electronic Program Guide) domain. For the system response inference, we automatically construct and index a dialogue example database from dialogue corpus, and the best dialogue example is retrieved for a proper system response with the query from a dialogue situation including a current user utterance, dialogue act, and discourse history. When dialogue corpus is not enough to cover the domain, we also apply manually constructed situation-based rules mainly for meta-level dialogue management.
PDF

Search Result 122, Processing Time 0.02 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)