Search | Korea Science

Comparison of Speech Rate and Long-Term Average Speech Spectrum between Korean Clear Speech and Conversational Speech

Yoo, Jeeun;Oh, Hongyeop;Jeong, Seungyeop;Jin, In-Ki
- Journal of Audiology & Otology
- /
- v.23 no.4
- /
- pp.187-192
- /
- 2019
Background and Objectives: Clear speech is an effective communication strategy used in difficult listening situations that draws on techniques such as accurate articulation, a slow speech rate, and the inclusion of pauses. Although too slow speech and improperly amplified spectral information can deteriorate overall speech intelligibility, certain amplitude of increments of the mid-frequency bands (1 to 3 dB) and around 50% slower speech rates of clear speech, when compared to those in conversational speech, were reported as factors that can improve speech intelligibility positively. The purpose of this study was to identify whether amplitude increments of mid-frequency areas and slower speech rates were evident in Korean clear speech as they were in English clear speech. Subjects and Methods: To compare the acoustic characteristics of the two methods of speech production, the voices of 60 participants were recorded during conversational speech and then again during clear speech using a standardized sentence material. Results: The speech rate and longterm average speech spectrum (LTASS) were analyzed and compared. Speech rates for clear speech were slower than those for conversational speech. Increased amplitudes in the mid-frequency bands were evident for the LTASS of clear speech. Conclusions:The observed differences in the acoustic characteristics between the two types of speech production suggest that Korean clear speech can be an effective communication strategy to improve speech intelligibility.
https://doi.org/10.7874/jao.2019.00115 인용

Comparison of Speech Rate and Long-Term Average Speech Spectrum between Korean Clear Speech and Conversational Speech

Yoo, Jeeun;Oh, Hongyeop;Jeong, Seungyeop;Jin, In-Ki
- Korean Journal of Audiology
- /
- v.23 no.4
- /
- pp.187-192
- /
- 2019
Background and Objectives: Clear speech is an effective communication strategy used in difficult listening situations that draws on techniques such as accurate articulation, a slow speech rate, and the inclusion of pauses. Although too slow speech and improperly amplified spectral information can deteriorate overall speech intelligibility, certain amplitude of increments of the mid-frequency bands (1 to 3 dB) and around 50% slower speech rates of clear speech, when compared to those in conversational speech, were reported as factors that can improve speech intelligibility positively. The purpose of this study was to identify whether amplitude increments of mid-frequency areas and slower speech rates were evident in Korean clear speech as they were in English clear speech. Subjects and Methods: To compare the acoustic characteristics of the two methods of speech production, the voices of 60 participants were recorded during conversational speech and then again during clear speech using a standardized sentence material. Results: The speech rate and longterm average speech spectrum (LTASS) were analyzed and compared. Speech rates for clear speech were slower than those for conversational speech. Increased amplitudes in the mid-frequency bands were evident for the LTASS of clear speech. Conclusions:The observed differences in the acoustic characteristics between the two types of speech production suggest that Korean clear speech can be an effective communication strategy to improve speech intelligibility.
https://doi.org/10.7874/jao.2019.00115 인용

Intelligibility Improvement Benefit of Clear Speech and Korean Stops

Kang, Kyoung-Ho
- Phonetics and Speech Sciences
- /
- v.2 no.1
- /
- pp.3-11
- /
- 2010
The present study confirmed the intelligibility improvement benefit of clear speech by investigating the intelligibility of Korean stops produced in different speaking styles: conversational, citation-form, and clear speech. This finding supports the Hypo- & Hyper-speech theory that speakers adjust vocal effort to accommodate hearers' speech perception difficulty. A progressive intelligibility improvement was found for the three speaking styles investigated: clear speech was more intelligible than citation-form speech citation-form speech was more intelligible than conversational speech and clear speech was also more intelligible than conversational speech. These findings suggest that the manipulations to elicit three distinct speaking styles in a laboratory setting were successful. Korean lenis stops showed the least intelligibility improvement among the three Korean stop types, and this result suggests that lenis stops should be more resistant to intelligibility enhancement efforts in clear speech than aspirated and fortis stops.
PDF

Performance Enhancement of Speech Intelligibility in Communication System Using Combined Beamforming (directional microphone) and Speech Filtering Method (방향성 마이크로폰과 음성 필터링을 이용한 통신 시스템의 음성 인지도 향상)

Shin, Min-Cheol;Wang, Se-Myung
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference
- /
- 2005.05a
- /
- pp.334-337
- /
- 2005
The speech intelligibility is one of the most important factors in communication system. The speech intelligibility is related with speech to noise ratio. To enhance the speech to noise ratio, background noise reduction techniques are being developed. As a part of solution to noise reduction, this paper introduces directional microphone using beamforming method and speech filtering method. The directional microphone narrows the spatial range of processing signal into the direction of the target speech signal. The noise signal located in the same direction with speech still remains in the processing signal. To sort this mixed signal into speech and noise, as a following step, a speech-filtering method is applied to pick up only the speech signal from the processed signal. The speech filtering method is based on the characteristics of speech signal itself. The combined directional microphone and speech filtering method gives enhanced performance to speech intelligibility in communication system.
PDF

An Experimental Phonetic Study of South and North Korean Speech (남북한 음성언어의 실험음성학적 연구)

Lee Hyeon-Bok
- MALSORI
- /
- no.29_30
- /
- pp.61-94
- /
- 1995
The aim of this paper is to describe the linguistic differences of two Koreas with a special emphasis on the phonetic aspect of Seoul and Pyongyang speech. The sources of the North Korean speech material used in the study are 1) the Pyongyang radio and TV broadcasts, 2) interviews of north Korean defectors and 3) speech material of north Korean scholars collected by the writer in london, Warsaw, France and China between 1989 and 1994. The most noticeable phonetic differences of Seoul and Pyongyang speech are abstracted as follows: 1) The vowels /이/, /에/ and /애/ are higher and fronter in Pyongyang speech than in Seoul speech. 2) The vowels /우/ and /으/ of Pyongyang speech are somewhat fronter than the corresponding vowels of Seoul speech. 3) The Pyongyang vowels /으/ and if are produced with rounded lips compared to the corresponding Seoul vowels. 4) The Pyongyang vowel /h) is much lower in tongue position and at the same time somewhat fronter than the corresponding Seoul vowel. 5) The consonants /r ,i ,n / are pronounced in Pyongyang as alveolar affricates or something close to them whereas they are pronounced in Seoul as post-alveolar affricates. 6) Unlike in Seoul speech there is a very strong tendency in Pyongyang speech to reduce aspiration feature in consonant seouences such as /ㅂ+ㅎ/, /ㄷ+ㅎ/, /ㄱ+ㅎ/. 7) /ㄴ/ and /ㄹ/(flap) freely occur word-initially in Pyongyang speech whereas they are restricted in Seoul speech. 8) Unlike in Seoul speech the phonemic contrast of long and short vowels are generally not functional in Pyongyang speech. Thus the vowels pronounced long in Seoul speech are usually pronounced short in Pyongyang speech.
PDF

Designing of Speech DB for Korean Pronunciation Education (한국어 발음 교육을 위한 음성 DB 구축 방안)

Jung Myungsook
- MALSORI
- /
- no.47
- /
- pp.51-72
- /
- 2003
The purpose of this paper is to design Speech Database for Korean pronunciation education. For this purpose, I investigated types of speech errors of Korean-learners, made texts for recording, which involves all types of speech errors, and showed how to gather speech data and how to tag their informations. It's natural that speech data should include Korean-learners' speech and Korean people's speech, because Speech DB that I try to develop is for teaching Korean pronunciation to foreigners. So this DB should have informations about speakers and phonetic informations, which are about phonetic value of segments and intonation of sentences. The intonation of sentence varies with the type of sentence, the structure of prosodic units, the length of a prosodic unit and so on. For this reason, Speech DB must involve tags about these informations.
PDF

Common Speech Database Collection for Telecommunications (통신망환경 한국어 공통음성 DB 구축)

Kim Sanghun;Park Moonwhan;Kim Hyunsuk
- Proceedings of the KSPS conference
- /
- 2003.05a
- /
- pp.23-26
- /
- 2003
This paper presents common speech database collection for telecommunication applications. During 3 year project, we will construct very large scale speech and text databases for speech recognition, speech synthesis, and speaker identification. The common speech database has been considered various communication environments, distribution of speakers' sex, distribution of speakers' age, and distribution of speakers' region. It consists of Korean continuous digit, isolated words, and sentences which reflects Korean phonetic coverage. In addition, it consists of various pronunciation style such as read speech, dialogue speech, and semi-spontaneous speech. Thanks to the common speech databases, the duplicated resources of Korean speech industries are prohibited. It encourages domestic speech industries and activate speech technology domestic market.
PDF

Characteristics of speech intelligibility and speech acceptability connected with mouth opening condition (구강 개방 상태에 따른 말 명료도 및 말 용인도 특성)

Song, Yun-Kyung
- Phonetics and Speech Sciences
- /
- v.3 no.3
- /
- pp.141-148
- /
- 2011
There are many factors that affect speech intelligibility and speech acceptability. Structural anomalies and neuromotor pathologies are known for the reasons of abnormal speech sounds. And there are minor variations related to oral mechanism. Speaking with restricted mouth opening related to therapeutic procedure or habitual speech pattern might affect the quality of speech sounds. So this study compared speech intelligibility and speech acceptability of recorded 24 words in two conditions (restricted mouth opening condition and normal mouth opening condition) by 30 normal hearing adults. The results showed that speech intelligibility and speech acceptability were significantly lower in restricted mouth opening condition. And speech acceptability was significantly lower than speech intelligibility in restricted mouth opening condition. Speech acceptability in restricted mouth opening condition was significantly lower especially in open vowel. These findings indicated that the mouth opening condition could affect vowel shape and could be an adverse effect on speech intelligibility and speech acceptability.
PDF

A Speech Enhancement Algorithm based on Human Psychoacoustic Property (심리음향 특성을 이용한 음성 향상 알고리즘)

Jeon, Yu-Yong;Lee, Sang-Min
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.59 no.6
- /
- pp.1120-1125
- /
- 2010
In the speech system, for example hearing aid as well as speech communication, speech quality is degraded by environmental noise. In this study, to enhance the speech quality which is degraded by environmental speech, we proposed an algorithm to reduce the noise and reinforce the speech. The minima controlled recursive averaging (MCRA) algorithm is used to estimate the noise spectrum and spectral weighting factor is used to reduce the noise. And partial masking effect which is one of the human hearing properties is introduced to reinforce the speech. Then we compared the waveform, spectrogram, Perceptual Evaluation of Speech Quality (PESQ) and segmental Signal to Noise Ratio (segSNR) between original speech, noisy speech, noise reduced speech and enhanced speech by proposed method. As a result, enhanced speech by proposed method is reinforced in high frequency which is degraded by noise, and PESQ, segSNR is enhanced. It means that the speech quality is enhanced.
https://doi.org/10.5370/KIEE.2010.59.6.1120 인용 PDF KSCI

A Comparative Study on Speech Rate Variation between Japanese/Chinese Learners of Korean and Native Korean (학습자의 발화 속도 변이 연구: 일본인과 중국인 한국어 학습자와 한국어 모어 화자 비교)

Kim, Miran;Gang, Hyeon-Ju;Ro, Juhyoun
- Korean Linguistics
- /
- v.63
- /
- pp.103-132
- /
- 2014
This study compares various speech rates of Korean learners with those of native Korean. Speech data were collected from 34 native Koreans and 33 Korean learners (19 Chinese and 14 Japanese). Each participant recorded a 9 syllabled Korean sentence at three different speech rate types. A total of 603 speech samples were analyzed by speech rate types (normal, slow, and fast), native languages (Korean, Chinese, Japanese), and learners' proficiency levels (beginner, intermediate, and advanced). We found that learners' L1 background plays a role in categorizing different speech rates in the L2 (Korean), and also that the leaners' proficiency correlates with the increase of speaking rate regardless of speech rate categories. More importantly, faster speech rate values found in the advanced level of learners do not necessarily match to the native speakers' speech rate categories. This means that learning speech rate categories can be more complex than we think of proficiency or fluency. That is, speech rate categories may not be acquired automatically during the course of second language learning, and implicit or explicit exposures to various rate types are necessary for second language learners to acquire a high level of communicative skills including speech rate variation. This paper discusses several pedagogical implications in terms of teaching pronunciation to second language learners.

Search Result 5,286, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)