• Title/Summary/Keyword: 음성요해도 (speech intelligibility)


Effect of noise and reverberation on subjective measure of speech transmission performance for elderly person with hearing loss in residential space (주거 공간에서 고령자 청력손실을 고려한 소음 및 잔향에 따른 음성 전송 성능의 주관적 평가)

  • Oh, Yang Ki;Ryu, Jong-Kwan;Song, Han-Sol
    • The Journal of the Acoustical Society of Korea / v.37 no.5 / pp.369-377 / 2018
  • This study used a listening test to investigate the effect of noise and reverberation on subjective measures of speech transmission performance for elderly people with hearing loss in residential spaces. Floor impact, road traffic, airborne, and drainage noise were employed as the residential noise, and several impulse responses were obtained through room acoustical computer simulation of an apartment building. Sound sources for the listening test consisted of residential noises and speech sounds for both the young (the original sound) and the aged (the sound processed by filters whose frequency responses match the hearing loss of a 65-year-old person). In the listening test, subjects evaluated speech intelligibility and listening difficulty for the presented word ($L_{Aeq}$ 55 dB) at three noise levels ($L_{Aeq}$ 30, 40, 50 dB) and three reverberation times (0.5, 1.0, 1.5 s). Results showed that a residential space with a noise level lower than or equal to 50 dB ($L_{i,Fmax,AW}$) for jumping noise and 40 dB ($L_{Aeq}$) for road traffic, airborne, and drainage noise had speech intelligibility of 90 % or higher and listening difficulty of 30 % or lower. Speech intelligibility and listening difficulty for the aged sound source were 0 % ~ 5 % lower and 2 % ~ 20 % higher, respectively, than those for the young sound source.

A Comparative Study of Listener Perception of Durational Change in the Korean Auxiliary Particle '-yo' (보조사 '-요'의 음장 변화에 따른 청자의 지각 차이 비교)

  • Yoon, Eun-Kyung;Kim, Sul-Ki
    • Phonetics and Speech Sciences / v.3 no.4 / pp.55-62 / 2011
  • This paper investigates whether listeners perceive a different level of politeness when the duration of the Korean sentence-final auxiliary particle '-yo' is varied. A total of 10 Korean sentences were manipulated by lengthening and shortening '-yo' by 10%, 20%, and 30%. The participants were native Korean speakers and Chinese and Japanese learners of Korean (n=10 in each group). They were asked to rate the politeness of the stimuli on a 9-point scale. Korean listeners perceived decreased politeness as the duration of '-yo' was shortened and increased politeness as it was lengthened. Chinese and Japanese listeners, however, did not perceive a different level of politeness in the manipulated sentences. This finding suggests that it is important to teach L2 speakers that the duration of the auxiliary particle '-yo' plays a role in Korean listeners' perception of politeness.

Speaker Recognition Technology Robust to Environmental Variation (환경 변이에 강인한 화자 인식 기술)

  • 김유진;정재호
    • Review of KIISC / v.12 no.2 / pp.41-49 / 2002
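  • Speaker recognition, which shares its roots with speech recognition, has made remarkable progress through decades of research, enough to raise expectations that it could soon come into general use. When the technology is applied in real environments, however, the speaking environment cannot be controlled, so speech uttered in an environment different from the training environment must be recognized. This gives rise to the so-called "mismatch condition". Early research tried to overcome this by modeling and removing the noise itself, thereby normalizing the difference between the training and recognition environments. More recently, because models of noise-induced distortion are complex and do not translate directly into recognition performance, research has actively pursued compensation for the difference between the training and recognition environments. This paper introduces basic speaker recognition techniques, the mismatch factors that degrade performance, and the techniques for overcoming them.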

Allophonic Rules and Determining Factors of Allophones in Korean (한국어의 변이음 규칙과 변이음의 결정 요인들)

  • Lee Ho-Young
    • MALSORI / no.21_24 / pp.144-175 / 1992
  • This paper aims to discuss determining factors of Korean allophones and to formulate and classify Korean allophonic rules systematically. The relationship between allophones and coarticulation, the most influential factor of allophonic variation, is thoroughly investigated. Other factors -- speech tempo and style, dialect, and social factors such as age, sex, class etc. -- are also briefly discussed. Allophonic rules are classified into two groups -- 1) those relevant to coarticulation and 2) those irrelevant to coarticulation. Rules of the first group are further classified into four subgroups according to the directionality of the coarticulation. Each allophonic rule formulation is explained and discussed in detail. The allophonic rules formulated and classified in this paper are 1) Devoicing of Voiced Consonants, 2) Devoicing of Vowels, 3) Nasal Approach and Lateral Approach, 4) Uvularization, 5) Palatalization, 6) Voicing of Voiceless Lax Consonants, 7) Frication, 8) Labialization, 9) Nasalization, 10) Release Withholding and Release Masking, 11) Glottalization, 12) Flap Rule, 13) Vowel Weakening, and 14) Allophones of /ㅚ, ㅟ, ㅢ/ (which are realized as diphthongs or as monophthongs depending on phonetic contexts).

Predicting Variables of Speech Intelligibility in Adults with Hearing Impairment: Focusing on Correct Articulation (청각장애 성인의 말명료도 예측 요인: 조음정확도를 중심으로)

  • Sung, Hee-Jung;Choi, Eun-Ah;Yoon, Mi-Sun
    • MALSORI / no.61 / pp.1-14 / 2007
  • The purpose of this study was to analyze the relationship between segmental correctness and speech intelligibility in adults with hearing impairment. Segmental correctness was measured by percentage of correct vowels (PCV) and percentage of correct consonants (PCC). The results were as follows. First, PCV and PCC predicted speech intelligibility with statistical significance. Second, among consonant classes divided by place and manner of articulation, the PCC of plosives and the PCC of alveolar sounds were the significant predicting variables in their respective groups ($R^{2}=50\%;\;59\%$). The study thus confirmed the importance of segmental correctness for the speech intelligibility of adults with hearing impairment. Correctness of plosives (by manner) and of alveolar sounds (by place) were significant factors in speech intelligibility.
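
As a rough illustration of the segmental measures named in the abstract above, PCV and PCC are conventionally the percentage of target vowels or consonants that a speaker produces correctly. The sketch below assumes that target and produced segments are already aligned one-to-one. The function name `percent_correct` and the sample transcriptions are hypothetical and are not taken from the study.

```python
from typing import Sequence


def percent_correct(target: Sequence[str], produced: Sequence[str]) -> float:
    """Return the percentage of target segments that were produced correctly.

    Assumes the two sequences are aligned position by position
    (an assumption of this sketch, not a procedure from the study).
    """
    if len(target) != len(produced):
        raise ValueError("target and produced must have the same length")
    if not target:
        raise ValueError("no segments to score")
    correct = sum(t == p for t, p in zip(target, produced))
    return 100.0 * correct / len(target)


# Hypothetical aligned transcriptions for one speaker.
target_consonants = ["k", "n", "t", "s", "m"]
produced_consonants = ["k", "n", "t", "t", "m"]  # /s/ produced as [t]
target_vowels = ["a", "i", "u", "o"]
produced_vowels = ["a", "i", "u", "o"]

pcc = percent_correct(target_consonants, produced_consonants)
pcv = percent_correct(target_vowels, produced_vowels)
print(f"PCC = {pcc:.1f} %, PCV = {pcv:.1f} %")
```

Scoring a subset of segments (for example, plosives only) would yield class-specific PCC values of the kind the abstract reports as predictors.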

Open Network Services-Data Grade and Leased Lines (개방망 서비스의 종류-데이터급망과 전용선망에서의 개방망 서비스)

  • Park, K.H.;Kang, S.J.
    • Electronics and Telecommunications Trends / v.8 no.3 / pp.108-126 / 1993
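  • An open network must be realized with a structure that accounts for two aspects: the opening of technical interfaces, which standardizes network connection so that the network can be accessed, and the opening of network services, which discloses the services the network holds so that users can use them selectively. Communication networks can be divided by service type and general function into voice-grade telephone networks, data networks, leased-line networks, mobile communication networks, and satellite networks, and network access can likewise be classified by network. Network services are dynamic: they continue to develop and evolve owing to technical factors arising from technological progress and network evolution, operator requirements arising from the diversification of advanced communication businesses, and market demand. Because the open network architecture is chiefly concerned with network services and technical access, it too should be understood as continually evolving. From the service perspective of the open network, this article examines in detail, for each network, the services that the relevant switching or transmission systems can provide and that can be expressed as a service menu of the open network architecture. This issue, the second installment, introduces the open network services for data-grade and leased-line networks that domestic switching systems, and the BOCs as part of ONA in the United States, can provide.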

The syllable recovery rule-base system for the post-processing of a continuous speech recognition (연속음성인식 후처리를 위한 음절 복원 rule-base시스템)

  • Park, Mi-Seong;Kim, Mi-Jin;Lee, Mun-Hui;Choi, Jae-Hyeok;Lee, Sang-Jo
    • Annual Conference on Human and Language Technology / 1998.10c / pp.379-385 / 1998
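  • When Korean is pronounced continuously, various phonological changes occur, and this is one of the main factors that make Korean continuous speech recognition difficult. This paper constructs a system that uses rules to restore recognized strings reflecting phonological changes to text-based strings, performs morphological analysis on the restoration candidates, and outputs only the valid strings as the final result. Restoration follows four rules: a rule restoring the final and initial consonants at syllable boundaries, a vowel-processing restoration rule, a final-syllable medial-vowel restoration rule, and a single-syllable processing rule. During rule application, x-clustering information is defined and used for effective restoration, and postfix syllable frequency information is computed and used to limit the number of restoration candidates passed to the morphological analyzer.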

Determinants of Safety and Satisfaction with In-Vehicle Voice Interaction : With a Focus of Agent Persona and UX Components (자동차 음성인식 인터랙션의 안전감과 만족도 인식 영향 요인 : 에이전트 퍼소나와 사용자 경험 속성을 중심으로)

  • Kim, Ji-hyun;Lee, Ka-hyun;Choi, Jun-ho
    • The Journal of the Korea Contents Association / v.18 no.8 / pp.573-585 / 2018
  • Services for navigation and entertainment through AI-based voice user interface (VUI) devices are becoming popular in the connected car system. Classifying VUI agent developers as either IT companies or automakers, this study explores the attributes of agent persona and user experience that affect the driver's perceived safety and satisfaction. Participants in a car simulator experiment performed entertainment and navigation tasks and then evaluated perceived safety and satisfaction. Regression analysis showed that credibility of the agent developer, warmth and attractiveness of the agent persona, and efficiency and care in the UX dimension had a significant impact on perceived safety. The determinants of perceived satisfaction were unity of auto-agent makers and gender as predisposing factors, distance in the agent persona, and convenience, efficiency, ease of use, and care in the UX dimension. The contribution of this study lies in identifying the factors required for developing conversational VUI for the autonomous driving environment.

Performance Assessment of Speech Recognizer using Lombard Speech (롬바드 음성을 이용한 음성인식기의 성능 평가)

  • Jung, Sung-Yun;Chung, Hyun-Yeol;Kim, Kyung-Tae
    • The Journal of the Acoustical Society of Korea / v.13 no.5 / pp.59-68 / 1994
  • As basic research on performance assessment, this paper describes a performance assessment test, and an analysis of its results, for a Korean speech recognizer that recognizes speech affected by the Lombard effect in noisy environments. In the assessment test, standard speech data were first manipulated to approximate speech uttered in a noisy environment, and performance assessment tests were then carried out across the assessment items (noise type, SNR) in two ways: with Lombard-effect speech (LES) and with speech not affected by the Lombard effect (NLES). When a 90% recognition rate was set as the recognition limit, it was reached at 10 dB SNR with LES and at 30 dB with NLES. This 20 dB SNR difference indicates that the Lombard effect should be considered in real-world assessment tests. The type of noise did not affect recognizer performance in our tests. ANOVA, applied in evaluating several kinds of recognizers, showed that every assessment item affecting recognition performance could be quantified.

Study on Motivation and Satisfaction of Voice Chat Service (음성 채팅 서비스 사용자의 이용 동기와 만족감)

  • Eunji Lee
    • The Journal of the Convergence on Culture Technology / v.10 no.1 / pp.205-210 / 2024
  • Online messengers are now the main communication tool of modern people. Alongside messengers based on text and images, services that allow real-time interaction through voice or screen sharing are actively used by the MZ generation. This study aims 1) to identify the motivations of voice chat service users and 2) to explore how those motivations influence satisfaction, one of the factors that determine user experience. Five major motivations for using voice chat services were found: relationship formation, usefulness, relationship maintenance, communication supplementation, and distance overcoming. Among them, 'Usefulness' and 'Relationship maintenance' had a positive effect on user satisfaction. The study highlights the various needs of users who communicate in non-face-to-face environments, as well as the factors that must be satisfied for a positive experience. These results should be actively used in the online communications market.