Search | Korea Science

The Continuous Speech Recognition with Prosodic Phrase Unit (운율구 단위의 연속음 인식)

강지영;엄기완;김진영;최승호
- The Journal of the Acoustical Society of Korea
- /
- v.18 no.8
- /
- pp.9-16
- /
- 1999
Generally, a speaker structures utterances very clearly by grouping words into phrases. This facilitates the listener's recovery of the meaning of the utterance and the speaker's intention. To this purpose, a speaker uses, among other things, prosodic information such as intonation pause, duration, intensity, etc. The research described here is concerned with the relationship between the strength of prosodic boundaries in spoken utterances as perceived by untrained listeners(Perceptual boundary strength, PBS)-In this paper, the preceptual boundary strength is used as the same meaning of the prosodic boundary strength-and prosodic information. We made a rule determinating the prosodic boundaries and verified the usefulness of the prosodic phrase as a recognition unit. Experiments results showed that the performance of speech recognition(SR) is improved in aspect of recognition rate and time compared with that using sentences as recognition unit. In the future we will suggest the methods that estimate more appropriate boundaries and study more various methods of prosody assisted SR.
PDF

Implementation of Plastic Bottle Classification System for Recycling (분리수거를 위한 페트병 분리시스템의 구현)

Park, Yongha;Park, Jihoon;Chung, Hoyeong;Lee, Joosang;Lee, Jungyeop
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2021.05a
- /
- pp.365-368
- /
- 2021
In this study, a plastic bottle recycling bin system that utilizes an infrared sensor was implemented. The proposed system consists of a recognition unit, a control unit, an alarm unit, and a driving unit. The recognition unit detects the plastic bottle, measures the distance between the plastic bottle and the infrared sensor, extracts the value of the bottle, compares the extracted value with a standard range, and then transmits the control value to the control unit if the extracted value of the bottle is outside the standard range. In this case, the result of the presence or absence of a brand label or bottle cap is transmitted to the controller. The control unit opens the entrance of the recycling bin or alerts the alarm unit according to the result value transmitted from the sensor unit. In order to implement the proposed system, the recognition unit was implemented with an infrared sensor, and the control unit was made with an Arduino IDE controller, based on the C programming language. Additionally, the recognition unit and the control unit are able to communicate using analog signals. The proposed system accurately judges the presence or absence of a brand label and bottle cap of plastic bottles according to a predetermined algorithm. It then blocks the entrance of the recycling bin when a brand label or bottle cap is still attached. As the amount of waste discharged per person is relatively high and the majority of such waste is incinerated rather than recycled, the system proposed in this study is expected to increase the recycling rate of plastic bottles.
PDF

Performance Improvement of Continuous Digits Speech Recognition Using the Transformed Successive State Splitting and Demi-syllable Pair (반음절쌍과 변형된 연쇄 상태 분할을 이용한 연속 숫자 음 인식의 성능 향상)

Seo Eun-Kyoung;Choi Gab-Keun;Kim Soon-Hyob;Lee Soo-Jeong
- Journal of Korea Multimedia Society
- /
- v.9 no.1
- /
- pp.23-32
- /
- 2006
This paper describes the optimization of a language model and an acoustic model to improve speech recognition using Korean unit digits. Since the model is composed of a finite state network (FSN) with a disyllable, recognition errors of the language model were reduced by analyzing the grammatical features of Korean unit digits. Acoustic models utilize a demisyllable pair to decrease recognition errors caused by inaccurate division of a phone or monosyllable due to short pronunciation time and articulation. We have used the K-means clustering algorithm with the transformed successive state splitting in the feature level for the efficient modelling of feature of the recognition unit. As a result of experiments, 10.5% recognition rate is raised in the case of the proposed language model. The demi-syllable fair with an acoustic model increased 12.5% recognition rate and 1.5% recognition rate is improved in transformed successive state splitting.
PDF

A Study on Realization of Continuous Speech Recognition System of Speaker Adaptation (화자적응화 연속음성 인식 시스템의 구현에 관한 연구)

김상범;김수훈;허강인;고시영
- The Journal of the Acoustical Society of Korea
- /
- v.18 no.3
- /
- pp.10-16
- /
- 1999
In this paper, we have studied Continuous Speech Recognition System of Speaker Adaptation using MAPE (Maximum A Posteriori Probability Estimation) which can adapt any small amount of adaptation speech data. Speaker adaptation is performed by the method of MAPB after Concatenation training which is making sentence unit HMM linked by syllable unit HMM and Viterbi segmentation classifies speech data to be adaptation into segmentation of syllable unit data automatically without hand labelling. For car control speech the recognition rates of adaptation of HMM was 77.18% which is approximately 6% improvement over that of unadapted HMM.(in case of O(n)DP)
PDF

A Development of Ultrasonic-wave Remote Control System For Recovering a Submarine Survey Equipment (해저 탐사 및 관측 장비 회수를 위한 초음파 원격제어시스템 개발)

Kim, Young-Jin;Huh, Kyung-Moo;Jeong, Han-Cheol;Woo, Jong-Sik;Cho, Young-June
- Proceedings of the KIEE Conference
- /
- 2004.11c
- /
- pp.117-119
- /
- 2004
In order to successfully exploit underwater resources, the first step would be a marine environmental research and exploration on the seafloor. Traditionally one sets up a long-term underwater experimental unit on the seafloor and retrieves the unit later after a certain period time. Essential to these applications is the reliable teleoperation and telemetering of the unit. This study presents ultrasonic-wave remote control system and an underwater sound recognition algorithm that can identify the sound signal without the influence of disturbances due to underwater environmental changes. The proposed method provides a means suitable for units which require low power dissipation and long-time underwater operation. We demonstrate its ability of securing stability and fast sound recognition through experimental methods.
PDF

A Phonetics Based Design of PLU Sets for Korean Speech Recognition (한국어 음성인식을 위한 음성학 기반의 유사음소단위 집합 설계)

Hong, Hye-Jin;Kim, Sun-Hee;Chung, Min-Hwa
- MALSORI
- /
- no.65
- /
- pp.105-124
- /
- 2008
This paper presents the effects of different phone-like-unit (PLU) sets in order to propose an optimal PLU set for the performance improvement of Korean automatic speech recognition (ASR) systems. The examination of 9 currently used PLU sets indicates that most of them include a selection of allophones without any sufficient phonetic base. In this paper, a total of 34 PLU sets are designed based on Korean phonetic characteristics arid the effects of each PLU set are evaluated through experiments. The results show that the accuracy rate of each phone is influenced by different phonetic constraint(s) which determine(s) the PLU sets, and that an optimal PLU set can be anticipated through the phonetic analysis of the given speech data.
PDF

A Separator system for underwater observing instrument (수중 관측 및 탐사장비 원격분리 시스템의 개발)

Kim, Young-Jin;Jeong, Han-Cheol;Huh, Kyung-Moo;Cho, Young-June
- Proceedings of the KIEE Conference
- /
- 2005.05a
- /
- pp.158-160
- /
- 2005
In order to successfully exploit underwater resources, the first step would be a marine environmental research and exploration on the seafloor. Traditionally one sets up a long-term underwater experimental unit on the seafloor and retrieves the unit later after a certain period time. Essential to these applications is the reliable teleoperation and telemetering of the unit. In our proposed ultrasonic-wave remote control system and an underwater sound recognition algorithm that can identify the sound signal without the influence of disturbances due to underwater environmental changes. The proposed method provides a means suitable for units which require low power dissipation and long-time underwater operation. We demonstrate its ability of securing stability and fast sound recognition through experimental methods.
PDF

Korean LVCSR for Broadcast News Speech

Lee, Gang-Seong
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.2E
- /
- pp.3-8
- /
- 2001
In this paper, we will examine a Korean large vocabulary continuous speech recognition (LVCSR) system for broadcast news speech. The combined vowel and implosive unit is included in a phone set together with other short phone units in order to obtain a longer unit acoustic model. The effect of this unit is compared with conventional phone units. The dictionary units for language processing are automatically extracted from eojeols appearing in transcriptions. Triphone models are used for acoustic modeling and a trigram model is used for language modeling. Among three major speaker groups in news broadcasts-anchors, journalists and people (those other than anchors or journalists, who are being interviewed), the speech of anchors and journalists, which has a lot of noise, was used for testing and recognition.
PDF

Ambiguity Types of the Homonymic & Heterographic Units for Improving Korean Voice Recognition System - a Preliminary Research (한국어 음성인식 시스템 향상을 위한 동음이철 단위의 중의성 유형 분류)

Yoon, Ae-Sun;Kang, Mi-Young
- Speech Sciences
- /
- v.15 no.4
- /
- pp.67-81
- /
- 2008
The accuracy rate of P2G (Phoneme-to-Grapheme) is one of the important factors determining the quality of unlimited voice recognition (VR) systems. Few studies were, however, conducted to reduce ambiguities of a phoneme string which can be segmented into a variety of different linguistic units (i.e. morphemes, words, eo-jeols), thus be transformed into more than one grapheme string. This paper is a preliminary research for building a large knowledge base of those homonymic & heterographic units(HHUs), which will provide unlimited Korean VR systems with more accurate P2G information. This paper analyzes 2 main factors generating HHUs: (1) boundary determination of the prosodic unit; (2) its segmentation into linguistic units. In this paper, linguistic characteristics determining variable boundaries of a prosodic unit are investigated, and the ambiguity types of HHUs are classified in accordance with their morphological and syntactic structures as well as with the phonological rules governing them.
PDF

Signal Value of Partial Song (Composed of 1 Phrase Unit) in Great Tits, Parus major: Evidence from Playback Experiments (박새(Parus major)의 Partial Song(1 phrase)의 신호적 가치)

천세민;박시룡
- The Korean Journal of Zoology
- /
- v.38 no.2
- /
- pp.230-237
- /
- 1995
Playback experiments were excecuted with seven threat Tit males inhabited in Gsngnae Myeon, Darak Ri, Chungbuk province to investigate the signal value of partial song (one unit phrase composed of two notes) as a species recognition releaser. Territorial males responded strongly to their own natural, synthetic and partial songs played in the field. However, thew showed weak or no responses to the playback songs of other species: Coal Tit (Porus ate4 and Yellow-throated Bunting (EmberiEa elegansl.6reat Tits distinguished conspecific partial songs readily from songs of other species. The results demonstrated that one unit phrase which is a basic arrangement of the Great Tit song, containes information on species recognition.
PDF

Search Result 517, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)