Search | Korea Science

A Study on Speech Recognition in a Running Automobile (주행중인 자동차 환경에서의 음성인식 연구)

양진우;김순협
- The Journal of the Acoustical Society of Korea
- /
- v.19 no.5
- /
- pp.3-8
- /
- 2000
In this paper, we studied design and implementation of a robust speech recognition system in noisy car environment. The reference pattern used in the system is DMS(Dynamic Multi-Section). Two separate acoustic models, which are selected automatically depending on the noisy car environment for the speech in a car moving at below 80km/h and over 80km/h are proposed. PLP(Perceptual Linear Predictive) of order 13 is used for the feature vector and OSDP (One-Stage Dynamic Programming) is used for decoding. The system also has the function of editing the phone-book for voice dialing. The system yields a recognition rate of 89.75% for male speakers in SI (speaker independent) mode in a car running on a cemented express way at over 80km/h with a vocabulary of 33 words. The system also yields a recognition rate of 92.29% for male speakers in SI mode in a car running on a paved express way at over 80km/h.
PDF

A Study On The ASP Module Using VoiceMXL in Automatic Speech Recognition System (VoiceXML을 이용한 음성 인식시스템에서의 ASP 모듈 연구)

장준식;김민석;윤재석
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2001.10a
- /
- pp.609-612
- /
- 2001
In this research, it has been shown that how the computer can recognize and understand spoken natural language and its symbolization using VoiceXML and Grammar Specific Language. In order for user to hear correct information, ASP Module has been revised and its effectivities has been experimented on the Voice portal airplane information system platform.
PDF

A Study on Precise Control of Autonomous Travelling Robot Based on RVR (RVR에 의한 자율주행로봇의 정밀제어에 관한연구)

Shim, Byoung-Kyun;Cong, Nguyen Huu;Kim, Jong-Soo;Ha, Eun-Tae
- Journal of the Korean Society of Industry Convergence
- /
- v.17 no.2
- /
- pp.42-53
- /
- 2014
Robust voice recognition (RVR) is essential for a robot to communicate with people. One of the main problems with RVR for robots is that robots inevitably real environment noises. The noise is captured with strong power by the microphones, because the noise sources are closed to the microphones. The signal-to-noise ratio of input voice becomes quite low. However, it is possible to estimate the noise by using information on the robot's own motions and postures, because a type of motion/gesture produces almost the same pattern of noise every time it is performed. In this paper, we propose an RVR system which can robustly recognize voice by adults and children in noisy environments. We evaluate the RVR system in a communication robot placed in a real noisy environment. Voice is captured using a wireless microphone. Navigation Strategy is shown Obstacle detection and local map, Design of Goal-seeking Behavior and Avoidance Behavior, Fuzzy Decision Maker and Lower level controller. The final hypothesis is selected based on posterior probability. We then select the task in the motion task library. In the motion control, we also integrate the obstacle avoidance control using ultrasonic sensors. Those are powerful for detecting obstacle with simple algorithm.
https://doi.org/10.21289/KSIC.2014.17.2.042 인용 PDF

An Implementation of Lip Print Recognition system using VHDL (VHDL을 이용한 구순문 인식 시스템의 구현 연구)

Choi, Woo-Jin;Chung, Chin-Hyun
- Proceedings of the KIEE Conference
- /
- 1999.07g
- /
- pp.2935-2937
- /
- 1999
The human has recognizable part of body such as a fingerprint, a crimson, a blood vessel. This part has been investigated constantly, its confidence for personal recognition is high. In spite of specialized part of human body, a lip print recognition is developed less than the other physical attribute that is a fingerprint. a voice pattern, a retinal blood-vessel pattern, or a facial recognition. This paper is to implement hardware for lip print recognition system using VHDL.
PDF

Snake Robot Motion Scheme Using Image and Voice (감각 정보를 이용한 뱀 로봇의 행동구현)

강준영;김성주;조현찬;전홍태
- Proceedings of the IEEK Conference
- /
- 2002.06c
- /
- pp.127-130
- /
- 2002
Human's brain action can divide by recognition and intelligence. recognition is sensing voice, image and smell and Intelligence is logical judgment, inference, decision. To this concept, Define function of cerebral cortex, and apply the result. Current expert system is lack, that reasoning by cerebral cortex and thalamus, hoppocampal and so on. In this paper, With human's brain action, wish to embody human's action artificially Embody brain mechanism using Modular Neural Network, Applied this result to snake robot.
PDF

Wearable Computing System for the bland persons (시각 장애우를 위한 Wearable Computing System)

Kim, Hyung-Ho;Choi, Sun-Hee;Jo, Tea-Jong;Kim, Soon-Ju;Jang, Jea-In
- Proceedings of the KIEE Conference
- /
- 2006.04a
- /
- pp.261-263
- /
- 2006
Nowadays, technologies such as RFID, sensor network makes our life comfortable more and more. In this paper we propose a wearable computing system for blind and deaf person who can be easily out of sight from our technology. We are making a wearable computing system that is consisted of embedded board to processing data, ultrasonic sensors to get distance data and motors that make vibration as a signal to see the screen for a deaf person. This system offers environmental informations by text and voice. For example, distance data from a obstacle to a person are calculated by data compounding module using sensed ultrasonic reflection time. This data is converted to text or voice by main processing module, and are serviced to a handicapped person. Furthermore we will extend this system using a voice recognition module and text to voice convertor module to help communication among the blind and deaf persons.
PDF

An analysis of a statistical difference of acoustic Parameters' distribution between normal voice and pathological voice (병적 음성과 정상 음성의 음향학적 파라미터 분포에 대한 통계적 분석)

김용주;권순복;김기련;신민철;조철우;왕수건
- Proceedings of the IEEK Conference
- /
- 2001.06d
- /
- pp.249-252
- /
- 2001
The most basic means of communication among humans is a voice. Without speaking of voice technologies, we found it is important and convenient to use a voice in everyday life. But. in consideration to speech recognition systems, we can't always desire a normal voice input as input signal to the system. Generally speaking. a pathological voice as against a normal which is a voice with a problem in the larynx. could be also special case of input voice. Of course, but the distortion of a speech signal by environmental effects i.e., noise or transmission channel was a raised problem. we will take up a pathological voices with laryngeal disease which is essential distortion factor in voice. Also, we are to find out the difference of acoustic parameters distribution between normal and pathological voice by a statistical method in our research.
PDF

An Ultrasonic Wave Encoder and Decoder for Indoor Positioning of Mobile Marketing System

Kim, Young-Mo;Jang, Se-Young;Park, Byeong-Chan;Bang, Kyung-Sik;Kim, Seok-Yoon
- Journal of the Korea Society of Computer and Information
- /
- v.24 no.7
- /
- pp.93-100
- /
- 2019
In this paper, we propose an intelligent marketing service system that can provide custom advertisements and events to both businesses and customers by identifying the location and contents using the ultrasonic signals and feature information in voice signals. We also develop the encoding and decoding algorithm of ultrasonic signals for this system and analyze the performance evaluation results. With the development of the hyper-connected society, the on-line marketing has been activated and is growing in size. Existing store marketing applications have disadvantages that customers have to find out events or promotional materials that the headquarters or stores throughusing the corresponding applications whenever they visit them. To solve these problems, there are attempts to create intelligent marketing tools using GPS technology and voice recognition technology. However, this approach has difficulties in technology development due to accuracy of location and speed of comparison and retrieval of voice recognition technology, and marketing services for customer relation are also much simplified.
https://doi.org/10.9708/jksci.2019.24.07.093 인용 PDF KSCI HTML

Foreign Accents Classification of English and Urdu Languages, Design of Related Voice Data Base and A Proposed MLP based Speaker Verification System

Muhammad Ismail;Shahzad Ahmed Memon;Lachhman Das Dhomeja;Shahid Munir Shah
- International Journal of Computer Science & Network Security
- /
- v.24 no.10
- /
- pp.43-52
- /
- 2024
A medium scale Urdu speakers' and English speakers' database with multiple accents and dialects has been developed to use in Urdu Speaker Verification Systems, English Speaker Verification Systems, accents and dialect verification systems. Urdu is the national language of Pakistan and English is the official language. Majority of the people are non-native Urdu speakers and non-native English in all regions of Pakistan in general and Gilgit-Baltistan region in particular. In order to design Urdu and English speaker verification systems for security applications in general and telephone banking in particular, two databases has been designed one for foreign accent of Urdu and another for foreign accent of English language. For the design of databases, voice data is collected from 180 speakers from GB region of Pakistan who could speak Urdu as well as English. The speakers include both genders (males and females) with different age groups ranging from 18 to 69 years. Finally, using a subset of the data, Multilayer Perceptron based speaker verification system has been designed. The designed system achieved overall accuracy rate of 83.4091% for English dataset and 80.0454% for Urdu dataset. It shows slight differences (4.0% with English and 7.4% with Urdu) in recognition accuracy if compared with the recently proposed multilayer perceptron (MLP) based SIS achieved 87.5% recognition accuracy
https://doi.org/10.22937/IJCSNS.2024.24.10.5 인용 PDF

Semantic-Oriented Error Correction for Voice-Activated Information Retrieval System

Yoon, Yong-Wook;Kim, Byeong-Chang;Lee, Gary-Geunbae
- MALSORI
- /
- no.44
- /
- pp.115-130
- /
- 2002
Voice input is often required in many new application environments, but the low rate of speech recognition makes it difficult to extend its application. Previous approaches were to raise the accuracy of the recognition by post-processing of the recognition results, which were all lexical-oriented. We suggest a new semantic-oriented approach in speech recognition error correction. Through experiments using a speech-driven in-vehicle telematics information application, we show the excellent performance of our approach and some advantages it has as a semantic-oriented approach over a pure lexical-oriented approach.
PDF

Search Result 332, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)