Search | Korea Science

Implementation of Embedded Speech Recognition System for Supporting Voice Commander to Control an Audio and a Video on Telematics Terminals (텔레메틱스 단말기 내의 오디오/비디오 명령처리를 위한 임베디드용 음성인식 시스템의 구현)

Kwon, Oh-Il;Lee, Heung-Kyu
- Journal of the Institute of Electronics Engineers of Korea TC
- /
- v.42 no.11
- /
- pp.93-100
- /
- 2005
In this paper, we implement the embedded speech recognition system to support various application services such as audio and video control using speech recognition interface on cars. The embedded speech recognition system is implemented and ported in a DSP board. Because MIC type and speech codecs affect the accuracy of speech recognition. And also, we optimize the simulation and test environment to effectively remove the real noises on a car. We applied a noise suppression and feature compensation algorithm to increase an accuracy of sppech recognition on a car. And we used a context dependent tied-mixture acoustic modeling. The performance evaluation showed high accuracy of proposed system in office environment and even real car environment.
PDF KSCI

Implementation of Real-time Vowel Recognition Mouse based on Smartphone (스마트폰 기반의 실시간 모음 인식 마우스 구현)

Jang, Taeung;Kim, Hyeonyong;Kim, Byeongman;Chung, Hae
- KIISE Transactions on Computing Practices
- /
- v.21 no.8
- /
- pp.531-536
- /
- 2015
The speech recognition is an active research area in the human computer interface (HCI). The objective of this study is to control digital devices with voices. In addition, the mouse is used as a computer peripheral tool which is widely used and provided in graphical user interface (GUI) computing environments. In this paper, we propose a method of controlling the mouse with the real-time speech recognition function of a smartphone. The processing steps include extracting the core voice signal after receiving a proper length voice input with real time, to perform the quantization by using the learned code book after feature extracting with mel frequency cepstral coefficient (MFCC), and to finally recognize the corresponding vowel using hidden markov model (HMM). In addition a virtual mouse is operated by mapping each vowel to the mouse command. Finally, we show the various mouse operations on the desktop PC display with the implemented smartphone application.
https://doi.org/10.5626/KTCP.2015.21.8.531 인용 KSCI

An AI Technology-based Intelligent Senior Assistant Voice Recognition System (AI 기술 기반 지능형 시니어 도우미 음성인식 시스템)

Hong, Phil-Doo
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2019.05a
- /
- pp.355-357
- /
- 2019
Now that we are entering an aging society, the user interface for new devices and IoT technology is very inconvenient for senior generation. To improve this, we propose an AI technology-based intelligent senior assistant voice recognition system. This system implements Cloud platform based API to accumulate data for machine learning processing, provides content for diagnosis and prevention of dementia, and provide chat-bot content for senior generation. We hope that senior generations will increase the accessibility and convenience of IoT devices and new technology devices with our system.
PDF

Design and Implementation of Context-aware Application on Smartphone Using Speech Recognizer

Kim, Kyuseok
- Journal of Advanced Information Technology and Convergence
- /
- v.10 no.2
- /
- pp.49-59
- /
- 2020
As technologies have been developing, our lives are getting easier. Today we are surrounded by the new technologies such as AI and IoT. Moreover, the word, "smart" is a very broad one because we are trying to change our daily environment into smart one by using those technologies. For example, the traditional workplaces have changed into smart offices. Since the 3rd industrial revolution, we have used the touch interface to operate the machines. In the 4th industrial revolution, however, we are trying adding the speech recognition module to the machines to operate them by giving voice commands. Today many of the things are communicated with human by voice commands. Many of them are called AI things and they do tasks which users request and do tasks more than what users request. In the 4th industrial revolution, we use smartphones all the time every day from the morning to the night. For this reason, the privacy using phone is not guaranteed sometimes. For example, the caller's voice can be heard through the phone speaker when accepting a call. So, it is needed to protect privacy on smartphone and it should work automatically according to the user context. In this aspect, this paper proposes a method to adjust the voice volume for call to protect privacy on smartphone according to the user context.
https://doi.org/10.14801/JAITC.2020.10.2.49 인용

A Survey Study on the Utilization Status and User Perception of the VUI of Smartphones (스마트폰 음성 인터페이스의 사용 현황 및 사용자 인식에 대한 조사 연구)

Choe, Jaeho;Kim, Hoontae
- The Journal of Society for e-Business Studies
- /
- v.21 no.4
- /
- pp.29-40
- /
- 2016
Voice User Interface (VUI) is the most familiar and comfortable interface to human. Recently, with the development of cloud and AI technologies VUI has been applied to various products. The aim of this study was to identify the problems of the current VUI and to find the direction of future study by investigating the utilization status and user perception of the VUI of smartphones. A survey was conducted with 163 college students using Google Forms. The results showed that the level of recognition of VUI is high but the rate of usage is very low, and many users feel uncomfortable about the voice recognition rate, reaction speed and operation method. Most of the survey participants tried VUI out of curiosity, but only a small portion of them found it useful to continue to use it. Many participants disliked talking to machines and also did not want others to listen. The study results will guide future research efforts for improving the utilization of VUI.
https://doi.org/10.7838/jsebs.2016.21.4.029 인용 PDF KSCI

Implementation of interactive Stock Trading System Using VoiceXML

Shin Jeong-Hoon;Cho Chang-Su;Hong Kwang-Seok
- Proceedings of the IEEK Conference
- /
- summer
- /
- pp.387-390
- /
- 2004
In this paper, we design and implement practical application service using VoiceXML. And we suggest new solutions of problems can be occurred when implementing a new systems using VoiceXML, based on the fact. Up to now, speech related services were developed using API (Application Program Interface) and programming languages, which methods depend on system architectures. It thus appears that reuse of contents and resource was very difficult. To solve these problems, nowadays, companies develop their applications using VoiceXML. Advantages of using VoiceXML when developing services are as follows. First, we can use web developing technologies and technologies for transmitting web contents. And, we can save labors for low level programming like C language or Assembler language. And we can save labors for managing resources, too. As the result of these advantages, we can reduce developing hours of applications services and we can solve problem of compatibility between systems. But, there's poor grip of actual problems can be occurred when implementing their own services using VoiceXML. To overcome these problems, we implemented interactive stock trading system using VoiceXML and concentrated our effort to find out problems when using VoiceXML. And then, we proposed solutions to these problems and analyzed strong points and weak points of suggested system.
PDF

Speaker Identification in Small Training Data Environment using MLLR Adaptation Method (MLLR 화자적응 기법을 이용한 적은 학습자료 환경의 화자식별)

Kim, Se-hyun;Oh, Yung-Hwan
- Proceedings of the KSPS conference
- /
- 2005.11a
- /
- pp.159-162
- /
- 2005
Identification is the process automatically identify who is speaking on the basis of information obtained from speech waves. In training phase, each speaker models are trained using each speaker's speech data. GMMs (Gaussian Mixture Models), which have been successfully applied to speaker modeling in text-independent speaker identification, are not efficient in insufficient training data environment. This paper proposes speaker modeling method using MLLR (Maximum Likelihood Linear Regression) method which is used for speaker adaptation in speech recognition. We make SD-like model using MLLR adaptation method instead of speaker dependent model (SD). Proposed system outperforms the GMMs in small training data environment.
PDF

A Study of Automatic Evaluation Platform for Speech Recognition Engine in the Vehicle Environment (자동차 환경내의 음성인식 자동 평가 플랫폼 연구)

Lee, Seong-Jae;Kang, Sun-Mee
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.37 no.7C
- /
- pp.538-543
- /
- 2012
The performance of the speech recognition engine is one of the most critical elements of the in-vehicle speech recognition interface. The objective of this paper is to develop an automated platform for running performance tests on the in-vehicle speech recognition engine. The developed platform comprise of main program, agent program, database management module, and statistical analysis module. A simulation environment for performance tests which mimics the real driving situations was constructed, and it was tested by applying pre-recorded driving noises and a speaker's voice as inputs. As a result, the validity of the results from the speech recognition tests was proved. The users will be able to perform the performance tests for the in-vehicle speech recognition engine effectively through the proposed platform.
https://doi.org/10.7840/KICS.2012.37.7C.538 인용 PDF KSCI

Development of an Embedded System for Ship′s Steering Gear using Voice Recognition Module (음성인식모듈을 이용한 선박조타용 임베디드 시스템 개발)

서기열;홍태호;김화영;박계각
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 2004.04a
- /
- pp.144-148
- /
- 2004
Recently, various studies had been made for automatic control system of small ships, in order to improve maneuvering and to reduce labor and working on board. To achieve efficient operation of small ships, it had accomplished to rapid development of automatic technique, but the ship operation had been more complicated because of the need to handle various gauges and instruments. To solve these problems, there are examples to be applied to the speech information processing technologies which is one of the human interface methods in the system operation of ship, but the implementation of definite system is still incomplete. Therefore, the purpose of this paper is to implement the control system for ship steering using the voice recognition module.
PDF

Intelligent Retrieval System with Interactive Voice Support (대화형 음성 지원을 통한 지능형 검색 시스템)

Moon, K.J.;Yoo, Y.S.
- Journal of rehabilitation welfare engineering & assistive technology
- /
- v.9 no.1
- /
- pp.29-35
- /
- 2015
In this paper, we propose a intelligent retrieval system with interactive voice support. The developed system helps to find misrecognized words by using the relationship between lexical items in a sentence recognition and present the correct vocabulary. In this study, we implement a simulation system that can be proposed to determine the usefulness of the product search assistance system which offers applications. Experimental results were confirmed to correct the wrong speech recognition vocabulary in a simple user interface to help the product search.
PDF

Search Result 100, Processing Time 0.021 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)