Search | Korea Science

A Study of Hybrid Automatic Interpret Support System (하이브리드 자동 통역지원 시스템에 관한 연구)

Lim, Chong-Gyu;Gang, Bong-Gyun;Park, Ju-Sik;Kang, Bong-Kyun
- Journal of Korean Society of Industrial and Systems Engineering
- /
- v.28 no.3
- /
- pp.133-141
- /
- 2005
The previous research has been mainly focused on individual technology of voice recognition, voice synthesis, translation, and bone transmission technical. Recently, commercial models have been produced using aforementioned technologies. In this research, a new automated translation support system concept has been proposed by combining established technology of bone transmission and wireless system. The proposed system has following three major components. First, the hybrid system consist of headset, bone transmission and other technologies will recognize user's voice. Second, computer recognized voice (using small server attached to the user) of the user will be converted into digital signal. Then it will be translated into other user's language by translation algorithm. Third, the translated language will be wirelessly transmitted to the other party. The transmitted signal will be converted into voice in the other party's computer using the hybrid system. This hybrid system will transmit the clear message regardless of the noise level in the environment or user's hearing ability. By using the network technology, communication between users can also be clearly transmitted despite the distance.
PDF KSCI

Voice Recognition Elevator for Handicapped People (장애인을 위한 음성인식 엘리베이터)

Oh, Yong-Jae;Kim, Jeong-Rae;Chung, Ik-Joo
- Journal of Industrial Technology
- /
- v.33 no.A
- /
- pp.55-60
- /
- 2013
In this paper, we proposed an efficient method for implementing a voice recognition elevator. Unlike the existing ones, the proposed system is based on the bluetooth communication and smartphones equipped with the google speech recognition software, which makes it possible that the speech recognition capability can be added to the previously installed elevators. In order to improve the recognition accuracy, instead of using the result of the google recognizer, we built a web server where the user data are accumulated and they are used for recognition error correction.
PDF

Kiosk for the Visually Impaired using Voice Recognition (음성인식 기능을 이용한 시각장애인용 키오스크)

Kim, Dae-Young;Lee, Ah-Hyun;Lee, Gun-Haeng;Kim, Se-Hyun;Lee, Boong-Joo
- The Journal of the Korea institute of electronic communication sciences
- /
- v.17 no.5
- /
- pp.873-882
- /
- 2022
In this paper, we studied the voice recognition system kiosk for convenience, thinking that the kiosk widely used in modern society should compensate for the inconvenience of using by the visually impaired. Using ultrasonic sensor and PIR(Passive Infrared), it recognizes the visually impaired within the range of 80cm-40cm, introduces the kiosk through the MP3 module and induces them to come closer. Also, when the visually impaired within 40cm is recognized, the product description and order are guided through the MP3 module. A recording-based data voice recognition system and a kiosk that outputs desired items through servo motors were studied. A kiosk for the convenience of the visually impaired was manufactured through operation and optimization experiments of PIR, ultrasonic, voice recognition, and shock sensor for the manufactured voice recognition kiosk. Finally, it was confirmed that security can be strengthened by using shock sensors and emergency bells to enhance security.
https://doi.org/10.13067/JKIECS.2022.17.5.873 인용 PDF KSCI

Real-time Phoneme Recognition System Using Max Flow Matching (최대 흐름 정합을 이용한 실시간 음소인식 시스템 구현)

Lee, Sang-Yeob;Park, Seong-Won
- Journal of Korea Game Society
- /
- v.12 no.1
- /
- pp.123-132
- /
- 2012
There are many of games using smart devices. Voice recognition is can be useful way for input. In the game, voice have to be quickly recognized, at the same time it have to be manipulated promptly as well. In this study, we developed the optimized real-time phoneme recognition using max flow matching that it can be efficiently used in the game field. Firstly, voice wavelength is transformed to FFT, secondly, transformed value is made by a graph in Z plane, thirdly, data is extracted in specific area, and then data is saved in database. After all the value is recognized using weighted bipartite max flow matching. This way would be useful method in game or robot field when researchers hope to recognize the fast voice recognition.
https://doi.org/10.7583/JKGS.2012.12.1.123 인용 PDF KSCI

VOICE CONTROL SYSTEM FOR TELEVISION SET USING MASKING MODEL AS A FRONT-END OF SPEECH RECOGNIZER

Usagawa, Tsuyoshi;Iwata, Makoto;Ebata, Masanao
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1994.06a
- /
- pp.991-996
- /
- 1994
Surrounding noise often affects the performance of speech recognition system when it is used in office or home. Especially situation is more serious when colored and nonstational noise such as an sound from television or other audio equipment is introduced. The authors proposed a voice control system for television set using an adaptive noise canceler, and it works well even is sound of television set has comparable level of speech. In this paper, a new front-end of speech recognition is introduced for the voice control system. This font-end utilizes a simplified masking model to reduce the effect of residual noise. According to experimental results, 90% correct recognition is achieved even if the level of television sound is almost 15dB higher than one of speech.
PDF

A study on the voice command recognition at the motion control in the industrial robot (산업용 로보트의 동작제어 명령어의 인식에 관한 연구)

이순요;권규식;김홍태
- Journal of the Ergonomics Society of Korea
- /
- v.10 no.1
- /
- pp.3-10
- /
- 1991
The teach pendant and keyboard have been used as an input device of control command in human-robot sustem. But, many problems occur in case that the usef is a novice. So, speech recognition system is required to communicate between a human and the robot. In this study, Korean voice commands, eitht robot commands, and ten digits based on the broad phonetic analysis are described. Applying broad phonetic analysis, phonemes of voice commands are divided into phoneme groups, such as plosive, fricative, affricative, nasal, and glide sound, having similar features. And then, the feature parameters and their ranges to detect phoneme groups are found by minimax method. Classification rules are consisted of combination of the feature parameters, such as zero corssing rate(ZCR), log engery(LE), up and down(UD), formant frequency, and their ranges. Voice commands were recognized by the classification rules. The recognition rate was over 90 percent in this experiment. Also, this experiment showed that the recognition rate about digits was better than that about robot commands.
PDF

Design of the Motorized Wheel Chair(INMEL-1) Controlled by Response Type Voices (응답형 음성제어 전동 휠체어(INMEL-1)의 설계)

정동명;홍승홍
- Journal of Biomedical Engineering Research
- /
- v.8 no.2
- /
- pp.231-240
- /
- 1987
This Paper introduces a new design of motorized wheel chair for the disabled, which is intended to improve the quality of the disabled's indoor life. This vehicle was based on high manoeuvrability of the omnidirectional drive and saftey. Usually, the vehicle controlled by a joystick but also the voice control system to be prepared for the severely disabled. This voice control system responds to the result of voice recognition, state of system or warning of dangers with voices, which has real time response and 95.3% recognition ratio and satisfactory synthesis voice Quality Therefore this system is able to provide independency in driving and the disabled's daily life.
PDF

An Implementation of Interactive Voice Recognition Stock Trading System Using VoiceXML (VoiceXML을 이용한 대화형 음성 인식 증권 거래 시스템 구현)

Cho, Chang-Su;Shin, Jeong-Hoon;Hong, Kwang-Seok
- The KIPS Transactions:PartB
- /
- v.11B no.4
- /
- pp.517-526
- /
- 2004
In this paper, we implemented practical application service using VoiceXML. Developers can utilize the advantages of using VoiceXML such as reducing development time and sharing contents between applications. Up to now, speech related services were developed using APIs and programming languages such as C/C++ or exclusive developing tools, which methods depend on system architectures. For this reasons, reuse of contents and resources was very difficult. If developers want to change scenarios of the application services or change platforms, they have to edit and recompile their program sources. To solve these problems, nowadays, companies develop their applications using VoiceXML. But, there's poor grip of actual problems can be occurred when they use VoiceXML. To overcome these problems, we implemented stock trading system using VoiceXML. We found out problems which occurred during developing services. We proposed solutions to these problems And, we analyzed strong points and weak points of applications using suggested system.
https://doi.org/10.3745/KIPSTB.2004.11B.4.517 인용 PDF KSCI

Design of Voice Control Solution for Industrial Articulated Robot (산업용 다관절로봇 음성제어솔루션 설계)

Kwak, Kwang-Jin;Kim, Dae-Yeon;Park, Jeongmin
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.21 no.2
- /
- pp.55-60
- /
- 2021
As the smart factory progresses, the use of automation facilities and robots is increasing. Also, with the development of IT technology, the utilization of the system using voice recognition is also increasing. Voice recognition technology is a technology that stands out in smart home and various IoT technologies, but it is difficult to apply to factories due to the specificity of factories. Therefore, in this study, a method to control an industrial articulated robot was designed using voice recognition technology that considers the situation at the manufacturing site. It was confirmed that the robot could be controlled through network protocol and command conversion after receiving voice commands for robot operation through mobile.
https://doi.org/10.7236/JIIBC.2021.21.2.55 인용 PDF KSCI HTML

Robust Speech Recognition Algorithm of Voice Activated Powered Wheelchair for Severely Disabled Person (중증 장애우용 음성구동 휠체어를 위한 강인한 음성인식 알고리즘)

Suk, Soo-Young;Chung, Hyun-Yeol
- The Journal of the Acoustical Society of Korea
- /
- v.26 no.6
- /
- pp.250-258
- /
- 2007
Current speech recognition technology s achieved high performance with the development of hardware devices, however it is insufficient for some applications where high reliability is required, such as voice control of powered wheelchairs for disabled persons. For the system which aims to operate powered wheelchairs safely by voice in real environment, we need to consider that non-voice commands such as user s coughing, breathing, and spark-like mechanical noise should be rejected and the wheelchair system need to recognize the speech commands affected by disability, which contains specific pronunciation speed and frequency. In this paper, we propose non-voice rejection method to perform voice/non-voice classification using both YIN based fundamental frequency(F0) extraction and reliability in preprocessing. We adopted a multi-template dictionary and acoustic modeling based speaker adaptation to cope with the pronunciation variation of inarticulately uttered speech. From the recognition tests conducted with the data collected in real environment, proposed YIN based fundamental extraction showed recall-precision rate of 95.1% better than that of 62% by cepstrum based method. Recognition test by a new system applied with multi-template dictionary and MAP adaptation also showed much higher accuracy of 99.5% than that of 78.6% by baseline system.
https://doi.org/10.7776/ASK.2007.26.6.250 인용 PDF KSCI

Search Result 332, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)