Search | Korea Science

A Study on the Speech Recognition of Korean Phonemes Using Recurrent Neural Network Models (순환 신경망 모델을 이용한 한국어 음소의 음성인식에 대한 연구)

김기석;황희영
- The Transactions of the Korean Institute of Electrical Engineers
- /
- v.40 no.8
- /
- pp.782-791
- /
- 1991
In the fields of pattern recognition such as speech recognition, several new techniques using Artifical Neural network Models have been proposed and implemented. In particular, the Multilayer Perception Model has been shown to be effective in static speech pattern recognition. But speech has dynamic or temporal characteristics and the most important point in implementing speech recognition systems using Artificial Neural Network Models for continuous speech is the learning of dynamic characteristics and the distributed cues and contextual effects that result from temporal characteristics. But Recurrent Multilayer Perceptron Model is known to be able to learn sequence of pattern. In this paper, the results of applying the Recurrent Model which has possibilities of learning tedmporal characteristics of speech to phoneme recognition is presented. The test data consist of 144 Vowel+ Consonant + Vowel speech chains made up of 4 Korean monothongs and 9 Korean plosive consonants. The input parameters of Artificial Neural Network model used are the FFT coefficients, residual error and zero crossing rates. The Baseline model showed a recognition rate of 91% for volwels and 71% for plosive consonants of one male speaker. We obtained better recognition rates from various other experiments compared to the existing multilayer perceptron model, thus showed the recurrent model to be better suited to speech recognition. And the possibility of using Recurrent Models for speech recognition was experimented by changing the configuration of this baseline model.

Hybrid Facial Representations for Emotion Recognition

Yun, Woo-Han;Kim, DoHyung;Park, Chankyu;Kim, Jaehong
- ETRI Journal
- /
- v.35 no.6
- /
- pp.1021-1028
- /
- 2013
Automatic facial expression recognition is a widely studied problem in computer vision and human-robot interaction. There has been a range of studies for representing facial descriptors for facial expression recognition. Some prominent descriptors were presented in the first facial expression recognition and analysis challenge (FERA2011). In that competition, the Local Gabor Binary Pattern Histogram Sequence descriptor showed the most powerful description capability. In this paper, we introduce hybrid facial representations for facial expression recognition, which have more powerful description capability with lower dimensionality. Our descriptors consist of a block-based descriptor and a pixel-based descriptor. The block-based descriptor represents the micro-orientation and micro-geometric structure information. The pixel-based descriptor represents texture information. We validate our descriptors on two public databases, and the results show that our descriptors perform well with a relatively low dimensionality.
https://doi.org/10.4218/etrij.13.2013.0054 인용 PDF KSCI

Speech Recognition by Neural Net Pattern Recognition Equations with Self-organization

Kim, Sung-Ill;Chung, Hyun-Yeol
- The Journal of the Acoustical Society of Korea
- /
- v.22 no.2E
- /
- pp.49-55
- /
- 2003
The modified neural net pattern recognition equations were attempted to apply to speech recognition. The proposed method has a dynamic process of self-organization that has been proved to be successful in recognizing a depth perception in stereoscopic vision. This study has shown that the process has also been useful in recognizing human speech. In the processing, input vocal signals are first compared with standard models to measure similarities that are then given to a process of self-organization in neural net equations. The competitive and cooperative processes are conducted among neighboring input similarities, so that only one winner neuron is finally detected. In a comparative study, it showed that the proposed neural networks outperformed the conventional HMM speech recognizer under the same conditions.
PDF KSCI

Face Recognition Method Based on Local Binary Pattern using Depth Images (깊이 영상을 이용한 지역 이진 패턴 기반의 얼굴인식 방법)

Kwon, Soon Kak;Kim, Heung Jun;Lee, Dong Seok
- Journal of Korea Society of Industrial Information Systems
- /
- v.22 no.6
- /
- pp.39-45
- /
- 2017
Conventional Color-Based Face Recognition Methods are Sensitive to Illumination Changes, and there are the Possibilities of Forgery and Falsification so that it is Difficult to Apply to Various Industrial Fields. In This Paper, we propose a Face Recognition Method Based on LBP(Local Binary Pattern) using the Depth Images to Solve This Problem. Face Detection Method Using Depth Information and Feature Extraction and Matching Methods for Face Recognition are implemented, the Simulation Results show the Recognition Performance of the Proposed Method.
https://doi.org/10.9723/jksiis.2017.22.6.039 인용 PDF KSCI

Shape Recognition and Classification Based on Poisson Equation- Fourier-Mellin Moment Descriptor

Zou, Jian-Cheng;Ke, Nan-Nan;Lu, Yan
- International Journal of CAD/CAM
- /
- v.8 no.1
- /
- pp.69-72
- /
- 2009
In this paper, we present a new shape descriptor, which is named Poisson equation-Fourier-Mellin moment Descriptor. We solve the Poisson equation in the shape area, and use the solution to get feature function, which are then integrated using Fourier-Mellin moment to represent the shape. This method develops the Poisson equation-geometric moment Descriptor proposed by Lena Gorelick, and keeps both advantages of Poisson equation-geometric moment and Fourier-Mellin moment. It is proved better than Poisson equation-geometric moment Descriptor in shape recognition and classification experiments.
PDF KSCI

Standard Primitives Processing and the Definition of Similarity Measure Functions for Hanguel Character CAI Learning and Writer's Recognition System (한글 문자 익히기 및 서체 인식 시스템의 개발을 위한 표준 자소의 처리 및 유사도 함수의 정의)

Jo, Dong-Uk
- The Transactions of the Korea Information Processing Society
- /
- v.7 no.3
- /
- pp.1025-1031
- /
- 2000
Pre-existing pattern recognition techniques, in the case of character recognition, have limited on the application field. But CAI character learning system and writer's recognition system are very important parts. The application field of pre-existing system can be expanded in the content that the learning of characters and the recognition of writers in the proposed paper. In order to achieve these goals, the development contents are the following: Firstly, pre-processing method by understanding the image structure is proposed, secondly, recognition of characters are accomplished b the histogram distribution characteristics. Finally, similarity measure functions are defined from standard character pattern for matching of the input character pattern. Also the effectiveness of this system is demonstrated by experimenting the standard primitive image.
PDF

Improvement of Bit Recognition Rate for Color QR Codes By Multiplexing Color and Pattern Information (색 및 패턴 정보 다중화를 이용한 칼라 QR코드의 비트 인식률 개선)

Kim, Jin-Soo
- Journal of Korea Multimedia Society
- /
- v.24 no.8
- /
- pp.1012-1019
- /
- 2021
Currently, since the black-white QR (Quick Response) codes have limited storage capacity, color QR codes have been actively being studied. By multiplexing 3 colors, the color QR codes can allow the code capacity to be increased by three times, however, the color multiplexing brings about the possibility of crosstalk and noises in the acquisition process of the final image, incurring the decrease of bit-recognition rate. In order to improve the bit recognition rate, while keeping the storage capacity high, this paper proposes a new type of color QR code which uses the pattern information as well as the color information, and then analyzes how to increase the bit recognition rate. For this aim, the paper presents an efficient system which extracts embedded information from color QR code and then, through practical experiments, it is shown that the proposed color QR codes improves the bit recognition rate and are useful for commercial applications, compared to the conventional color codes.
https://doi.org/10.9717/kmms.2021.24.8.1012 인용 PDF KSCI HTML

An Implementation of Lip Print Recognition system using VHDL (VHDL을 이용한 구순문 인식 시스템의 구현 연구)

Choi, Woo-Jin;Chung, Chin-Hyun
- Proceedings of the KIEE Conference
- /
- 1999.07g
- /
- pp.2935-2937
- /
- 1999
The human has recognizable part of body such as a fingerprint, a crimson, a blood vessel. This part has been investigated constantly, its confidence for personal recognition is high. In spite of specialized part of human body, a lip print recognition is developed less than the other physical attribute that is a fingerprint. a voice pattern, a retinal blood-vessel pattern, or a facial recognition. This paper is to implement hardware for lip print recognition system using VHDL.
PDF

Finger Vein Recognition Using Generalized Local Line Binary Pattern

Lu, Yu;Yoon, Sook;Xie, Shan Juan;Yang, Jucheng;Wang, Zhihui;Park, Dong Sun
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.8 no.5
- /
- pp.1766-1784
- /
- 2014
Finger vein images contain rich oriented features. Local line binary pattern (LLBP) is a good oriented feature representation method extended from local binary pattern (LBP), but it is limited in that it can only extract horizontal and vertical line patterns, so effective information in an image may not be exploited and fully utilized. In this paper, an orientation-selectable LLBP method, called generalized local line binary pattern (GLLBP), is proposed for finger vein recognition. GLLBP extends LLBP for line pattern extraction into any orientation. To effectually improve the matching accuracy, the soft power metric is employed to calculate the matching score. Furthermore, to fully utilize the oriented features in an image, the matching scores from the line patterns with the best discriminative ability are fused using the Hamacher rule to achieve the final matching score for the last recognition. Experimental results on our database, MMCBNU_6000, show that the proposed method performs much better than state-of-the-art algorithms that use the oriented features and local features, such as LBP, LLBP, Gabor filter, steerable filter and local direction code (LDC).
https://doi.org/10.3837/tiis.2014.05.015 인용 PDF KSCI KPUBS HTML

Two stage neural network for spatio-temporal pattern recognition (시변패턴 인식을 위한 2단 구조의 신경회로망)

Lim, Chung-Soo;Lee, Chong-Ho
- Proceedings of the KIEE Conference
- /
- 1998.07g
- /
- pp.2290-2292
- /
- 1998
This paper introduces Two-stage neural network that is capable of recognizing spatio-temporal patterns. First stage takes a spatio-temporal pattern as input and compress it into sparse spatio-temporal pattern. Second stage is for temporal pattern recognition with nonuniform inhibitory connections and different cell sizes. These are basic properties for detecting a embeded pattern in a larger pattern. The network is evaluated by computer simulation.
PDF

Search Result 2,473, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)