Search | Korea Science

A Novel Fuzzy Neural Network and Learning Algorithm for Invariant Handwritten Character Recognition (변형에 무관한 필기체 문자 인식을 위한 퍼지 신경망과 학습 알고리즘)

Yu, Jeong-Su
- Journal of The Korean Association of Information Education
- /
- v.1 no.1
- /
- pp.28-37
- /
- 1997
This paper presents a new neural network based on fuzzy set and its application to invariant character recognition. The fuzzy neural network consists of five layers. The results of simulation show that the network can recognize characters in the case of distortion, translation, rotation and different sizes of handwritten characters and even with noise(8${\sim}$30%)). Translation, distortion, different sizes and noise are achieved by layer L2 and rotation invariant by layer L5. The network can recognize 108 examples of training with 100% recognition rate when they are shifted in eight directions by 1 pixel and 2 pixels. Also, the network can recognize all the distorted characters with 100% recognition rate. The simulations show that the test patterns cover a ${\pm}20^{\circ}$ range of rotation correctly. The proposed network can also recall correctly all the learned characters with 100% recognition rate. The proposed network is simple and its learning and recall speeds are very fast. This network also works for the segmentation and recognition of handwritten characters.
PDF

Recognition of Restricted Continuous Korean Speech Using Perceptual Model (인지 모델을 이용한 제한된 한국어 연속음 인식)

Kim, Seon-Il;Hong, Ki-Won;Lee, Haing-Sei
- The Journal of the Acoustical Society of Korea
- /
- v.14 no.3
- /
- pp.61-70
- /
- 1995
In this paper, the PLP cepstrum which is close to human perceptual characteristics was extracted through the spread time area to get the temperal feature. Phonemes were recognized by artificial neural network similar to the learning method of human. The phoneme strings were matched by Markov models which well suited for sequence. Phoneme recognition for the continuous Korean speech had been done using speech blocks in which speech frames were gathered with unequal numbers. We parameterized the blocks using 7th order PLPs, PTP, zero crossing rate and energy, which neural network used as inputs. The 100 data composed of 10 Korean sentences which were taken from the speech two men pronounced five times for each sentence were used for the the recognition. As a result, maximum recognition rate of 94.4% was obtained. The sentence was recognized using Markov models generated by the phoneme strings recognized from earlier results the recognition for the 200 data which two men sounded 10 times for each sentence had been carried out. The sentence recognition rate of 92.5% was obtained.
PDF

A License-Plate Image Binarization Algorithm Based on Least Squares Method for License-Plate Recognition of Automobile Black-Box Image (블랙박스 영상용 자동차 번호판 인식을 위한 최소 자승법 기반의 번호판 영상 이진화 알고리즘)

Kim, Jin-young;Lim, Jongtae;Heo, Seo Weon
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.22 no.5
- /
- pp.747-753
- /
- 2018
In the license-plate recognition systems for automobile black Image, the license-plate image frequently has a shadow due to outdoor environments which are frequently changing. Such a shadow makes unpredictable errors in the segmentation process of individual characters and numbers of the license plate image, and reduces the overall recognition rate. In this paper, to improve the recognition rate in these circumstance, a license-plate image binarization algorithm is proposed removing the shadow effectively. The propose algorithm splits the license-plate image into the regions with the shadow and without. To find out the boundary of two regions, the algorithm estimates the curve for shadow boundary using the least-squares method. The simulation is performed for the license-plate image having its shadow, and the results show much higher recognition rate than the previous algorithm.
https://doi.org/10.6109/jkiice.2018.22.4.747 인용 PDF KSCI

C2DPCA & R2DLDA for Face Recognition (얼굴 인식 시스템을 위한 C2DPCA & R2DLDA)

Yun, Tae-Sung;Song, Young-Jun;Kim, Dong-Woo;Ahn, Jae-Hyeong
- The Journal of the Korea Contents Association
- /
- v.10 no.8
- /
- pp.18-25
- /
- 2010
The study has proposed a method that simultaneously takes advantage of each projection matrix acquired by using column-directional two-dimensional PCA(C2DPCA) and row-directional two-dimensional LDA(R2DLDA). The proposed method can acquire a great secure recognition rate, with no relation to the number of training images, with acquired low-dimensional feature matrixes including both the horizontal and the vertical features of a face. Besides, in the alternate experiment of PCA and LDA to row-direction and column-direction respectively(C2DPCA & R2DLDA, C2DLDA & R2DPCA), we could make sure the system of 2 dimensional LDA with row-directional feature(C2DPCA & R2DLDA) obtain higher recognition rate with low dimension than opposite case. As a result of experimenting that, the proposed method has showed a greater recognition rate of 99.4% than the existing methods such as 2DPCA and 2DLDA, etc. Also, it was proved that its recognition processing is over three times as fast as that of 2DPCA or 2DLDA.
https://doi.org/10.5392/JKCA.2010.10.8.018 인용 PDF KSCI

Pre-processing Method for Face Recognition Robust to Lightness Variation; Facial Symmetry (조명 변화에 강건한 얼굴 인식의 전처리 기법; 얼굴의 대칭성)

Kwon Heak-Bong;Kim Young-Gil;Chang Un-Dong;Song Young-Jun
- The Journal of the Korea Contents Association
- /
- v.4 no.4
- /
- pp.163-169
- /
- 2004
In this paper. we propose a shaded recognition method using symmetric feature. When the existing PCA is applied to shaded face images, the recognition rate is decreased. To improve the recognition rate, we use facial symmetry. If the difference of light and shade is greater than a threshold value, we make a mirror image by replacing the dark side with the bright side symmetrically Then the mirror image is compared with a query image. We compare the performance of the proposed algorithm with the existing algorithms such as PCA, PCA without three eigenfaces and histogram equalization methods. The recognition rate of our method shows $98.889\%$ with the excellent result.
PDF

Speech Recognition of Korean Phonemes 'ㅅ', 'ㅈ', 'ㅊ' based on Volatility and Turning Points (변동성과 전환점에 기반한 한국어 음소 'ㅅ', 'ㅈ', 'ㅊ' 음성 인식)

Lee, Jae Won
- KIISE Transactions on Computing Practices
- /
- v.20 no.11
- /
- pp.579-585
- /
- 2014
A phoneme is the minimal unit of speech, and it plays a very important role in speech recognition. This paper proposes a novel method that can be used to recognize 'ㅅ', 'ㅈ', and 'ㅊ' among Korean phonemes. The proposed method is based on a volatility indicator and a turning point indicator that are calculated for each constituting block of the input speech signal. The volatility indicator is the sum of the differences between the values of each two samples adjacent in a block, and the turning point indicator is the number of extremal points at which the direction of the increment or decrement of the values of the sample are inverted in a block. A phoneme recognition algorithm combines the two indicators to finally determine the positions at which the three target phonemes mentioned above are recognized by utilizing optimized thresholds related with those indicators. The experimental results show that the proposed method can markedly reduce the error rate of the existing methods both in terms of the false reject rate and the false accept rate.
https://doi.org/10.5626/KTCP.2014.20.11.579 인용

Efficient context dependent process modeling using state tying and decision tree-based method (상태 공유와 결정트리 방법을 이용한 효율적인 문맥 종속 프로세스 모델링)

Ahn, Chan-Shik;Oh, Sang-Yeob
- Journal of Korea Multimedia Society
- /
- v.13 no.3
- /
- pp.369-377
- /
- 2010
In vocabulary recognition systems based on HMM(Hidden Markov Model)s, training process unseen model bring on show a low recognition rate. If recognition vocabulary modify and make an addition then recreated modeling of executed database collected and training sequence on account of bring on additional expenses and take more time. This study suggest efficient context dependent process modeling method using decision tree-based state tying. On study suggest method is reduce recreated of model and it's offered that robustness and accuracy of context dependent acoustic modeling. Also reduce amount of model and offered training process unseen model as concerns context dependent a likely phoneme model has been used unseen model solve the matter. System performance as a result of represent vocabulary dependence recognition rate of 98.01%, vocabulary independence recognition rate of 97.38%.
PDF KSCI

Vehicle Recognition using NMF in Urban Scene (도심 영상에서의 비음수행렬분해를 이용한 차량 인식)

Ban, Jae-Min;Lee, Byeong-Rae;Kang, Hyun-Chul
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.37 no.7C
- /
- pp.554-564
- /
- 2012
The vehicle recognition consists of two steps; the vehicle region detection step and the vehicle identification step based on the feature extracted from the detected region. Features using linear transformations have the effect of dimension reduction as well as represent statistical characteristics, and show the robustness in translation and rotation of objects. Among the linear transformations, the NMF(Non-negative Matrix Factorization) is one of part-based representation. Therefore, we can extract NMF features with sparsity and improve the vehicle recognition rate by the representation of local features of a car as a basis vector. In this paper, we propose a feature extraction using NMF suitable for the vehicle recognition, and verify the recognition rate with it. Also, we compared the vehicle recognition rate for the occluded area using the SNMF(sparse NMF) which has basis vectors with constraint and LVQ2 neural network. We showed that the feature through the proposed NMF is robust in the urban scene where occlusions are frequently occur.
https://doi.org/10.7840/KICS.2012.37.7C.554 인용 PDF KSCI

A Real-Time Embedded Speech Recognition System (실시간 임베디드 음성 인식 시스템)

남상엽;전은희;박인정
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.40 no.1
- /
- pp.74-81
- /
- 2003
In this study, we'd implemented a real time embedded speech recognition system that requires minimum memory size for speech recognition engine and DB. The word to be recognized consist of 40 commands used in a PCS phone and 10 digits. The speech data spoken by 15 male and 15 female speakers was recorded and analyzed by short time analysis method, which window size is 256. The LPC parameters of each frame were computed through Levinson-Burbin algorithm and they were transformed to Cepstrum parameters. Before the analysis, speech data should be processed by pre-emphasis that will remove the DC component in speech and emphasize high frequency band. Baum-Welch reestimation algorithm was used for the training of HMM. In test phone, we could get a recognition rate using likelihood method. We implemented an embedded system by porting the speech recognition engine on ARM core evaluation board. The overall recognition rate of this system was 95%, while the rate on 40 commands was 96% and that 10 digits was 94%.
PDF KSCI

The Development of a License Plate Recognition System using Template Matching Method in Embedded System (임베디드 시스템에서의 템플릿 매칭 기법을 이용한 번호판 인식 시스템 개발)

Kim, Hong-Hee;Lee, Jae-Heung
- Journal of IKEEE
- /
- v.15 no.4
- /
- pp.274-280
- /
- 2011
The implementation of the recognition system of a vehicle license plate and the Linux OS environment which is built in SoC Embedded system and its test result are presented in this paper. In order to recognize a vehicle license plate, each character has to be extracted from the whole image of a license plate and the extracted image is revised for the template matching. Labeling technique and numerical features are used to detect the vehicle license plate. Each character in the license plate has coordinates. The extracted image is revised by comparison of the numerical coordinates and recognized through template matching method. The experimental results show that the license plate detection rate is 96%, and a character recognition rate is 73%, and a number recognition rate is 97% for about 300 license plate images. The average time of the recognition in the embedded board is 0.66 sec.
https://doi.org/10.7471/ikeee.2011.15.4.274 인용 PDF KSCI

Search Result 2,809, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)