Search | Korea Science

A Study on Human Recognition Experiments with Handwritten Digit for Machine Recognition of Handwritten Digit (필기 숫자의 기계 인식을 위한 인간의 필기 숫자 인식 실험에 대한 고찰)

Yoon, Sung-Soo;Chung, Hyun-Sook;Yi, Kwang-Oh;Lee, Yill-Byeong;Lee, Sang-Ho
- Journal of the Korean Institute of Intelligent Systems
- /
- v.18 no.3
- /
- pp.373-380
- /
- 2008
So far there have been many researches on machine-based recognition of handwritten digit. But we have not yet attained the level of performance that can be satisfactory to men. The dissatisfaction with the performance of machine comes from not only the low accuracy of recognition but also the dissimilarity of the recognition results between man and machine. To reduce the difference of machine from man we first made an experiment with the human recognition of handwritten digits and then inquiry into the way of the human recognition that makes the results of men different from that of machine. We found out the attributes that play an important role in the human recognition process through the analysis of the experimental results like uni- and bi-directional confused pairs of digits, several ones unmixed up with another and the redundancy of mis-recognition, and proposed the approach direction to be able to improve the accuracy of the machine-based recognition, and furthermore the similarity in the recognition results of men and machine on the basis of the found facts above.
https://doi.org/10.5391/JKIIS.2008.18.3.373 인용 PDF KSCI

Digit Recognition using Speech and Image Information (음성과 영상정보를 이용한 우리말 숫자음 인식)

조현욱;이종혁
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2001.10a
- /
- pp.257-260
- /
- 2001
We propose The Korean digit recognition system using speech and image information. In the experiments, we investigate that image information affect recognition rate. Recognition rate of teamed data and testing data show 100%, 78% each other.
PDF

Performance Improvement of Connected Digit Recognition with Channel Compensation Method for Telephone speech (채널보상기법을 사용한 전화 음성 연속숫자음의 인식 성능향상)

Kim Min Sung;Jung Sung Yun;Son Jong Mok;Bae Keun Sung
- MALSORI
- /
- no.44
- /
- pp.73-82
- /
- 2002
Channel distortion degrades the performance of speech recognizer in telephone environment. It mainly results from the bandwidth limitation and variation of transmission channel. Variation of channel characteristics is usually represented as baseline shift in the cepstrum domain. Thus undesirable effect of the channel variation can be removed by subtracting the mean from the cepstrum. In this paper, to improve the recognition performance of Korea connected digit telephone speech, channel compensation methods such as CMN (Cepstral Mean Normalization), RTCN (Real Time Cepatral Normalization), MCMN (Modified CMN) and MRTCN (Modified RTCN) are applied to the static MFCC. Both MCMN and MRTCN are obtained from the CMN and RTCN, respectively, using variance normalization in the cepstrum domain. Using HTK v3.1 system, recognition experiments are performed for Korean connected digit telephone speech database released by SITEC (Speech Information Technology & Industry Promotion Center). Experiments have shown that MRTCN gives the best result with recognition rate of 90.11% for connected digit. This corresponds to the performance improvement over MFCC alone by 1.72%, i.e, error reduction rate of 14.82%.
PDF

A Tow-stage Recognition Approach Based on Error Pattern Hypotheses for Connected Digit Recognition

Oh, Wook-Kwon;Un, Chong-Kwan
- The Journal of the Acoustical Society of Korea
- /
- v.15 no.3E
- /
- pp.31-36
- /
- 1996
In this paper, a two-stage recognition approach based on error pattern hypotheses is proposed to reduce errors of a connected digit recognizer. In the approach, a conventional recognizer is first used to produce N-best candidate strings, and then error patterns are hypothesized by examining the candidate strings. For substitution error pattern hypotheses, error-pattern-dependent classifiers having more discriminative power than the first-stage classifier are used ; and for insertion and deletion errors, word duration and energy contour information are exploited are exploited to discriminated confusing pairs. Simulation results showed that the proposed approach achieves 15% decrease in word error rate for speaker-independent Korean connected digit recognition when a hidden Markov model-based recognizer is used for the first-stage classifier.
PDF

7-Segment Optical Character Recognition Using Template Matching (템플릿 매칭을 이용한 7-세그먼트 광학 문자 인식)

Jung, Min Chul
- Journal of the Semiconductor & Display Technology
- /
- v.19 no.4
- /
- pp.130-134
- /
- 2020
This paper proposes a new method for the digit recognition on a 7-segment display. The proposed method uses morphological processing that dilates segments of digits and connects them into strokes. The digits are extracted by connected component analysis and finally, template matching method recognizes the extracted digits. The proposed method is implemented using C language in Raspberry Pi 4 system with a camera module for a real-time image processing. Experiments were conducted by using various 7-segment LED displays and 7-segment mono LCD displays. The results show that the proposed method is successful for the digit recognition on the 7-segment displays.
PDF KSCI

A Study on the Spoken KOrean-Digit Recognition Using the Neural Netwok (神經網을 利用한 韓國語數字音認識에 관한 硏究)

Park, Hyun-Hwa;Gahang, Hae Dong;Bae, Keun Sung
- The Journal of the Acoustical Society of Korea
- /
- v.11 no.3
- /
- pp.5-13
- /
- 1992
Taking devantage of the property that Korean digit is a mono-syllable word, we proposed a spoken Korean-digit recognition scheme using the multi-layer perceptron. The spoken Korean-digit is divided into three segments (initial sound, medial vowel, and final consonant) based on the voice starting / ending points and a peak point in the middle of vowel sound. The feature vectors such as cepstrum, reflection coefficients, ${\Delta}$cepstrum and ${\Delta}$energy are extracted from each segment. It has been shown that cepstrum, as an input vector to the neural network, gives higher recognition rate than reflection coefficients. Regression coefficients of cepstrum did not affect as much as we expected on the recognition rate. That is because, it is believed, we extracted features from the selected stationary segments of the input speech signal. With 150 ceptral coefficients obtained from each spoken digit, we achieved correct recognition rate of 97.8%.
PDF

A Spoken Korean-Digits Recognition System Based on Linear Prdiction Spectra (선형예측에 의한 숫자음성 자동인식)

;安居院猛
- Journal of the Korean Institute of Telematics and Electronics
- /
- v.17 no.3
- /
- pp.12-19
- /
- 1980
A speech recognition system for separately pronounced Korean digits is described. The system is composed of four stages ; parameter extraction, segmentation by voiced-unovied analysis, formant tracking and pattern matching. Digit speech is segmented into an unvoiced segment and/or a voiced one using ZCR and energy measurements, then to estimate the first three formant frequencies a relatively simple formant tracking scheme is applied to the raw formant data extracted from linear prediction spectra. Finally, pattern matching is made using dynamic programmig method. Recognition experiment is carried out for 150 digit utterences spoken by three male speakers, and recgnition rate 94 % is obtained.
PDF

Handwriting Thai Digit Recognition Using Convolution Neural Networks (다양한 컨볼루션 신경망을 이용한 태국어 숫자 인식)

Onuean, Athita;Jung, Hanmin;Kim, Taehong
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2021.05a
- /
- pp.15-17
- /
- 2021
Handwriting recognition research is mainly focused on deep learning techniques and has achieved a great performance in the last few years. Especially, handwritten Thai digit recognition has been an important research area including generic digital numerical information, such as Thai official government documents and receipts. However, it becomes also a challenging task for a long time. For resolving the unavailability of a large Thai digit dataset, this paper constructs our dataset and learns them with some variants of the CNN model; Decision tree, K-nearest neighbors, Alexnet, LaNet-5, and VGG (11,13,16,19). The experimental results using the accuracy metric show the maximum accuracy of 98.29% when using VGG 13 with batch normalization.
PDF

Performance Comparison of Korean Connected Digit Telephone Speech Recognition According to Aurora Feature Extraction (Aurora 특징파라미터 추출기법에 따른 한국어 연속숫자음 전화음성의 인식 성능 비교)

Kim Min Sung;Jung Sung Yun;Son Jong Mok;Bae Keun Sung;Kim Sang Hun
- Proceedings of the KSPS conference
- /
- 2003.10a
- /
- pp.145-148
- /
- 2003
To improve the recognition performance of Korean connected digit telephone speech, in this paper, both Aurora feature extraction method that employs noise reduction 2-state Wiener filter and DWFBA method are investigated and used. CMN and MRTCN are applied to static features for channel compensation. Telephone digit speech database released by SITEC is used for recognition experiments with HTK system. Experimental results has shown that Aurora feature is slightly better than MFCC and DWFBA without channel compensation. And when channel compensation is included, Aurora feature is slightly better than DWFBA with MRTCN.
PDF

A Study on the Features for Building Korean Digit Recognition System Based on Multilayer Perceptron (다층 퍼셉트론에 기반한 한국어 숫자음 인식시스템 구현을 위한 특징 연구)

김인철;김대영
- Journal of Korea Society of Industrial Information Systems
- /
- v.6 no.4
- /
- pp.81-88
- /
- 2001
In this paper, a Korean digit recognition system based on a multilayer Perceptron is implemented. We also investigate the performance of widely used speech features, such as the Mel-scale filterbank, MFCC, LPCC, and PLP coefficients, by applying them as input of the proposed recognition system. In order to build a robust speech system, the experiments for demonstrating its recognition performance for the clean data as well as corrupt data are carried out. In experiments of recognizing 20 Korean digit, we found that the Mel-scale filterbank coefficients performs best in terms of recognition accuracy for the speech dependent and speech independent database even though noise is considerably added.
PDF

Search Result 138, Processing Time 0.399 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)