Search | Korea Science

A Study on Korean 4-connected Digit Recognition Using Demi-syllable Context-dependent Models (반음절 문맥종속 모델을 이용한 한국어 4 연숫자음 인식에 관한 연구)

이기영;최성호;이호영;배명진
- The Journal of the Acoustical Society of Korea
- /
- v.22 no.3
- /
- pp.175-181
- /
- 2003
Because a word of Korean digits is a syllable and deeply coarticulatied in connected digits, some recognition models based on demisyllables have been proposed by researchers. However, they could not show an excellent recognition results yet. This paper proposes a recognition model based on extended and context-dependent demisyllables, such as a tri-demisyllable like a tri-phone, for the Korean 4-connected digits recognition. For experiments, we use a toolkit of HTK 3.0 for building this model of continuous HMMs using training Korean connected digits from SiTEC database and for recognizing unknown ones. The results show that the recognition rate is 92% and this model has an ability to improve the recognition performance of Korean connected digits.
PDF KSCI

7-Segment Optical Character Recognition Using Template Matching (템플릿 매칭을 이용한 7-세그먼트 광학 문자 인식)

Jung, Min Chul
- Journal of the Semiconductor & Display Technology
- /
- v.19 no.4
- /
- pp.130-134
- /
- 2020
This paper proposes a new method for the digit recognition on a 7-segment display. The proposed method uses morphological processing that dilates segments of digits and connects them into strokes. The digits are extracted by connected component analysis and finally, template matching method recognizes the extracted digits. The proposed method is implemented using C language in Raspberry Pi 4 system with a camera module for a real-time image processing. Experiments were conducted by using various 7-segment LED displays and 7-segment mono LCD displays. The results show that the proposed method is successful for the digit recognition on the 7-segment displays.
PDF KSCI

Study on the Recognition of Spoken Korean Continuous Digits Using Phone Network (음성망을 이용한 한국어 연속 숫자음 인식에 관한 연구)

Lee, G.S.;Lee, H.J.;Byun, Y.G.;Kim, S.H.
- Proceedings of the KIEE Conference
- /
- 1988.07a
- /
- pp.624-627
- /
- 1988
This paper describes the implementation of recognition of speaker - dependent Korean spoken continuous digits. The recognition system can be divided into two parts, acoustic - phonetic processor and lexical decoder. Acoustic - phonetic processor calculates the feature vectors from input speech signal and the performs frame labelling and phone labelling. Frame labelling is performed by Bayesian classification method and phone labelling is performed using labelled frame and posteriori probability. The lexical decoder accepts segments (phones) from acoustic - phonetic processor and decodes its lexical structure through phone network which is constructed from phonetic representation of ten digits. The experiment carried out with two sets of 4continuous digits, each set is composed of 35 patterns. An evaluation of the system yielded a pattern accuracy of about 80 percent resulting from a word accuracy of about 95 percent.
PDF

A Study on the Recognition of Korean 4 Connected Digits Considering Co-articulation (조음결합을 고려한 4연 숫자음 인식에 관한 연구)

이종진;이광석;허강인;김명기;고시영
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.17 no.1
- /
- pp.20-28
- /
- 1992
Co-articulation is one of major factors that make connected word recognition difficult. This Study Considers the fact that the head Part Of the following word is changed by the Preceding word in a connection point, by applying the co-articulation model, and adj usting the following word .We choose a critical damping second order linear system for the co-articulation model, combining a one-stage DP matching recognition algorithm with this model, and Investigating the effects. The recognition experiment is carried out for 35 Korean 4 connected digits spoken by 5 male speakers, and recognition rate Is upgraded by 4.7 percent.
PDF

Car License Plate Extraction Based on Detection of Numeral Regions (숫자 영역 탐색에 기반한 자동차 번호판 추출)

Lee, Duk-Ryong;Oh, Il-Seok
- The Journal of The Korea Institute of Intelligent Transport Systems
- /
- v.7 no.1
- /
- pp.59-67
- /
- 2008
In this paper we propose an algorithm to extract the license plate regions from Korean car images. The idea of this paper is that we first find the four digits in the input car image and then segment the plate region using the digit information. Out method has advantage of segmenting simultaneously the plate regions and four digits regions. The first step finds and groups the connected components with proper sizes as candidate digits. The second step applies an serial alignment condition to find out probable 4-digits. In the third step, we recognize the candidate digits and assign the confidence values to each of them. The final step extracts the license plate region which has the highest confidence value. We used the Perfect Metrics classification algorithm to estimate the confidence. In our experiment, we got 97.23% and 95.45% correct detection rates, 0.09% and 0.11% false detection rates for 4,600 daytime and 264 nighttime images, respectively.
PDF

Online Digit Recognition using Start and End Point

Shim, Jae-chang;Ansari, Md Israfil
- Journal of Multimedia Information System
- /
- v.4 no.1
- /
- pp.39-42
- /
- 2017
Communication between human and machine is having been researched from last few decades and still it's a challenging task because human behavior is unpredictable. When it comes on handwritten digits almost each human has their own writing style. Handwritten digit recognition plays an important role, especially in the courtesy amounts on bank checks, postal code on mail address etc. In our study, we proposed an efficient feature extraction system for recognizing single digit number drawn by mouse or by a finger on a screen. Our proposed method combines basic image processing and reading the strokes of a line drawn. It is very simple and easy to implement in various platform as compare to the system which required high system configuration. This system has been designed, implemented, and tested successfully.
https://doi.org/10.9717/JMIS.2017.4.1.39 인용 PDF KSCI

Recognition of Korean Isolated Digits Using a Pole-Zero Model (Polo-Zero 모델을 이용한 한국어 단독 숫자음 인식)

;;Alan Conrad Bovik
- Journal of the Korean Institute of Telematics and Electronics
- /
- v.25 no.4
- /
- pp.356-365
- /
- 1988
In this paper, we describe an isolated words recognition system for Korean isolated digits based on a voiced -unvoiced decision algorithm and a frequency domain analysis. The algorithm first performs a voiced-unvoiced decision procedure for the begtinning part of each uttered work using the normalized log energy and zero crossing rate as decision parameters. Based on this decision,. each word is assigned to one of two classes. In order to identify the uttered word within each class, a dynamic time warping algorithm is applied using formant frequencies as the basis for the distance measure. We exploit a pole-zero analysis to measure formant frequencies in each frame. We have observed that pole-zero analysis can provide more accurate estimation of formant frequencies than analysis based on poles only. Experimental recognition rates of 97.3% illustrating the performance of the recognition system was achieved.
PDF

A Study on Phoneme Likely Units to Improve the Performance of Context-dependent Acoustic Models in Speech Recognition (음성인식에서 문맥의존 음향모델의 성능향상을 위한 유사음소단위에 관한 연구)

임영춘;오세진;김광동;노덕규;송민규;정현열
- The Journal of the Acoustical Society of Korea
- /
- v.22 no.5
- /
- pp.388-402
- /
- 2003
In this paper, we carried out the word, 4 continuous digits. continuous, and task-independent word recognition experiments to verify the effectiveness of the re-defined phoneme-likely units (PLUs) for the phonetic decision tree based HM-Net (Hidden Markov Network) context-dependent (CD) acoustic modeling in Korean appropriately. In case of the 48 PLUs, the phonemes /ㅂ/, /ㄷ/, /ㄱ/ are separated by initial sound, medial vowel, final consonant, and the consonants /ㄹ/, /ㅈ/, /ㅎ/ are also separated by initial sound, final consonant according to the position of syllable, word, and sentence, respectively. In this paper. therefore, we re-define the 39 PLUs by unifying the one phoneme in the separated initial sound, medial vowel, and final consonant of the 48 PLUs to construct the CD acoustic models effectively. Through the experimental results using the re-defined 39 PLUs, in word recognition experiments with the context-independent (CI) acoustic models, the 48 PLUs has an average of 7.06%, higher recognition accuracy than the 39 PLUs used. But in the speaker-independent word recognition experiments with the CD acoustic models, the 39 PLUs has an average of 0.61% better recognition accuracy than the 48 PLUs used. In the 4 continuous digits recognition experiments with the liaison phenomena. the 39 PLUs has also an average of 6.55% higher recognition accuracy. And then, in continuous speech recognition experiments, the 39 PLUs has an average of 15.08% better recognition accuracy than the 48 PLUs used too. Finally, though the 48, 39 PLUs have the lower recognition accuracy, the 39 PLUs has an average of 1.17% higher recognition characteristic than the 48 PLUs used in the task-independent word recognition experiments according to the unknown contextual factor. Through the above experiments, we verified the effectiveness of the re-defined 39 PLUs compared to the 48PLUs to construct the CD acoustic models in this paper.
PDF KSCI

A Study on the Automatic Recognition of Korean Basic Spoken Digit Using Energy of Special Bandwidth (특정 대역 에너지를 이용한 한국어 기본 수자 음성의 백동 인식에 관한 연구)

Han, Hee;Kim, Soon-Hyob;Park, Kyu-Tae
- Journal of the Korean Institute of Telematics and Electronics
- /
- v.19 no.3
- /
- pp.5-12
- /
- 1982
Through the use of energy ratio of special bandwidths of basic vowels, recognition of Korean basic spoken digit is performed in logical combination with a zero-crossing rate and an energy parameter. In the experiments for recognition of the digits, the speech signal of spoken digits is filtered by a lowpass filter of which the cutoff frequency is 10KHz, and then sampled at 20KHz of sampling rate, In the speech signal processing, we used four FIR digital filters, and the order of filter lengths is 61, 120, 25, 25respectively. The filters are designed by using Remetz exchange algorithm.[13],[14] As a result, the recognition rate of 92% for the three speakers is obstained.
PDF

A Study on the Algorithm Development for Speech Recognition of Korean and Japanese (한국어와 일본어의 음성 인식을 위한 알고리즘 개발에 관한 연구)

Lee, Sung-Hwa;Kim, Hyung-Lae
- Journal of IKEEE
- /
- v.2 no.1 s.2
- /
- pp.61-67
- /
- 1998
In this thesis, experiment have performed with the speaker recognition using multilayer feedforward neural network(MFNN) model using Korean and Japanese digits . The 5 adult males and 5 adult females pronounciate form 0 to 9 digits of Korean, Japanese 7 times. And then, they are extracted characteristics coefficient through Pitch deletion algorithm, LPC analysis, and LPC Cepstral analysis to generate input pattern of MFNN. 5 times among them are used to train a neural network, and 2 times is used to measure the performance of neural network. Both Korean and Japanese, Pitch coefficients is about 4%t more enhanced than LPC or LPC Cepstral coefficients.
PDF

Search Result 38, Processing Time 0.032 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)