Search | Korea Science

Server based Mobile Multi-lingual Recognition System of Name-card (서버기반 모바일 다국어 명함인식 시스템)

Jang, Dong-Hyeub;Lee, Jae-Hong;Kim, Seong-Hak
- KIPS Transactions on Software and Data Engineering
- /
- v.3 no.4
- /
- pp.155-162
- /
- 2014
In this study, we developed a server-based mobile multi-lingual name-card recognition system which utilizes smartphone only as a terminal for capturing images of name-card and displaying results of recognition, running server as a recognizer of characters. For efficient processing and transmission of captured images, we corrected the distorted images, removed noises from them, and defined the socket-based protocol for wireless transmission of images between smartphone and the recognizer on server. Various tests for name-cards of five language types show increased recognition rate and speed of the developed system against conventional smartphone-based recognizers.
https://doi.org/10.3745/KTSDE.2014.3.4.155 인용 PDF KSCI

A Study on the Digital Signal Processing for the Pattern fiecognition of Weld Flaws (용접결함의 패턴인식을 위한 디지털 신호처리에 관한 연구)

김재열;송찬일;김병현
- Proceedings of the Korean Society of Precision Engineering Conference
- /
- 1995.10a
- /
- pp.393-396
- /
- 1995
In this syudy, the researches classifying the artificial and natural flaws in welding parts are performed using the smart pattern recognition technology. For this purpose the smart signal pattern recognition package including the user defined function was developed and the total procedure including the digital signal processing,feature extraction , feature selection and classifier selection is treated by bulk. Specially it is composed with and discussed using the statistical classifier such as the linear disciminant function classifier, the empirical Bayesian classifier. Also, the smart pattern recognition technology is applied to classification problem of natural flaw(i.e multiple classification problem-crack,lack of penetration,lack of fusion,porosity,and slag inclusion, the planar and volumetric flaw classification problem). According to this results, if appropriately learned the neural network classifier is better than ststistical classifier in the classification problem of natural flaw. And it is possible to acquire the recognition rate of 80% above through it is different a little according to domain extracting the feature and the classifier.
PDF

Broken Detection of the Traffic Sign by using the Location Histogram Matching

Yang, Liu;Lee, Suk-Hwan;Kwon, Seong-Geun;Moon, Kwang-Seok;Kwon, Ki-Ryong
- Journal of Korea Multimedia Society
- /
- v.15 no.3
- /
- pp.312-322
- /
- 2012
The paper presents an approach for recognizing the broken area of the traffic signs. The method is based on the Recognition System for Traffic Signs (RSTS). This paper describes an approach to using the location histogram matching for the broken traffic signs recognition, after the general process of the image detection and image categorization. The recognition proceeds by using the SIFT matching to adjust the acquired image to a standard position, then the histogram bin will be compared preprocessed image with reference image, and finally output the location and percents value of the broken area. And between the processing, some preprocessing like the blurring is added in the paper to improve the performance. And after the reorganization, the program can operate with the GPS for traffic signs maintenance. Experimental results verified that our scheme have a relatively high recognition rate and a good performance in general situation.
https://doi.org/10.9717/kmms.2012.15.3.312 인용 PDF KSCI

Polynomial Higher Order Neural Network for Shift-invariant Pattern Recognition (위치 변환 패턴 인식을 위한 다항식 고차 뉴럴네트워크)

Chung, Jong-Su;Hong, Sung-Chan
- The Transactions of the Korea Information Processing Society
- /
- v.4 no.12
- /
- pp.3063-3068
- /
- 1997
In this paper, we have extended the generalization back-propagation algorithm to multi-layer polynomial higher order neural networks. The purpose of this paper is to describe various pattern recognition using polynomial higher-order neural network. And we have applied shift position T-C test pattern for invariant pattern recognition and measured generalization by mirror symmetry problem. simulation result shows that the ability for invariant pattern recognition increase with the proposed technique. Recognition rate of invariant T-C pattern is 90% effective and of mirror symmetry problem is 70% effective when the proposed technique is utilized. These results are much better than those by the conventional methods.
PDF

Korean Continuous Speech Recognition Using Discrete Duration Control Continuous HMM (이산 지속시간제어 연속분포 HMM을 이용한 연속 음성 인식)

Lee, Jong-Jin;Kim, Soo-Hoon;Hur, Kang-In
- The Journal of the Acoustical Society of Korea
- /
- v.14 no.1
- /
- pp.81-89
- /
- 1995
In this paper, we report the continuous speech recognition system using the continuous HMM with discrete duration control and the regression coefficients. Also, we do recognition experiment using One Pass DP method(for 25 sentences of robot control commands) with finite state automata context control. In the experiment for 4 connected spoken digits, the recognition rates are $93.8\%$ when the discrete duration control and the regression coefficients are included, and $80.7\%$ when they are not included. In the experiment for 25 sentences of the robot control commands, the recognition rate are $90.9\%$ when FSN is not included and $98.4\%$ when FSN is included.
PDF

Implementation of HMM Based Speech Recognizer with Medium Vocabulary Size Using TMS320C6201 DSP (TMS320C6201 DSP를 이용한 HMM 기반의 음성인식기 구현)

Jung, Sung-Yun;Son, Jong-Mok;Bae, Keun-Sung
- The Journal of the Acoustical Society of Korea
- /
- v.25 no.1E
- /
- pp.20-24
- /
- 2006
In this paper, we focused on the real time implementation of a speech recognition system with medium size of vocabulary considering its application to a mobile phone. First, we developed the PC based variable vocabulary word recognizer having the size of program memory and total acoustic models as small as possible. To reduce the memory size of acoustic models, linear discriminant analysis and phonetic tied mixture were applied in the feature selection process and training HMMs, respectively. In addition, state based Gaussian selection method with the real time cepstral normalization was used for reduction of computational load and robust recognition. Then, we verified the real-time operation of the implemented recognition system on the TMS320C6201 EVM board. The implemented recognition system uses memory size of about 610 kbytes including both program memory and data memory. The recognition rate was 95.86% for ETRI 445DB, and 96.4%, 97.92%, 87.04% for three kinds of name databases collected through the mobile phones.
PDF KSCI

The Continuous Speech Recognition with Prosodic Phrase Unit (운율구 단위의 연속음 인식)

강지영;엄기완;김진영;최승호
- The Journal of the Acoustical Society of Korea
- /
- v.18 no.8
- /
- pp.9-16
- /
- 1999
Generally, a speaker structures utterances very clearly by grouping words into phrases. This facilitates the listener's recovery of the meaning of the utterance and the speaker's intention. To this purpose, a speaker uses, among other things, prosodic information such as intonation pause, duration, intensity, etc. The research described here is concerned with the relationship between the strength of prosodic boundaries in spoken utterances as perceived by untrained listeners(Perceptual boundary strength, PBS)-In this paper, the preceptual boundary strength is used as the same meaning of the prosodic boundary strength-and prosodic information. We made a rule determinating the prosodic boundaries and verified the usefulness of the prosodic phrase as a recognition unit. Experiments results showed that the performance of speech recognition(SR) is improved in aspect of recognition rate and time compared with that using sentences as recognition unit. In the future we will suggest the methods that estimate more appropriate boundaries and study more various methods of prosody assisted SR.
PDF

A VQ Codebook Design Based on Phonetic Distribution for Distributed Speech Recognition (분산 음성인식 시스템의 성능향상을 위한 음소 빈도 비율에 기반한 VQ 코드북 설계)

Oh Yoo-Rhee;Yoon Jae-Sam;Lee Gil-Ho;Kim Hong-Kook;Ryu Chang-Sun;Koo Myoung-Wa
- Proceedings of the KSPS conference
- /
- 2006.05a
- /
- pp.37-40
- /
- 2006
In this paper, we propose a VQ codebook design of speech recognition feature parameters in order to improve the performance of a distributed speech recognition system. For the context-dependent HMMs, a VQ codebook should be correlated with phonetic distributions in the training data for HMMs. Thus, we focus on a selection method of training data based on phonetic distribution instead of using all the training data for an efficient VQ codebook design. From the speech recognition experiments using the Aurora 4 database, the distributed speech recognition system employing a VQ codebook designed by the proposed method reduced the word error rate (WER) by 10% when compared with that using a VQ codebook trained with the whole training data.
PDF

Design of A Speech Recognition System using Hidden Markov Models (은닉 마코프 모델을 이용한 음성 인식 시스템 설계)

Lee, Chul-Won;Lim, In-Chil
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.33B no.1
- /
- pp.108-115
- /
- 1996
This paper proposes an algorithm and a model topology for the connected speech recognition using Discrete Hidden Markov Models. A proposed model uses diphone and triphone model which consider the recognition rate and recognisable vocabulary. Considering more exact inter- phoneme segmentation and execution speed of algorithm, 4 states have to exist in diphone model where the first state and the last state are keeping a steady state, the other states hold a transient state. 7 states have to exist in triphone model where 7 states are specified and improved to 3 steady states and 4 transition states. Also, the proposed speech recognition algorithm is designed to detect the inter-phoneme segmentation during the recognition processing.
PDF

Text-dependent Speaker Recognition System Using DTW & VQ (VQ와 DTW를 이용한 문장 의존형 화자인식 시스템)

Jung JongSoon;Oh SeYoung;Bae MyungJin
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.97-103
- /
- 2001
The speaker recognition method using DTW algorithm has the problem that is reducing the performance of the speaker recognition system as the time variation. So there are many proposed algorithms to solve these problems. This paper proposes the new method If make the reference pattern that is acceptable to intra-speaker variation by reference pattern normalization. And to avoid reducing performance of speaker recognition system, we use the modified reference pattern to recognize the system user. The used methods in this paper are VQ and DTW. As the result of simulation we can obtain the $97.5\%$ of recognition accuracy rate.
PDF

Search Result 2,809, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)