• Title/Summary/Keyword: recognition-rate


Vocabulary Recognition Performance Improvement using k-means Algorithm for GMM Support (GMM 지원을 위해 k-means 알고리즘을 이용한 어휘 인식 성능 개선)

  • Lee, Jong-Sub
    • Journal of Digital Convergence / v.13 no.2 / pp.135-140 / 2015
  • A general CHMM vocabulary recognition system has a low recognition rate because of how it models the observation probability for vocabulary recognition. The limiting unit it uses applies to only some of the phoneme models. It also fails to restrict the search range according to the meanings of the words in the vocabulary. To address these problems, phoneme recognition is performed using a GMM, and the limited word search is handled with an improved k-means algorithm. Effectiveness, expressed as accuracy and recall, is measured against the conventional system in performance experiments. In the performance tests, accuracy is 83%p and recall is 67%p.
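
The abstract does not say how the k-means step is improved, so the following is only a minimal sketch of plain (Lloyd's) k-means in Python. The function name `kmeans`, the toy 2-D feature vectors, and the initial centroids are all assumptions made for illustration, not the paper's method.

```python
import math

def kmeans(points, centroids, max_iter=100):
    """Plain Lloyd's k-means on tuples of floats (hypothetical helper)."""
    centroids = [tuple(c) for c in centroids]
    assignment = []
    for _ in range(max_iter):
        # Assignment step: attach each point to its nearest centroid.
        assignment = [
            min(range(len(centroids)), key=lambda k: math.dist(p, centroids[k]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for k, old in enumerate(centroids):
            members = [p for p, a in zip(points, assignment) if a == k]
            if members:
                new_centroids.append(
                    tuple(sum(xs) / len(members) for xs in zip(*members))
                )
            else:
                new_centroids.append(old)  # keep the centroid of an empty cluster
        if new_centroids == centroids:
            break  # converged: centroids no longer move
        centroids = new_centroids
    return centroids, assignment

# Toy feature vectors (made up for illustration).
points = [(1.0, 1.0), (1.0, 2.0), (2.0, 1.0), (8.0, 8.0), (9.0, 8.0), (8.0, 9.0)]
centroids, labels = kmeans(points, centroids=[(0.0, 0.0), (10.0, 10.0)])
```

In a GMM pipeline, centroids like these are commonly used to initialize the mixture means before further training; whether the paper does so is not stated in the abstract.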

Fingerprint Recognition System for On-line User Authentication (온라인 사용자 인증을 위한 지문인식 시스템)

  • Han, Sang-Hoon;Lee, Ho;Seo, Jeong-Man
    • Journal of the Korea Society of Computer and Information / v.11 no.1 s.39 / pp.283-292 / 2006
  • As interest in security technology grows, authentication based on fingerprints, a form of biometric information, is being applied to online user authentication to overcome security vulnerabilities. This study designs and implements a rotation-invariant fingerprint recognition system for online user authentication. The proposed method focuses on the matching process that follows preprocessing of the fingerprint image and feature point extraction. It improves the processing time and correct recognition rate of the rotation-invariant fingerprint recognition system presented in an existing study. It also reduces the noise and distortion problems that arise in the preprocessing of the existing study by applying a directional Laplacian filter.


Parking Space Recognition for Autonomous Valet Parking Using Height and Salient-Line Probability Maps

  • Han, Seung-Jun;Choi, Jeongdan
    • ETRI Journal / v.37 no.6 / pp.1220-1230 / 2015
  • An autonomous valet parking (AVP) system is designed to locate a vacant parking space and park the vehicle in which it resides on behalf of the driver, once the driver has left the vehicle. In addition, the AVP is able to direct the vehicle to a location desired by the driver when requested. In this paper, for an AVP system, we introduce technology to recognize a parking space using image sensors. The proposed technology is mainly divided into three parts. First, spatial analysis is carried out using a height map that is based on dense motion stereo. Second, modelling of road markings is conducted using a probability map with a new salient-line feature extractor. Finally, parking space recognition is based on a Bayesian classifier. The experimental results show an execution time of up to 10 ms and a recognition rate of over 99%. Also, the performance and properties of the proposed technology were evaluated with a variety of data. Our algorithms, which are part of the proposed technology, are expected to apply to various research areas regarding autonomous vehicles, such as map generation, road marking recognition, localization, and environment recognition.
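
The abstract names a Bayesian classifier but gives no details of its features or probabilities. As a rough illustration of the underlying decision rule only, here is a minimal Python sketch of Bayes' rule for a two-class decision; the class names, priors, and likelihood values are hypothetical and do not come from the paper.

```python
def posterior(priors, likelihoods):
    """Bayes' rule: P(class | x) is proportional to P(x | class) * P(class)."""
    joint = {c: priors[c] * likelihoods[c] for c in priors}
    evidence = sum(joint.values())  # P(x), the normalizing constant
    return {c: j / evidence for c, j in joint.items()}

# Hypothetical numbers for one candidate region.
priors = {"parking_space": 0.3, "not_parking_space": 0.7}
likelihoods = {"parking_space": 0.8, "not_parking_space": 0.1}

post = posterior(priors, likelihoods)
label = max(post, key=post.get)  # pick the class with the highest posterior
```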

Speech Emotion Recognition Using Confidence Level for Emotional Interaction Robot (감정 상호작용 로봇을 위한 신뢰도 평가를 이용한 화자독립 감정인식)

  • Kim, Eun-Ho
    • Journal of the Korean Institute of Intelligent Systems / v.19 no.6 / pp.755-759 / 2009
  • The ability to recognize human emotion is one of the hallmarks of human-robot interaction. In particular, speaker-independent emotion recognition is a challenging issue for the commercial use of speech emotion recognition systems. In general, speaker-independent systems show lower accuracy than speaker-dependent systems, because emotional feature values depend on the speaker and the speaker's gender. Hence, this paper describes the realization of speaker-independent emotion recognition by rejection using a confidence measure, which makes the emotion recognition system consistent and accurate. A comparison with the conventional method confirmed the improvement and effectiveness of the proposed methods.
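
The abstract does not define its confidence measure. The Python sketch below shows one simple way rejection by confidence can work, assuming the confidence is the top class score divided by the sum of all scores; the function name `classify_with_rejection`, the threshold, and the scores are illustrative assumptions.

```python
def classify_with_rejection(scores, threshold):
    """Return (label, confidence); label is None when confidence is too low."""
    best = max(scores, key=scores.get)
    total = sum(scores.values())
    confidence = scores[best] / total  # normalized top score (an assumed measure)
    if confidence >= threshold:
        return best, confidence
    return None, confidence  # reject: no emotion is reported

# Hypothetical per-emotion scores for two utterances.
clear = classify_with_rejection({"happy": 7.0, "sad": 2.0, "angry": 1.0}, 0.6)
ambiguous = classify_with_rejection({"happy": 4.0, "sad": 3.5, "angry": 2.5}, 0.6)
```

Rejecting ambiguous inputs trades coverage for accuracy: the system answers less often, but the answers it gives are more reliable.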

Development of Gate Operation System Based on Image Processing (영상처리에 기반한 게이트 운영시스템 개발)

  • 강대성;유영달
    • Journal of Korean Port Research / v.13 no.2 / pp.303-312 / 1999
  • This paper develops an automated gate operating system that manages container information at the gate of the ACT. The system consists of three parts: container identifier recognition, car plate recognition, and container deformation perception. We linked these subsystems to form an efficient gate operating system. To recognize the container identifier, preprocessing with LSPRD (Line Scan Proper Region Detection) is performed, and the identifier is then recognized using an MBP neural network. For car plate recognition, only the car image is extracted, using the color information of the car and the Hough transform. In the container deformation perception part, the background is first removed using a moving window. Second, edges are detected in the image after the characters on the container surface have been removed. Third, the edges are fitted to line segments so that container deformation can be perceived. Experiments with this algorithm show a superior identifier recognition rate, and the car plate recognition and container deformation perception systems developed here operate in real time.


Monosyllable Speech Recognition through Facial Movement Analysis (안면 움직임 분석을 통한 단음절 음성인식)

  • Kang, Dong-Won;Seo, Jeong-Woo;Choi, Jin-Seung;Choi, Jae-Bong;Tack, Gye-Rae
    • The Transactions of The Korean Institute of Electrical Engineers / v.63 no.6 / pp.813-819 / 2014
  • The purpose of this study was to extract accurate parameters of facial movement features using a 3-D motion capture system, for speech recognition through lip-reading. Instead of features obtained from a traditional camera image, the 3-D motion system was used to obtain quantitative data on actual facial movements and to analyze 11 variables that exhibit particular patterns, such as nose, lip, jaw, and cheek movements, in monosyllable vocalizations. Fourteen subjects, all in their 20s, were asked to vocalize 11 types of Korean vowel monosyllables three times each with 36 reflective markers on their faces. The facial movement data obtained were then converted into 11 parameters and presented as patterns for each monosyllable vocalization. The parameter patterns were learned and recognized for each monosyllable using speech recognition algorithms based on the Hidden Markov Model (HMM) and the Viterbi algorithm. The accuracy of recognizing the 11 monosyllables was 97.2%, which suggests that Korean speech can be recognized through quantitative facial movement analysis.
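
For readers unfamiliar with the Viterbi algorithm mentioned above, here is a minimal Python sketch that finds the most likely hidden-state sequence of a small discrete HMM. The states, observation symbols, and probabilities are toy values chosen for illustration; the paper's actual models and parameters are not given in the abstract.

```python
def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return (probability, path) of the most likely hidden-state sequence."""
    # best[s] = (probability of the best path ending in state s, that path)
    best = {s: (start_p[s] * emit_p[s][observations[0]], [s]) for s in states}
    for obs in observations[1:]:
        step = {}
        for s in states:
            # Choose the best predecessor r for state s.
            prob, path = max(
                ((best[r][0] * trans_p[r][s], best[r][1]) for r in states),
                key=lambda item: item[0],
            )
            step[s] = (prob * emit_p[s][obs], path + [s])
        best = step
    return max(best.values(), key=lambda item: item[0])

# Toy two-state model (all values hypothetical).
states = ["s1", "s2"]
start_p = {"s1": 0.6, "s2": 0.4}
trans_p = {"s1": {"s1": 0.7, "s2": 0.3}, "s2": {"s1": 0.4, "s2": 0.6}}
emit_p = {
    "s1": {"low": 0.5, "mid": 0.4, "high": 0.1},
    "s2": {"low": 0.1, "mid": 0.3, "high": 0.6},
}

prob, path = viterbi(["low", "mid", "high"], states, start_p, trans_p, emit_p)
```

With long observation sequences, implementations usually work with log-probabilities to avoid numerical underflow; plain products are used here only to keep the sketch short.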

Face Pose Transformation for Pose Invariant Face Recognition (포즈에 독립적인 얼굴 인식을 위한 얼굴 포즈 변환)

  • Park Hyun-Sun;Park Jong-Il;Kim Whoi-Yul
    • The Journal of Korean Institute of Communications and Information Sciences / v.30 no.6C / pp.570-576 / 2005
  • Recognition of posed faces is one of the most challenging problems in the field of face recognition. In this paper, as a preprocessing step for recognizing such faces, a method to transform non-frontal face images into frontal face images is proposed. The linear relationship between eigenfaces is used to obtain a pose transform matrix. The proposed method is verified with a well-known face recognition algorithm based on PCA/LDA. Compared with the conventional algorithm applied to the original posed face images, our experimental results indicate that the proposed method improves the recognition rate of such faces by 20%.

A Study on the Model Recognition of Moving Vehicles Using a Neural Network (신경망을 이용한 운행차량의 차종인식 연구)

  • Lee, Hyo-Jong
    • Journal of the Institute of Electronics Engineers of Korea SP / v.42 no.4 s.304 / pp.69-78 / 2005
  • The number of vehicles is increasing rapidly as modern industrialization spreads worldwide. Vehicle recognition has been studied for some time because many people acknowledge that it plays a critical role in solving problems of traffic control and vehicle-related crime. In this paper, a novel method is proposed to recognize vehicle models by their corresponding makers in order to increase the efficiency of recognition. Texture features are computed from the frontal image of a vehicle. A three-layer neural network was built and trained with the texture features for recognition. The proposed method shows a 95% recognition rate for the models of moving vehicles.

Isolated Word Recognition with the E-MIND II Neurocomputer (E-MIND II를 이용한 고립 단어 인식 시스템의 설계)

  • Kim, Joon-Woo;Jeong, Hong;Kim, Myeong-Won
    • Journal of the Korean Institute of Telematics and Electronics B / v.32B no.11 / pp.1527-1535 / 1995
  • This paper introduces an isolated word recognition system realized on a neurocomputer called E-MIND II, which is a 2-D torus wavefront array processor consisting of 256 DNP IIs. The DNP II is an all-digital VLSI unit processor for the E-MIND II, featuring the capability to emulate thousands of neurons, a 40 MHz clock speed, and on-chip learning. Built from these PEs in a 2-D toroidal mesh architecture, the E-MIND II can reach a computation speed of over 2 Gcps. Its advantages in computing speed, scalability, computer interface, and learning make it especially suitable for real-time applications such as speech recognition. We show how to map a TDNN structure onto this array and how to code the learning and recognition algorithms for user-independent isolated word recognition. Through hardware simulation, we show that the recognition rate of this system is about 97% for 30 command words for robot control.


Gradation Image Processing for Text Recognition in Road Signs Using Image Division and Merging

  • Chong, Kyusoo
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.13 no.2 / pp.27-33 / 2014
  • This paper proposes a gradation image processing method for the development of a Road Sign Recognition Platform (RReP), which aims to facilitate the rapid and accurate management and surveying of approximately 160,000 road signs installed along the highways, national roadways, and local roads in the cities, districts (gun), and provinces (do) of Korea. RReP is based on GPS (Global Positioning System), IMU (Inertial Measurement Unit), INS (Inertial Navigation System), DMI (Distance Measurement Instrument), and lasers. It uses an imagery information collection/classification module to allow the automatic recognition of signs, the collection of shape, pole location, and sign-type data, and the creation of road sign registers, by extracting basic data related to sign shape and content and through automated database design. Image division and merging, which were applied in this study, produce superior results compared with the local binarization method in terms of speed. The results show that larger text areas were found in the images and that the accuracy of text recognition improved when the images had been gradated. Multi-threshold values of natural scene images are used to improve the extraction rate of text and figures based on pattern recognition.
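
The abstract does not describe how its multi-threshold values are chosen or applied. As a generic illustration of multi-level thresholding only, the Python sketch below maps each gray level to the band it falls in; the function name `quantize`, the threshold values, and the tiny image are assumptions made for this example.

```python
from bisect import bisect_right

def quantize(gray_rows, thresholds):
    """Map each gray level to the index of the threshold band it falls in."""
    cuts = sorted(thresholds)
    # bisect_right counts how many thresholds are <= the pixel value,
    # so a pixel equal to a threshold lands in the upper band.
    return [[bisect_right(cuts, g) for g in row] for row in gray_rows]

# Hypothetical 2x3 grayscale image with values in 0..255.
image = [[12, 70, 130], [200, 90, 40]]
bands = quantize(image, thresholds=[64, 128, 192])
```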