• Title/Summary/Keyword: recognition-rate

Search Result 2,809, Processing Time 0.029 seconds

Surface Curvature Based 3D Pace Image Recognition Using Depth Weighted Hausdorff Distance (표면 곡률을 이용하여 깊이 가중치 Hausdorff 거리를 적용한 3차원 얼굴 영상 인식)

  • Lee Yeung hak;Shim Jae chang
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.1
    • /
    • pp.34-45
    • /
    • 2005
  • In this paper, a novel implementation of a person verification system based on depth-weighted Hausdorff distance (DWHD) using the surface curvature of the face is proposed. The definition of Hausdorff distance is a measure of the correspondence of two point sets. The approach works by finding the nose tip that has a protrusion shape on the face. In feature recognition of 3D face image, one has to take into consideration the orientated frontal posture to normalize after extracting face area from original image. The binary images are extracted by using the threshold values for the curvature value of surface for the person which has differential depth and surface characteristic information. The proposed DWHD measure for comparing two pixel sets were used, because it is simple and robust. In the experimental results, the minimum curvature which has low pixel distribution achieves recognition rate of 98% among the proposed methods.

  • PDF

Hand posture recognition robust to rotation using temporal correlation between adjacent frames (인접 프레임의 시간적 상관 관계를 이용한 회전에 강인한 손 모양 인식)

  • Lee, Seong-Il;Min, Hyun-Seok;Shin, Ho-Chul;Lim, Eul-Gyoon;Hwang, Dae-Hwan;Ro, Yong-Man
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.11
    • /
    • pp.1630-1642
    • /
    • 2010
  • Recently, there is an increasing need for developing the technique of Hand Gesture Recognition (HGR), for vision based interface. Since hand gesture is defined as consecutive change of hand posture, developing the algorithm of Hand Posture Recognition (HPR) is required. Among the factors that decrease the performance of HPR, we focus on rotation factor. To achieve rotation invariant HPR, we propose a method that uses the property of video that adjacent frames in video have high correlation, considering the environment of HGR. The proposed method introduces template update of object tracking using the above mentioned property, which is different from previous works based on still images. To compare our proposed method with previous methods such as template matching, PCA and LBP, we performed experiments with video that has hand rotation. The accuracy rate of the proposed method is 22.7%, 14.5%, 10.7% and 4.3% higher than ordinary template matching, template matching using KL-Transform, PCA and LBP, respectively.

Phonetic Question Set Generation Algorithm (음소 질의어 집합 생성 알고리즘)

  • 김성아;육동석;권오일
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.2
    • /
    • pp.173-179
    • /
    • 2004
  • Due to the insufficiency of training data in large vocabulary continuous speech recognition, similar context dependent phones can be clustered by decision trees to share the data. When the decision trees are built and used to predict unseen triphones, a phonetic question set is required. The phonetic question set, which contains categories of the phones with similar co-articulation effects, is usually generated by phonetic or linguistic experts. This knowledge-based approach for generating phonetic question set, however, may reduce the homogeneity of the clusters. Moreover, the experts must adjust the question sets whenever the language or the PLU (phone-like unit) of a recognition system is changed. Therefore, we propose a data-driven method to automatically generate phonetic question set. Since the proposed method generates the phone categories using speech data distribution, it is not dependent on the language or the PLU, and may enhance the homogeneity of the clusters. In large vocabulary speech recognition experiments, the proposed algorithm has been found to reduce the error rate by 14.3%.

A New Power Spectrum Warping Approach to Speaker Warping (화자 정규화를 위한 새로운 파워 스펙트럼 Warping 방법)

  • 유일수;김동주;노용완;홍광석
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.41 no.4
    • /
    • pp.103-111
    • /
    • 2004
  • The method of speaker normalization has been known as the successful method for improving the accuracy of speech recognition at speaker independent speech recognition system. A frequency warping approach is widely used method based on maximum likelihood for speaker normalization. This paper propose a new power spectrum warping approach to making improvement of speaker normalization better than a frequency warping. Th power spectrum warping uses Mel-frequency cepstrum analysis(MFCC) and is a simple mechanism to performing speaker normalization by modifying the power spectrum of Mel filter bank in MFCC. Also, this paper propose the hybrid VTN combined the Power spectrum warping and a frequency warping. Experiment of this paper did a comparative analysis about the recognition performance of the SKKU PBW DB applied each speaker normalization approach on baseline system. The experiment results have shown that a frequency warping is 2.06%, the power spectrum is 3.06%, and hybrid VTN is 4.07% word error rate reduction as of word recognition performance of baseline system.

Enhanced ART1 Algorithm for the Recognition of Student Identification Cards of the Educational Matters Administration System on the Web (웹 환경 학사관리 시스템의 학생증 인식을 위한 개선된 ART1 알고리즘)

  • Park Hyun-Jung;Kim Kwang-Baek
    • Journal of the Korea Society of Computer and Information
    • /
    • v.10 no.5 s.37
    • /
    • pp.333-342
    • /
    • 2005
  • This paper proposes a method, which recognizes student's identification card by using image processing and recognition technology and can manage student information on the web. The presented scheme sets up an average brightness as a threshold, based on the brightest Pixel and the least bright one for the source image of the ID card. It is converting to binary image, applies a horizontal histogram, and extracts student number through its location. And, it removes the noise of the student number region by the mode smoothing with 3$\times$3 mask. After removing noise from the student number region, each number is extracted using vertical histogram and normalized. Using the enhanced ART1 algorithm recognized the extracted student number region. In this study, we propose the enhanced ART1 algorithm different from the conventional ART1 algorithm by the dynamical establishment of the vigilance parameter. which shows a tolerance limit of unbalance between voluntary and stored patterns for clustering. The Experiment results showed that the recognition rate of the proposed ART1 algorithm was improved much more than that of the conventional ART1 algorithm. So, we develop an educational matters administration system by using the proposed recognition method of the student's identification card.

  • PDF

A Noisy-Robust Approach for Facial Expression Recognition

  • Tong, Ying;Shen, Yuehong;Gao, Bin;Sun, Fenggang;Chen, Rui;Xu, Yefeng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.4
    • /
    • pp.2124-2148
    • /
    • 2017
  • Accurate facial expression recognition (FER) requires reliable signal filtering and the effective feature extraction. Considering these requirements, this paper presents a novel approach for FER which is robust to noise. The main contributions of this work are: First, to preserve texture details in facial expression images and remove image noise, we improved the anisotropic diffusion filter by adjusting the diffusion coefficient according to two factors, namely, the gray value difference between the object and the background and the gradient magnitude of object. The improved filter can effectively distinguish facial muscle deformation and facial noise in face images. Second, to further improve robustness, we propose a new feature descriptor based on a combination of the Histogram of Oriented Gradients with the Canny operator (Canny-HOG) which can represent the precise deformation of eyes, eyebrows and lips for FER. Third, Canny-HOG's block and cell sizes are adjusted to reduce feature dimensionality and make the classifier less prone to overfitting. Our method was tested on images from the JAFFE and CK databases. Experimental results in L-O-Sam-O and L-O-Sub-O modes demonstrated the effectiveness of the proposed method. Meanwhile, the recognition rate of this method is not significantly affected in the presence of Gaussian noise and salt-and-pepper noise conditions.

Robust Speech Recognition Parameters for Emotional Variation (감정 변화에 강인한 음성 인식 파라메터)

  • Kim Weon-Goo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.15 no.6
    • /
    • pp.655-660
    • /
    • 2005
  • This paper studied the feature parameters less affected by the emotional variation for the development of the robust speech recognition technologies. For this purpose, the effect of emotional variation on the speech recognition system and robust feature parameters of speech recognition system were studied using speech database containing various emotions. In this study, LPC cepstral coefficient, met-cepstral coefficient, root-cepstral coefficient, PLP coefficient, RASTA met-cepstral coefficient were used as a feature parameters. And CMS and SBR method were used as a signal bias removal techniques. Experimental results showed that the HMM based speaker independent word recognizer using RASTA met-cepstral coefficient :md its derivatives and CMS as a signal bias removal showed the best performance of $7.05\%$ word error rate. This corresponds to about a $52\%$ word error reduction as compare to the performance of baseline system using met - cepstral coefficient.

An Efficient Face Recognition Using First Moment of Image and Basis Images (영상의 1차 모멘트와 기저영상을 이용한 효율적인 얼굴인식)

  • Cho Yong-Hyun
    • The KIPS Transactions:PartB
    • /
    • v.13B no.1 s.104
    • /
    • pp.7-14
    • /
    • 2006
  • This paper presents an efficient face recognition method using both first moment of image and basis images. First moment which is a method for finding centroid of image, is applied to exclude the needless backgrounds in the face recognitions by shifting to the centroid of face image. Basis images which are the face features, are respectively extracted by principal component analysis(PCA) and fixed-point independent component analysis(FP-ICA). This is to improve the recognition performance by excluding the redundancy considering to second- and higher-order statistics of face image. The proposed methods has been applied to the problem for recognizing the 48 face images(12 persons*4 scenes) of 64*64 pixels. The 3 distances such as city-block, Euclidean, negative angle are used as measures when match the probe images to the nearest gallery images. The experimental results show that the proposed methods has a superior recognition performances(speed, rate) than conventional PCA and FP-ICA without preprocessing, the proposed FP-ICA has also better performance than the proposed PCA. The city-block has been relatively achieved more an accurate similarity than Euclidean or negative angle.

A Technique of Feature Vector Generation for Eye Region Using Embedded Information of Various Color Spaces (다양한 색공간 정보를 이용한 눈 영역의 특징벡터 생성 기법)

  • Park, Jung-Hwan;Shin, Pan-Seop;Kim, Guk-Boh;Jung, Jong-Jin
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.64 no.1
    • /
    • pp.82-89
    • /
    • 2015
  • The researches of image recognition have been processed traditionally. Especially, face recognition technology has been received attractions with advance and applied to various areas according as camera sensor embedded into many devices such as smart phone. In this study, we design and develop a feature vector generation technique of face for making animation caricatures using methods for face detection which are previous stage of face recognition. At first, we detect both face region and detailed eye region of component element by Viola&Johns's realtime detection method which are called as ROI(Region Of Interest). And then, we generate feature vectors of eye region by utilizing factors as opposed to the periphery and by using appearance information of eye. At this point, we focus on the embedded information in many color spaces to overcome the problems which can be occurred by using one color space. We propose a feature vector generation method using information from many color spaces. Finally, we experiment the test of feature vector generation by the proposed method with enough quantity of sample picture data and evaluate the proposed method for factors of estimating performance such as error rate, accuracy and generation time.

A Rehabilitation Training System Using the infrared LED based Motion Analysis (적외선 LED 기반 모션 분석을 통한 재활훈련 시스템)

  • Choi, Wan;Lim, Chul-Su;Kim, Tae-Young
    • Journal of the Korea Computer Graphics Society
    • /
    • v.13 no.4
    • /
    • pp.29-36
    • /
    • 2007
  • The objective of this paper is to develop the motion recognition method for the rehabilitation program of the people with physical disabilities. Using the infrared ray LED, this method gets the information of the user's motion and tells the difference between the information we get and the pre-defined information. In order to get the fast recognition and comparison, this method positions the specific coordinates of infrared ray LED in the pre-classified sector which keeps certain distance and angle from the point of origin. The result in our experiment shows the speed over 20 frames per second in every motion, and we are satisfied the rate for the recognition of the motions which is near 98%.

  • PDF