• Title/Summary/Keyword: problem recognition

Search Result 1,868, Processing Time 0.026 seconds

Half-Against-Half Multi-class SVM Classify Physiological Response-based Emotion Recognition

  • Vanny, Makara;Ko, Kwang-Eun;Park, Seung-Min;Sim, Kwee-Bo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.23 no.3
    • /
    • pp.262-267
    • /
    • 2013
  • The recognition of human emotional state is one of the most important components for efficient human-human and human- computer interaction. In this paper, four emotions such as fear, disgust, joy, and neutral was a main problem of classifying emotion recognition and an approach of visual-stimuli for eliciting emotion based on physiological signals of skin conductance (SC), skin temperature (SKT), and blood volume pulse (BVP) was used to design the experiment. In order to reach the goal of solving this problem, half-against-half (HAH) multi-class support vector machine (SVM) with Gaussian radial basis function (RBF) kernel was proposed showing the effective techniques to improve the accuracy rate of emotion classification. The experimental results proved that the proposed was an efficient method for solving the emotion recognition problems with the accuracy rate of 90% of neutral, 86.67% of joy, 85% of disgust, and 80% of fear.

Polynomial Higher Order Neural Network for Shift-invariant Pattern Recognition (위치 변환 패턴 인식을 위한 다항식 고차 뉴럴네트워크)

  • Chung, Jong-Su;Hong, Sung-Chan
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.12
    • /
    • pp.3063-3068
    • /
    • 1997
  • In this paper, we have extended the generalization back-propagation algorithm to multi-layer polynomial higher order neural networks. The purpose of this paper is to describe various pattern recognition using polynomial higher-order neural network. And we have applied shift position T-C test pattern for invariant pattern recognition and measured generalization by mirror symmetry problem. simulation result shows that the ability for invariant pattern recognition increase with the proposed technique. Recognition rate of invariant T-C pattern is 90% effective and of mirror symmetry problem is 70% effective when the proposed technique is utilized. These results are much better than those by the conventional methods.

  • PDF

Type of subjective recognition on the problem and policy alternatives to violence response experienced by emergency medical technicians (119구급대원이 경험한 폭력대응에 대한 문제점과 정책대안의 주관적 인식유형)

  • Lee, Ga-Yeon;Choi, Eun-Sook
    • The Korean Journal of Emergency Medical Services
    • /
    • v.26 no.1
    • /
    • pp.37-56
    • /
    • 2022
  • Purpose: This study aimed to identify and present suitable recognition types of policy alternative for before and after response, according to the recognition types of problems in response to violence. Methods: This study investigated 36 EMT's of 17 cities and provinces nationwide. The study was approved by the Kongju National University Institute Review Board (KNU_IRB_2021-17). Data were collected from May 1, 2021 to August 30, 2021 and analyzed by Q factor analysis using the PC-QUNAL program. Results: Recognition types of the problem in 119 EMT's response to violence were described as "I type; lack of professional manpower," "II type; inadequate policy on violence," and "III type; lack of awareness on the emergency field." Recognition types of policy alternative on response to violence by 119 EMT's were described as "Itype; training and public relations oriented," "II type; work environment improvement," "III type; violence handling specialization demand," and "IV type; recovery support seeker." Conclusion: This study provides the foundation required to develop and implement the policies regarding the response to violence; therefore, contributing to EMT's provision.

Pose-normalized 3D Face Modeling for Face Recognition

  • Yu, Sun-Jin;Lee, Sang-Youn
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.35 no.12C
    • /
    • pp.984-994
    • /
    • 2010
  • Pose variation is a critical problem in face recognition. Three-dimensional(3D) face recognition techniques have been proposed, as 3D data contains depth information that may allow problems of pose variation to be handled more effectively than with 2D face recognition methods. This paper proposes a pose-normalized 3D face modeling method that translates and rotates any pose angle to a frontal pose using a plane fitting method by Singular Value Decomposition(SVD). First, we reconstruct 3D face data with stereo vision method. Second, nose peak point is estimated by depth information and then the angle of pose is estimated by a facial plane fitting algorithm using four facial features. Next, using the estimated pose angle, the 3D face is translated and rotated to a frontal pose. To demonstrate the effectiveness of the proposed method, we designed 2D and 3D face recognition experiments. The experimental results show that the performance of the normalized 3D face recognition method is superior to that of an un-normalized 3D face recognition method for overcoming the problems of pose variation.

Subword-based Lip Reading Using State-tied HMM (상태공유 HMM을 이용한 서브워드 단위 기반 립리딩)

  • Kim, Jin-Young;Shin, Do-Sung
    • Speech Sciences
    • /
    • v.8 no.3
    • /
    • pp.123-132
    • /
    • 2001
  • In recent years research on HCI technology has been very active and speech recognition is being used as its typical method. Its recognition, however, is deteriorated with the increase of surrounding noise. To solve this problem, studies concerning the multimodal HCI are being briskly made. This paper describes automated lipreading for bimodal speech recognition on the basis of image- and speech information. It employs audio-visual DB containing 1,074 words from 70 voice and tri-viseme as a recognition unit, and state tied HMM as a recognition model. Performance of automated recognition of 22 to 1,000 words are evaluated to achieve word recognition of 60.5% in terms of 22word recognizer.

  • PDF

An Efficient Binarization Method for Vehicle License Plate Character Recognition

  • Yang, Xue-Ya;Kim, Kyung-Lok;Hwang, Byung-Kon
    • Journal of Korea Multimedia Society
    • /
    • v.11 no.12
    • /
    • pp.1649-1657
    • /
    • 2008
  • In this paper, to overcome the failure of binarization for the characters suffered from low contrast and non-uniform illumination in license plate character recognition system, we improved the binarization method by combining local thresholding with global thresholding and edge detection. Firstly, apply the local thresholding method to locate the characters in the license plate image and then get the threshold value for the character based on edge detector. This method solves the problem of local low contrast and non-uniform illumination. Finally, back-propagation Neural Network is selected as a powerful tool to perform the recognition process. The results of the experiments i1lustrate that the proposed binarization method works well and the selected classifier saves the processing time. Besides, the character recognition system performed better recognition accuracy 95.7%, and the recognition speed is controlled within 0.3 seconds.

  • PDF

Recognition Performance Improvement for Noisy-speech by Parallel Model Compensation Adaptation Using Frequency-variant added with ML (최대우도를 부가한 주파수 변이 PMC 방법의 잡음 음성 인식 성능개선)

  • Choi, Sook-Nam;Chung, Hyun-Yeol
    • Journal of Korea Multimedia Society
    • /
    • v.16 no.8
    • /
    • pp.905-913
    • /
    • 2013
  • The Parallel Model Compensation Using Frequency-variant: FV-PMC for noise-robust speech recognition is a method to classify the noises, which are expected to be intermixed with input speech when recognized, into several groups of noises by setting average frequency variant as a threshold value; and to recognize the noises depending on the classified groups. This demonstrates the excellent performance considering noisy speech categorized as good using the standard threshold value. However, it also holds a problem to decrease the average speech recognition rate with regard to unclassified noisy speech, for it conducts the process of speech recognition, combined with noiseless model as in the existing PMC. To solve this problem, this paper suggests a enhanced method of recognition to prevent the unclassified through improving the extent of rating scales with use of maximum likelihood so that the noise groups, including input noisy speech, can be classified into more specific groups, which leads to improvement of the recognition rate. The findings from recognition experiments using Aurora 2.0 database showed the improved results compared with those from the method of the previous FV-PMC.

Oversampling-Based Ensemble Learning Methods for Imbalanced Data (불균형 데이터 처리를 위한 과표본화 기반 앙상블 학습 기법)

  • Kim, Kyung-Min;Jang, Ha-Young;Zhang, Byoung-Tak
    • KIISE Transactions on Computing Practices
    • /
    • v.20 no.10
    • /
    • pp.549-554
    • /
    • 2014
  • Handwritten character recognition data is usually imbalanced because it is collected from the natural language sentences written by different writers. The imbalanced data can cause seriously negative effect on the performance of most of machine learning algorithms. But this problem is typically ignored in handwritten character recognition, because it is considered that most of difficulties in handwritten character recognition is caused by the high variance in data set and similar shapes between characters. We propose the oversampling-based ensemble learning methods to solve imbalanced data problem in handwritten character recognition and to improve the recognition accuracy. Also we show that proposed method achieved improvements in recognition accuracy of minor classes as well as overall recognition accuracy empirically.

The Long Distance Face Recognition using Multiple Distance Face Images Acquired from a Zoom Camera (줌 카메라를 통해 획득된 거리별 얼굴 영상을 이용한 원거리 얼굴 인식 기술)

  • Moon, Hae-Min;Pan, Sung Bum
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.24 no.6
    • /
    • pp.1139-1145
    • /
    • 2014
  • User recognition technology, which identifies or verifies a certain individual is absolutely essential under robotic environments for intelligent services. The conventional face recognition algorithm using single distance face image as training images has a problem that face recognition rate decreases as distance increases. The face recognition algorithm using face images by actual distance as training images shows good performance but this has a problem that it requires user cooperation. This paper proposes the LDA-based long distance face recognition method which uses multiple distance face images from a zoom camera for training face images. The proposed face recognition technique generated better performance by average 7.8% than the technique using the existing single distance face image as training. Compared with the technique that used face images by distance as training, the performance fell average 8.0%. However, the proposed method has a strength that it spends less time and requires less cooperation to users when taking face images.

Method for Spectral Enhancement by Binary Mask for Speech Recognition Enhancement Under Noise Environment (잡음환경에서 음성인식 성능향상을 위한 바이너리 마스크를 이용한 스펙트럼 향상 방법)

  • Choi, Gab-Keun;Kim, Soon-Hyob
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.7
    • /
    • pp.468-474
    • /
    • 2010
  • The major factor that disturbs practical use of speech recognition is distortion by the ambient and channel noises. Generally, the ambient noise drops the performance and restricts places to use. DSR (Distributed Speech Recognition) based speech recognition also has this problem. Various noise cancelling algorithms are applied to solve this problem, but loss of spectrum and remaining noise by incorrect noise estimation at low SNR environments cause drop of recognition rate. This paper proposes methods for speech enhancement. This method uses MMSE-STSA for noise cancelling and ideal binary mask to compensate damaged spectrum. According to experiments at noisy environment (SNR 15 dB ~ 0 dB), the proposed methods showed better spectral results and recognition performance.