• Title/Summary/Keyword: recognition-rate

Search Result 2,809, Processing Time 0.027 seconds

An ASIC implementation of a Dual Channel Acoustic Beamforming for MEMS microphone in 0.18㎛ CMOS technology (0.18㎛ CMOS 공정을 이용한 MEMS 마이크로폰용 이중 채널 음성 빔포밍 ASIC 설계)

  • Jang, Young-Jong;Lee, Jea-Hack;Kim, Dong-Sun;Hwang, Tae-ho
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.13 no.5
    • /
    • pp.949-958
    • /
    • 2018
  • A voice recognition control system is a system for controlling a peripheral device by recognizing a voice. Recently, a voice recognition control system have been applied not only to smart devices but also to various environments ranging from IoT(: Internet of Things), robots, and vehicles. In such a voice recognition control system, the recognition rate is lowered due to the ambient noise in addition to the voice of the user. In this paper, we propose a dual channel acoustic beamforming hardware architecture for MEMS(: Microelectromechanical Systems) microphones to eliminate ambient noise in addition to user's voice. And the proposed hardware architecture is designed as ASIC(: Application-Specific Integrated Circuit) using TowerJazz $0.18{\mu}m$ CMOS(: Complementary Metal-Oxide Semiconductor) technology. The designed dual channel acoustic beamforming ASIC has a die size of $48mm^2$, and the directivity index of the user's voice were measured to be 4.233㏈.

A Method for Tennis Swing Recognition Using Accelerator Sensors on a Smartphone (스마트폰 가속도 센서를 이용한 테니스 스윙 인식 방법)

  • Kim, Sangchul;Che, Zhong Yong
    • Journal of Korea Game Society
    • /
    • v.13 no.2
    • /
    • pp.29-38
    • /
    • 2013
  • Recently there has been an increasing interest on tangible games in which human motions are recognized rather than the handling of keyboards and mouses. Such games require a motion controller for recognizing the motions of users. In this paper, we analyze the characteristics of values of accelerator sensors which are generated by a user who perform a tennis swing while holding a smartphone with his/her hand, and propose a method for motion recognition based on DWT(Discrete Wavelet Transform). The proposed method enables a smartphone to serve as a motion controller, so that a user can enjoy a tangible tennis game without eliminates a need for buying the device. We developed a tennis game prototype using the proposed method. To our experiment, our method showed a high recognition rate and the usefulness in the game.

3D Facial Landmark Tracking and Facial Expression Recognition

  • Medioni, Gerard;Choi, Jongmoo;Labeau, Matthieu;Leksut, Jatuporn Toy;Meng, Lingchao
    • Journal of information and communication convergence engineering
    • /
    • v.11 no.3
    • /
    • pp.207-215
    • /
    • 2013
  • In this paper, we address the challenging computer vision problem of obtaining a reliable facial expression analysis from a naturally interacting person. We propose a system that combines a 3D generic face model, 3D head tracking, and 2D tracker to track facial landmarks and recognize expressions. First, we extract facial landmarks from a neutral frontal face, and then we deform a 3D generic face to fit the input face. Next, we use our real-time 3D head tracking module to track a person's head in 3D and predict facial landmark positions in 2D using the projection from the updated 3D face model. Finally, we use tracked 2D landmarks to update the 3D landmarks. This integrated tracking loop enables efficient tracking of the non-rigid parts of a face in the presence of large 3D head motion. We conducted experiments for facial expression recognition using both framebased and sequence-based approaches. Our method provides a 75.9% recognition rate in 8 subjects with 7 key expressions. Our approach provides a considerable step forward toward new applications including human-computer interactions, behavioral science, robotics, and game applications.

3D Face Recognition using Cumulative Histogram of Surface Curvature (표면곡률의 누적히스토그램을 이용한 3차원 얼굴인식)

  • 이영학;배기억;이태흥
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.5
    • /
    • pp.605-616
    • /
    • 2004
  • A new practical implementation of a facial verification system using cumulative histogram of surface curvatures for the local and contour line areas is proposed, in this paper. The approach works by finding the nose tip that has a protrusion shape on the face. In feature recognition of 3D face images, one has to take into consideration the orientated frontal posture to normalize after extracting face area from the original image. The feature vectors are extracted by using the cumulative histogram which is calculated from the curvature of surface for the contour line areas: 20, 30 and 40, and nose, mouth and eyes regions, which has depth and surface characteristic information. The L1 measure for comparing two feature vectors were used, because it was simple and robust. In the experimental results, the maximum curvature achieved recognition rate of 96% among the proposed methods.

Variations of AlexNet and GoogLeNet to Improve Korean Character Recognition Performance

  • Lee, Sang-Geol;Sung, Yunsick;Kim, Yeon-Gyu;Cha, Eui-Young
    • Journal of Information Processing Systems
    • /
    • v.14 no.1
    • /
    • pp.205-217
    • /
    • 2018
  • Deep learning using convolutional neural networks (CNNs) is being studied in various fields of image recognition and these studies show excellent performance. In this paper, we compare the performance of CNN architectures, KCR-AlexNet and KCR-GoogLeNet. The experimental data used in this paper is obtained from PHD08, a large-scale Korean character database. It has 2,187 samples of each Korean character with 2,350 Korean character classes for a total of 5,139,450 data samples. In the training results, KCR-AlexNet showed an accuracy of over 98% for the top-1 test and KCR-GoogLeNet showed an accuracy of over 99% for the top-1 test after the final training iteration. We made an additional Korean character dataset with fonts that were not in PHD08 to compare the classification success rate with commercial optical character recognition (OCR) programs and ensure the objectivity of the experiment. While the commercial OCR programs showed 66.95% to 83.16% classification success rates, KCR-AlexNet and KCR-GoogLeNet showed average classification success rates of 90.12% and 89.14%, respectively, which are higher than the commercial OCR programs' rates. Considering the time factor, KCR-AlexNet was faster than KCR-GoogLeNet when they were trained using PHD08; otherwise, KCR-GoogLeNet had a faster classification speed.

A Study on Image Recognition based on the Characteristics of Retinal Cells (망막 세포 특성에 의한 영상인식에 관한 연구)

  • Cho, Jae-Hyun;Kim, Do-Hyeon;Kim, Kwang-Baek
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.11 no.11
    • /
    • pp.2143-2149
    • /
    • 2007
  • Visual Cortex Stimulator is among artificial retina prosthesis for blind man, is the method that stimulate the brain cell directly without processing the information from retina to visual cortex. In this paper, we propose image construction and recognition model that is similar to human visual processing by recognizing the feature data with orientation information, that is, the characteristics of visual cortex. Back propagation algorithm based on Delta-bar delta is used to recognize after extracting image feature by Kirsh edge detector. Various numerical patterns are used to analyze the performance of proposed method. In experiment, the proposed recognition model to extract image characteristics with the orientation of information from retinal cells to visual cortex makes a little difference in a recognition rate but shows that it is not sensitive in a variety of learning rates similar to human vision system.

Development of a Detection and Recognition System for Rectangular Marker (사각형 마커 검출 및 인식 시스템 개발)

  • Kang Sun-Kyung;Lee Sang-Seol;Jung Sung-Tae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.11 no.4 s.42
    • /
    • pp.97-107
    • /
    • 2006
  • In this paper, we present a method for the detection and recognition of rectangular markers from a camera image. It converts the camera image to a binary image and extracts contours of objects in the binary image. After that. it approximates the contours to a list of line segments. It finds rectangular markers by using geometrical features which are extracted from the approximated line segments. It normalizes the shape of extracted markers into exact squares by using the warping technique. It extracts feature vectors from marker image by using principal component analysis. It then calculates the distance between feature vector of input marker image and those of standard markers. Finally, it recognizes the marker by using minimum distance method. Experimental results show that the Proposed method achieves 98% recognition rate at maximum for 50 markers and execution speed of 11.1 frames/sec for images which contains eleven markers.

  • PDF

A Survey on Korean Medicine Doctors' Recognition and Treatment for Developing Korean Medicine Clinical Practice Guideline of Coldness of Hands and Feet (한의표준임상진료지침 개발을 위한 수족냉증에 대한 한의사의 인식과 치료현황)

  • Lee, Dong-Nyung;Kim, Hyung-Jun;Yu, Jun-Sang
    • The Journal of Korean Obstetrics and Gynecology
    • /
    • v.30 no.3
    • /
    • pp.92-116
    • /
    • 2017
  • Objectives: The purpose of this study were to researched a Korean medicine doctors' recognition about coldness of hands and feet, and developing of korean medicine clinical practice guidelines (CPG) for coldness of hands and feet. Methods: We conducted a questionnaire survey targeting 399 Korean medicine doctors belonging to the Association of Korean Medicine by e-mail and analyzed the answers. Results: 1. 86.86% of the respondents agreed about the necessity of CPG for coldness of hands and feet. 2. 84.2% of respondents wanted coding of Korean Standard Classification of Diseases (KCD) on coldness of hands and feet. 3. To diagnosis a coldness of hands and feet, the respondents used a Subjective symptoms (98.5%), Infrared thermographic imaging device (DITI) (26.32%) Heart rate variablity test (HRV) (17.04%), Thermometer (9.77%), Cold stress test (2.76%) 4. Causing of coldness of hands and feet, the respondents considered a constitution or heredity (84.71%), stress (73.66%), lack of exercise (64.91%), irregular eating habits (51.63%), Cold meals (32.83%), depression (31.33%), etc. 5. Treating coldness of hands and feet, the respondents used a herbal medicine (66.85%), acupuncture (70.7%) Pharmacopuncture (23.85%) and moxibustion (60.08%) for $10.91{\pm}8.03week$. Conclusions: We researched a Korean Medicine doctors' recognition of CPG, clinical diagnosis, treatment on a coldness of hands and feet, and policy they required.

A Korean speech recognition based on conformer (콘포머 기반 한국어 음성인식)

  • Koo, Myoung-Wan
    • The Journal of the Acoustical Society of Korea
    • /
    • v.40 no.5
    • /
    • pp.488-495
    • /
    • 2021
  • We propose a speech recognition system based on conformer. Conformer is known to be convolution-augmented transformer, which combines transfer model for capturing global information with Convolution Neural Network (CNN) for exploiting local feature effectively. The baseline system is developed to be a transfer-based speech recognition using Long Short-Term Memory (LSTM)-based language model. The proposed system is a system which uses conformer instead of transformer with transformer-based language model. When Electronics and Telecommunications Research Institute (ETRI) speech corpus in AI-Hub is used for our evaluation, the proposed system yields 5.7 % of Character Error Rate (CER) while the baseline system results in 11.8 % of CER. Even though speech corpus is extended into other domain of AI-hub such as NHNdiguest speech corpus, the proposed system makes a robust performance for two domains. Throughout those experiments, we can prove a validation of the proposed system.

Regional Boundary Operation for Character Recognition Using Skeleton (골격을 이용한 문자 인식을 위한 지역경계 연산)

  • Yoo, Suk Won
    • The Journal of the Convergence on Culture Technology
    • /
    • v.4 no.4
    • /
    • pp.361-366
    • /
    • 2018
  • For each character constituting learning data, different fonts are added in pixel unit to create MASK, and then pixel values belonging to the MASK are divided into three groups. The experimental data are modified into skeletal forms, and then regional boundary operation is used to create a boundary that distinguishes the background region adjacent to the skeleton of the character from the background of the modified experimental data. Discordance values between the modified experimental data and the MASKs are calculated, and then the MASK with the minimum value is found. This MASK is selected as a finally recognized result for the given experiment data. The recognition algorithm using skeleton of the character and the regional boundary operation can easily extend the learning data set by adding new fonts to the given learning data, and also it is simple to implement, and high character recognition rate can be obtained.