• Title/Summary/Keyword: Characteristic of Recognition

Search Result 508, Processing Time 0.025 seconds

The Recognition of Unvoiced Consonants Using Characteristic Parameters of the Phonemes (음소 특정 파라미터를 이용한 무성자음 인식)

  • 허만택;이종혁;남기곤;윤태훈;김재창;이양성
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.31B no.4
    • /
    • pp.175-182
    • /
    • 1994
  • In this study, we present unvoiced consonant recognition system using characteristic parameters of the phoneme of the each syllable. For the recognition, the characteristic parameters on the time domain such as ZCR, total energy of the consonant region and half region energy of the consonant region, and those on the frequency domain such as the frequency spectrum of the transition region are used. The objective unvoiced consonants in this study are /ㄱ/,/ㄷ/,/ㅂ/,/ㅈ/,/ㅋ/,/ㅌ/,/ㅍ/ and /ㅊ/. Each characteristic parameter of two regions extracted from these segmented unvoiced consonants are used for each recognition system of the region, independently, And complementing two outputs of each other system, the final output is to be produced. The recognition system is implemented using MLP which has learning ability. The recognition simulation results for 112 unvoiced consonant samples are that average recognition rates are 96.4$\%$ under 80$\%$ learning rates and 93.7$\%$ under 60$\%$ learning rates.

  • PDF

A Study on Speaker Recognition Using MFCC Parameter Space (파마메터 공간을 이용한 화자인식에 관한 연구)

  • Lee Yong-woo;Lim dong-Chol;Lee Haing Sea
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.57-60
    • /
    • 2001
  • This paper reports on speaker-Recognition of context independence-speaker recognition in the field of the speech recognition. It is important to select the parameter reflecting the characteristic of each single person because speaker-recognition is to identify who speaks in the database. We used Mel Frequency Cesptrum Coefficient and Vector Quantization to identify in this paper. Specially, it considered to find characteristic-vector of the speaker in different from known method; this paper used the characteristic-vector which is selected in MFCC Parameter Space. Also, this paper compared the recognition rate according to size of codebook from this database and the time needed for operation with the existing one. The results is more improved $3\sim4\%$ for recognition rate than established Vector Quantization Algorithm.

  • PDF

An Analysis on the characteristic of recognition about Individual Housing according to the landscape in Donghae Seaside (동해연안 주택외관의 인지특성에 관한 연구)

  • Cho, Won-Seok;Kim, Heung-Ki;Kim, Yong-Ki;Joo, Jae-Woo;Kim, Jung-Hyun
    • Journal of the Korean Institute of Rural Architecture
    • /
    • v.7 no.3
    • /
    • pp.27-35
    • /
    • 2005
  • This study is about finding out characteristic of recognition individual housing in seaside of Donghae. To accomplish this purpose, we survey the 150 houses related to the landscape. Thus the major analysis is to take basic data, such as image(modern, western, traditional, etc) about exterior form of housing corresponding to the landscape. The result summarized as follows First, the elements for the characteristic of recognition were exterior material finish, exterior color, roof type, roof material finish, window size, roof slope, area of wall vs roof. Second, the image of traditional housing was very insufficient to plan landscape of housing with design elements. This research suggests that landscape housing of future is to be environmental landscape design and the proper design is to be various considering not only user's preference but also control of landscape.

  • PDF

Lidar Based Object Recognition and Classification (자율주행을 위한 라이다 기반 객체 인식 및 분류)

  • Byeon, Yerim;Park, Manbok
    • Journal of Auto-vehicle Safety Association
    • /
    • v.12 no.4
    • /
    • pp.23-30
    • /
    • 2020
  • Recently, self-driving research has been actively studied in various institutions. Accurate recognition is important because information about surrounding objects is needed for safe autonomous driving. This study mainly deals with the signal processing of LiDAR among sensors for object recognition. LiDAR is a sensor that is widely used for high recognition accuracy. First, we clustered and tracked objects by predicting relative position and speed of objects. The characteristic points of all objects were extracted using point cloud data of each objects through proposed algorithm. The Classification between vehicle and pedestrians is estimated using number of characteristic points and distances among characteristic points. The algorithm for classifying cars and pedestrians was implemented and verified using test vehicle equipped with LiDAR sensors. The accuracy of proposed object classification algorithm was about 97%. The classification accuracy was improved by about 13.5% compared with deep learning based algorithm.

Comparison of Characteristic Vector of Speech for Gender Recognition of Male and Female (남녀 성별인식을 위한 음성 특징벡터의 비교)

  • Jeong, Byeong-Goo;Choi, Jae-Seung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.16 no.7
    • /
    • pp.1370-1376
    • /
    • 2012
  • This paper proposes a gender recognition algorithm which classifies a male or female speaker. In this paper, characteristic vectors for the male and female speaker are analyzed, and recognition experiments for the proposed gender recognition by a neural network are performed using these characteristic vectors for the male and female. Input characteristic vectors of the proposed neural network are 10 LPC (Linear Predictive Coding) cepstrum coefficients, 12 LPC cepstrum coefficients, 12 FFT (Fast Fourier Transform) cepstrum coefficients and 1 RMS (Root Mean Square), and 12 LPC cepstrum coefficients and 8 FFT spectrum. The proposed neural network trained by 20-20-2 network are especially used in this experiment, using 12 LPC cepstrum coefficients and 8 FFT spectrum. From the experiment results, the average recognition rates obtained by the gender recognition algorithm is 99.8% for the male speaker and 96.5% for the female speaker.

The Speaker Recognition System using the Pitch Alteration (피치변경을 이용한 화자인식 시스템)

  • Jung JongSoon;Bae MyungJin
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.115-118
    • /
    • 2002
  • Parameters used in a speaker recognition system are desirable expressing speaker's characteristics filly and have in a speech. That is to say, if inter-speaker than intra-speaker variance a big characteristic, it is useful to distinguish between speakers. Also, to make minimum error between speakers, it is required the improved recognition technology as well as the distinguishing characteristics. When we see the result of recent simulation performance, we obtain more exact performance by using dynamic characteristics and constant characteristics by a speaking habit. Therefore we suggest it to solve this problem as followings. The prosodic information is used by a characteristic vector of speech. Characteristics vector generally using in speaker recognition system is a modeling spectrum information and is working for a high performance in non-noise circumstance. However, it is found a problem that characteristic vector is distorted in noise circumstance and it makes a reduction of recognition rate. In this paper, we change pitch line divided by segment which can estimate a dynamic characteristic and it is used as a recognition characteristic. we confirmed that the dynamic characteristic is very robust in noise circumstance with a simulation. We make a decision of acceptance or rejection by comparing test pattern and recognition rate using the proposed algorithm has more improvement than using spectrum and prosodic information. Especially stational recognition rate can be obtained in noise circumstance through the simulation.

  • PDF

The Study of Mobile Robot Self-displacement Recognition Using Stereo Vision (스테레오 비젼을 이용한 이동로봇의 자기-이동변위인식 시스템에 관한 연구)

  • 심성준;고덕현;김규로;이순걸
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 2003.06a
    • /
    • pp.934-937
    • /
    • 2003
  • In this paper, authors use a stereo vision system based on the visual model of human and establish inexpensive method that recognizes moving distance using characteristic points around the robot. With the stereovision. the changes of the coordinate values of the characteristic points that are fixed around the robot are measured. Self-displacement and self-localization recognition system is proposed from coordination reconstruction with those changes. To evaluate the proposed system, several characteristic points that is made with a LED around the robot and two cheap USB PC cameras are used. The mobile robot measures the coordinate value of each characteristic point at its initial position. After moving, the robot measures the coordinate values of the characteristic points those are set at the initial position. The mobile robot compares the changes of these several coordinate values and converts transformation matrix from these coordinate changes. As a matrix of the amount and the direction of moving displacement of the mobile robot, the obtained transformation matrix represents self-displacement and self-localization by the environment.

  • PDF

The Neural-Network Approach to Recognize Defect Pattern in LED Manufacturing

  • Chen, Wen-Chin;Tsai, Chih-Hung;Hsu, Shou-Wen
    • International Journal of Quality Innovation
    • /
    • v.7 no.3
    • /
    • pp.58-69
    • /
    • 2006
  • This paper presents neural network-based recognition system for automatic light emitting diode (LED) inspection. The back-propagation neural network (BPNN) is proposed and tested. The current-voltage (I-V) characteristic data of LED from the inspection process is used for the network training and testing. This study selects 300 random samples as network training and employs 100 samples as network testing. The experimental results show that if the classification work is done well, the accuracy of recognition is 100%, and the testing speed of the proposed recognition system is almost one half faster than the traditional inspection system does. The proposed neural-network approach is successfully demonstrated by real data sets and can be effectively developed as a recognition system for a practical application purpose.

Emotion Recognition using Prosodic Feature Vector and Gaussian Mixture Model (운율 특성 벡터와 가우시안 혼합 모델을 이용한 감정인식)

  • Kwak, Hyun-Suk;Kim, Soo-Hyun;Kwak, Yoon-Keun
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2002.11b
    • /
    • pp.762-766
    • /
    • 2002
  • This paper describes the emotion recognition algorithm using HMM(Hidden Markov Model) method. The relation between the mechanic system and the human has just been unilateral so far. This is the why people don't want to get familiar with multi-service robots of today. If the function of the emotion recognition is granted to the robot system, the concept of the mechanic part will be changed a lot. Pitch and Energy extracted from the human speech are good and important factors to classify the each emotion (neutral, happy, sad and angry etc.), which are called prosodic features. HMM is the powerful and effective theory among several methods to construct the statistical model with characteristic vector which is made up with the mixture of prosodic features

  • PDF

Monophone and Biphone Compuond Unit for Korean Vocabulary Speech Recognition (한국어 어휘 인식을 위한 혼합형 음성 인식 단위)

  • 이기정;이상운;홍재근
    • Journal of the Korea Computer Industry Society
    • /
    • v.2 no.6
    • /
    • pp.867-874
    • /
    • 2001
  • In this paper, considering the pronunciation characteristic of Korean, recognition units which can shorten the recognition time and reflect the coarticulation effect simultaneously are suggested. These units are composed of monophone and hipbone ones. Monophone units are applied to the vowels which represent stable characteristic. Biphones are used to the consonant which vary according to adjacent vowel. In the experiment of word recognition of PBW445 database, the compound units result in comparable recognition accuracy with 57% speed up compared with triphone units and better recognition accuracy with similar speed. In addition, we can reduce the memory size because of fewer units.

  • PDF