• Title/Summary/Keyword: recognition-rate

Search Result 2,809, Processing Time 0.026 seconds

HMM with Global Path constraint in Viterbi Decoding for Insolated Word Recognition (전체 경로 제한 조건을 갖는 HMM을 이용한 단독음 인식)

  • Kim, Weon-Goo;Ahn, Dong-Soon;Youn, Dae-Hee
    • The Journal of the Acoustical Society of Korea
    • /
    • v.13 no.1E
    • /
    • pp.11-19
    • /
    • 1994
  • Hidden Markov Models (HMM's) with explicit state duration density (HMM/SD) can represent the time-varying characteristics of speech signals more accurately. However, such an advantage is reduced in relatively smooth state duration densities or ling bounded duration. To solve this problem, we propose HMM's with global path constraint (HMM/GPC) where the transition between states occur only within prescribed time slots. HMM/GPC explicitly limits state durations and accurately describes the temproal structure of speech simply and efficiently. HMM's formed by combining HMM/GPC with HMM/SD are also presented (HMM/SD+GPC) and performances are compared. HMM/GPC can be implemented with slight modifications to the conventional Viterbi algorithm. HMM/GPC and HMM/SD_GPC not only show superior performance than the conventional HMM and HMM/SD but also require much less computation. In the speaket independent isolated word recognition experiments, the minimum recognition eror rate of HMM/GPC(1.6%) is 1.1% lower than the conventional HMM's and the required computation decreased about 57%.

  • PDF

Presentation control of a computer using hand motion identification rules (손동작 식별 규칙을 이용한 컴퓨터의 프레젠테이션 제어)

  • Lee, Kyu-Won
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.22 no.9
    • /
    • pp.1172-1178
    • /
    • 2018
  • A system that control computer presentations by using the hand motion recognition and identification is proposed. The system recognizes and identifies various types of motion in hand motion, controlls the presentation without additional control devices. To recognize hand movements, it performs a face and hand region detection. Facial area is detected using Haar classifier and hand region is extracted according to skin color information on HSV color model. The face area is used to determine the beginning and end of hand gestures, the size and direction of motion. It recognizes various hand gestures and uses them to control computer presentations according to the hand motion identification rules that are proposed and set horizontal and vertical axes from the face area. It is confirmed that 97.2% recognition rate is obtained in about 1200 hand motion recognition experiments and the proposed algorithm is valid in presentation control.

Gesture Recognition Using Stereo Tracking Initiator and HMM for Tele-Operation (스테레오 영상 추적 자동초기화와 HMM을 이용한 원격 작업용 제스처 인식)

  • Jeong, Ji-Won;Lee, Yong-Beom;Jin, Seong-Il
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.8
    • /
    • pp.2262-2270
    • /
    • 1999
  • In this paper, we describe gesture recognition algorithm using computer vision sensor and HMM. The automatic hand region extraction has been proposed for initializing the tracking of the tele-operation gestures. For this, distance informations(disparity map) as results of stereo matching of initial left and right images are employed to isolate the hand region from a scene. PDOE(positive difference of edges) feature images adapted here have been found to be robust against noise and background brightness. The KNU/KAERI(K/K) gesture instruction set is defined for tele-operation in atomic electric power stations. The composite recognition model constructed by concatenating three gesture instruction models including pre-orders, basic orders, and post-orders has been proposed and identified by discrete HMM. Our experimental results showed that consecutive orders composed of more than two ones are correctly recognized at the rate of above 97%.

  • PDF

A Word Dictionary Structure for the Postprocessing of Hangul Recognition (한글인식 후처리용 단어사전의 기억구조)

  • ;Yoshinao Aoki
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.19 no.9
    • /
    • pp.1702-1709
    • /
    • 1994
  • In the postprocessing of Hangul recognition system, the storage structure of contextual information is an important matter for the recognition rate and speed of the entire system. Trie in general is used to represent the context as word dictionary, but the memory space efficiency of the structure is low. Therefore we propose a new structure for word dictionary that has better space efficiency and the equivalent merits of trie. Because Hangul is a compound language, the language can be represented by phonemes or by characters. In the representation by phonemes(P-mode) the retrieval is fast, but the space efficiency is low. In the representation by characters(C-mode) the space efficiency is high, but the retrieval is slow. In this paper the two representation methods are combined to form a hybrid representation(H-mode). At first an optimal level for the combination is selected by two characteristic curves of node utilization and dispersion. Then the input words are represented with trie structure by P-mode from the first to the optimal level, and the rest are represented with sequentially linked list structure by C-mode. The experimental results for the six kinds of word set show that the proposed structure is more efficient. This result is based on the fact that the retrieval for H-mode is as fast as P-mode and the space efficiency is as good as C-mode.

  • PDF

Survey of Recognition of Trauma and Trauma Care System (외상 및 외상진료체계의 인식도 조사)

  • Chung, Il Yong;Kim, Joongsuck;Kim, Yeongcheol;Kim, Seongyup
    • Journal of Trauma and Injury
    • /
    • v.27 no.4
    • /
    • pp.165-169
    • /
    • 2014
  • Purpose: Trauma is one of the most common and lethal causes of death in Korea, especially in people under the age of 40. However, a considerable percentage of trauma patients are lost each year due to the scarce resources of the trauma system. The purpose of this study was to determine the recognition of trauma and trauma system. Methods: From April 8th to 22nd, 2014, visitors and in-patients in our medical center were interviewed and surveyed with a questionnaire, which included 28 questions regarding the trauma system, such as the most common cause of death, the locations of trauma centers, the importance of trauma centers, and consent for supporting trauma centers financially. Results: The majority of the respondents recognized trauma as a common cause of death; this was particularly true for people younger than 40. Most respondents' expectancy for the optimal time for trauma patient transport was high, recognizing that major trauma patients should receive urgent care. The respondents felt that trauma centers are important and needed, just as much as police stations and libraries are. Among 178 respondents, 140 (80.5%) were willing to financially support the trauma system. Conclusion: The respondents were aware of the seriousness of trauma and generally agreed on the need for trauma centers. In order to meet the needs and the demands of the people, and to reduce preventable death rate, the trauma system should be improved not only in quality but also in quantity with better and more facilities and manpower, with the aid of publicity from trauma organizations and funding from the government.

A Study on Space Recognition Change of the High School Students according to Geographic Information Quantities - Focused on Factors Influencing the Land Value - (지리 정보량에 따른 고등학생의 공간 인식 변화에 대한 연구 - 지가 형성 요인을 중심으로 -)

  • Shin, Yeong-Jae
    • Journal of the Korean association of regional geographers
    • /
    • v.17 no.4
    • /
    • pp.443-458
    • /
    • 2011
  • The purpose of this study is to research space recognition change of the high school students according to geographic information quantities, focused on Factors Influencing the Land Value. The region of case study was some places of Songtan Special Tourism Zone, which responding students were unfamiliar with. The results are as follows. First, through the results of analysing 'the highest valued standard land and choice reasons' in two ㄴregions of the old town and the new town, it is perceived that the relative factor of land is more important than the absolute factor as the factors influencing the land value of the highest valued standard land. Second, there are students' recognition differences in the choice reasons of the highest valued standard land of two regions which have different characteristics. Third, though in the same region, recognitions about factors influencing the land value change according to geographic information quantities, and as students' knowledge about geographic information increases, the choice rate of the highest valued standard land increases. Lastly, it is perceived that there is a facility which has a decisive effect on formating the land value of a certain region.

  • PDF

Road Lane and Vehicle Distance Recognition using Real-time Analysis of Camera Images (카메라 영상의 실시간 분석에 의한 차선 및 차간 인식)

  • Kang, Moon-Seol;Kim, Yu-Sin
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.16 no.12
    • /
    • pp.2665-2674
    • /
    • 2012
  • This paper propose the method to recognize the lanes and distance between cars in real-time which detects dangerous situations and helps safe driving in the actual road environment. First of all, it extracts the area of interest corresponding to roads and cars from the road image photographed by using the forward-looking camera. Through the hough transform for the area of interest, this study detects linear components and also selects the lane and conducts filtering by calculating probability. And through the shadow threshold analysis of the cars in front within the area of interest, it extracts the objects of cars in front and calculates the distance from cars in front. According to the result of applying the suggested technology to recognize the lane and distance between cars to the road situation for testing, it showed over 95% recognition rate; thus, it has been proved that it can respond to safe driving.

A Novel Least Square and Image Rotation based Method for Solving the Inclination Problem of License Plate in Its Camera Captured Image

  • Wu, ChangCheng;Zhang, Hao;Hua, JiaFeng;Hua, Sha;Zhang, YanYi;Lu, XiaoMing;Tang, YiChen
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.12
    • /
    • pp.5990-6008
    • /
    • 2019
  • Recognizing license plate from its traffic camera captured images is one of the most important aspects in many traffic management systems. Despite many sophisticated license plate recognition related algorithms available online, license plate recognition is still a hot research issue because license plates in each country all round the world lack of uniform format and their camera captured images are often affected by multiple adverse factors, such as low resolution, poor illumination effects, installation problem etc. A novel method is proposed in this paper to solve the inclination problem of license plates in their camera captured images through four parts: Firstly, special edge pixels of license plate are chosen to represent main information of license plates. Secondly, least square methods are used to compute the inclined angle of license plates. Then, coordinate rotation methods are used to rotate the license plate. At last, bilinear interpolation methods are used to improve the performance of license plate rotation. Several experimental results demonstrated that our proposed method can solve the inclination problem about license plate in visual aspect and can improve the recognition rate when used as the image preprocessing method.

Study on Vehicle Haptic-Seat for the Driving Information Transfer to Driver for the Elderly (고령운전자 운전정보전달을 위한 차량용 햅틱시트 연구)

  • Oh, S.Y.;Kim, K.T.;Yu, C.H.;Kwon, T.K.
    • Journal of rehabilitation welfare engineering & assistive technology
    • /
    • v.8 no.3
    • /
    • pp.151-160
    • /
    • 2014
  • In this study, the effect of the automotive haptic-seat technology which can transmit the driving information by the vibro-stimulus from the seat was investigated to overcome previous system's limitation relied on the visual and audial method and to help handicap driving. A prototype haptic seat cover with 30 coin-type motors and driver module were developed for this sake. In an experiment of seat vibration stimulation being performed under virtual driving situation by targeting the elderly aged over 65 years old, average score of test subjects for total vibration recognition was 3.5/4 points and recognition rate of 87.5% was represented. In addition, a result that all the test subjects totally recognized overspeed warning signal of 4 times was represented. As a result of statistical analysis for vibration recognition score by each group depending on TMT score, a significant difference was not found and a result that tactile function of which vibration is recognized even by the aged whose visual, perceptional function is declined showed an equal ability was obtained.. In this study it was shown that the seat vibration stimulus could be used to transfer the old drivers' information while driving.

  • PDF

Utilization of Syllabic Nuclei Location in Korean Speech Segmentation into Phonemic Units (음절핵의 위치정보를 이용한 우리말의 음소경계 추출)

  • 신옥근
    • The Journal of the Acoustical Society of Korea
    • /
    • v.19 no.5
    • /
    • pp.13-19
    • /
    • 2000
  • The blind segmentation method, which segments input speech data into recognition unit without any prior knowledge, plays an important role in continuous speech recognition system and corpus generation. As no prior knowledge is required, this method is rather simple to implement, but in general, it suffers from bad performance when compared to the knowledge-based segmentation method. In this paper, we introduce a method to improve the performance of a blind segmentation of Korean continuous speech by postprocessing the segment boundaries obtained from the blind segmentation. In the preprocessing stage, the candidate boundaries are extracted by a clustering technique based on the GLR(generalized likelihood ratio) distance measure. In the postprocessing stage, the final phoneme boundaries are selected from the candidates by utilizing a simple a priori knowledge on the syllabic structure of Korean, i.e., the maximum number of phonemes between any consecutive nuclei is limited. The experimental result was rather promising : the proposed method yields 25% reduction of insertion error rate compared that of the blind segmentation alone.

  • PDF