Integrated Search | Korea Science

Tobacco Retail License Recognition Based on Dual Attention Mechanism

Shan, Yuxiang;Ren, Qin;Wang, Cheng;Wang, Xiuhui
- Journal of Information Processing Systems
- Vol. 18, No. 4
- pp. 480-488
- 2022
Images of tobacco retail licenses have complex unstructured characteristics, which poses an urgent technical problem for robotic process automation in tobacco marketing. In this paper, a novel recognition approach using a dual attention mechanism is presented to realize automatic recognition and information extraction from such images. First, we use a DenseNet network to extract license information from the input tobacco retail license data. Second, bi-directional long short-term memory is used for encoding and decoding, with a continuous decoder integrating dual attention, to recognize and extract information from tobacco retail license images without segmentation. Finally, several performance experiments were conducted on a large-scale dataset of tobacco retail licenses. The experimental results show that the proposed approach achieves an accuracy of 98.36% on the ZY-LQ dataset, outperforming most existing methods.
https://doi.org/10.3745/JIPS.02.0177

영상객체 spFACS ASM 알고리즘을 적용한 얼굴인식에 관한 연구 (ASM Algorithm Applied to Image Object spFACS Study on Face Recognition)

최병관
- Journal of Korea Society of Digital Industry and Information Management
- Vol. 12, No. 4
- pp. 1-12
- 2016
Digital imaging technology has grown beyond the limits of the multimedia industry into a state-of-the-art IT convergence industry. In the field of smart object recognition in particular, face recognition techniques for phone applications have been actively studied. Recently, face recognition has evolved, by way of object recognition, into intelligent video detection and recognition technology; object detection and recognition techniques are being applied to IP cameras, and image object recognition combined with face recognition is an active research area. In this paper, we first review technology trends and the technical elements required for human-factor technology, and then present a study plan for detecting smiles with image object recognition based on spFACS (Smile Progress Facial Action Coding System). The study scheme has two parts: (1) applying the ASM algorithm and suggesting ways to effectively evaluate psychological research through image objects; and (2) applying the face recognition result to the tooth area, which is detected according to the recognized facial expression, and demonstrating the effect of extracting feature points.
https://doi.org/10.17662/ksdim.2016.12.4.001

Review And Challenges In Speech Recognition (ICCAS 2005)

Ahmed, M.Masroor;Ahmed, Abdul Manan Bin
- Institute of Control, Robotics and Systems: Conference Proceedings
- Institute of Control, Robotics and Systems, ICCAS 2005
- pp. 1705-1709
- 2005
This paper reviews speech recognition and its challenges, taking into account different classes of recognition mode. The recognition mode can be either speaker independent or speaker dependent. The size of the vocabulary and the input mode are two crucial factors for a speech recognizer. The input mode refers to whether the system recognizes continuous or isolated speech, and the vocabulary can be small (fewer than a hundred words) or large (fewer than a few thousand words), depending on system design and objectives [2]. The paper is organized as follows: it first covers various fundamental methods of speech recognition, then considers deficiencies in existing systems, and finally describes probable application areas.

Vehicle Image Recognition Using Deep Convolution Neural Network and Compressed Dictionary Learning

Zhou, Yanyan
- Journal of Information Processing Systems
- Vol. 17, No. 2
- pp. 411-425
- 2021
In this paper, a vehicle recognition algorithm based on a deep convolutional neural network and a compressed dictionary is proposed. First, the network structure for fine-grained vehicle recognition based on a convolutional neural network is introduced. Then, a vehicle recognition system based on a multi-scale pyramid convolutional neural network is constructed. An adaptive fusion method adjusts the contribution of each network to the recognition result, that is, the proportion of its output in the output of the entire multi-scale network, according to the recognition accuracy of that single network. Next, compressed dictionary learning and data dimension reduction are carried out using an effective block structure method combined with a very sparse random projection matrix, which reduces the computational complexity caused by high-dimensional features and shortens the dictionary learning time. Finally, the sparse representation classification method is used to realize vehicle type recognition. The experimental results show that the detection performance of the proposed algorithm is stable in sunny, cloudy, and rainy weather, and that it adapts well to typical application scenarios such as occlusion and blurring, with an average recognition rate of more than 95%.
https://doi.org/10.3745/JIPS.01.0073
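
The adaptive fusion step described in the vehicle recognition abstract above can be illustrated with a minimal sketch. It assumes, hypothetically, that each single-scale network outputs a class-probability vector and that its fusion weight is its own recognition accuracy normalized over all networks; the abstract does not give the paper's actual weighting rule, and the names `fusion_weights`, `fuse`, and `predict` are illustrative only.

```python
def fusion_weights(accuracies):
    """Turn per-network accuracies into weights that sum to 1.

    Assumed rule: weight = accuracy / sum of accuracies.
    """
    total = sum(accuracies)
    if total <= 0:
        raise ValueError("at least one network must have positive accuracy")
    return [a / total for a in accuracies]


def fuse(outputs, accuracies):
    """Weighted sum of the per-network class-probability vectors."""
    weights = fusion_weights(accuracies)
    n_classes = len(outputs[0])
    fused = [0.0] * n_classes
    for w, probs in zip(weights, outputs):
        for k in range(n_classes):
            fused[k] += w * probs[k]
    return fused


def predict(outputs, accuracies):
    """Index of the class with the highest fused score."""
    fused = fuse(outputs, accuracies)
    return max(range(len(fused)), key=fused.__getitem__)
```

With two networks that disagree, for example `outputs = [[0.6, 0.4], [0.2, 0.8]]`, the fused decision follows whichever network has the higher accuracy, which is the behavior the abstract attributes to adaptive fusion.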

필기한글 단어 인식에서 사전정보의 효과 (An effect of dictionary information in the handwritten Hangul word recognition)

김호연;임길택;남윤석
- Institute of Electronics Engineers of Korea: Conference Proceedings
- Institute of Electronics Engineers of Korea, Proceedings of the 1999 Fall Conference
- pp. 1019-1022
- 1999
In this paper, we analyze the effect of a dictionary on handwritten Hangul word recognition in terms of the dictionary's size and the length of the words in it. Our experimental results show that the word recognition rate depends not only on character recognition performance, but also heavily on the amount of information the dictionary contains and on the reduction rate of the dictionary.
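
The dependence of word recognition on dictionary content, as described in the Hangul abstract above, can be shown with a small sketch. It is not the authors' recognizer: it assumes a hypothetical character recognizer that returns, for each position, a mapping from candidate characters to probabilities, and it scores each dictionary word by the sum of log-probabilities. The function name and data layout are illustrative, and Latin letters stand in for Hangul syllables.

```python
import math


def best_dictionary_word(char_probs, dictionary):
    """Pick the dictionary word best supported by per-position character scores.

    char_probs[i] maps candidate characters at position i to probabilities.
    Words of the wrong length, or containing a character with zero
    probability at some position, are rejected. Returns None if no word fits.
    """
    best_word, best_score = None, -math.inf
    for word in dictionary:
        if len(word) != len(char_probs):
            continue
        score = 0.0
        for pos, ch in enumerate(word):
            p = char_probs[pos].get(ch, 0.0)
            if p <= 0.0:
                score = -math.inf
                break
            score += math.log(p)
        if score > best_score:
            best_word, best_score = word, score
    return best_word
```

The same character scores can yield different words depending on which entries the dictionary contains, which is one way to see why the recognition rate depends on the dictionary as well as on character recognition performance.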

모바일 보안을 위한 모바일 폰 영상의 손 생체 정보 인식 시스템 (Hand Biometric Information Recognition System of Mobile Phone Image for Mobile Security)

홍경호;정은화
- Journal of Digital Convergence
- Vol. 12, No. 4
- pp. 319-326
- 2014
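As mobile security has grown in importance, users who have experienced failures of knowledge-based personal authentication with user names and passwords have come to prefer biometric information such as hand geometry, fingerprints, and voice for personal identification and authentication. Using biometric authentication for personal identification and authentication in mobile security therefore gives confidence to both customers and sellers on the Internet. This study develops a system that recognizes hand biometric information, such as hand geometry, palm features, and finger length and width, from mobile phone images taken with an iPhone 4 and a Galaxy S2 for personal identification and authentication. The system consists of six steps: image acquisition, preprocessing, noise removal, standard feature pattern extraction, individual feature pattern extraction, and hand biometric information recognition. The input data used in the experiments comprised 250 hand-geometry and palm images from 50 subjects, on which the average recognition rate was 93.5%.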
https://doi.org/10.14400/JDC.2014.12.4.319

A Robust Method for Partially Occluded Face Recognition

Xu, Wenkai;Lee, Suk-Hwan;Lee, Eung-Joo
- KSII Transactions on Internet and Information Systems (TIIS)
- Vol. 9, No. 7
- pp. 2667-2682
- 2015
Because of its wide application in information security, surveillance, access control, and other areas, face recognition (FR) has received significantly increased attention from both the academic and industrial communities over the past several decades. However, partial face occlusion remains one of the most challenging problems in face recognition. In this paper, a novel method based on the linear regression-based classification (LRC) algorithm is proposed to address this problem. After all images are downsampled and divided into several blocks, we use an evaluator for each block, based on linear regression, to determine the clear blocks of the test face image. The remaining uncontaminated blocks are then used to recognize the partially occluded face. Furthermore, an improved Distance-based Evidence Fusion approach is proposed to decide in favor of the class with the average value of the corresponding minimum distance. Because this occlusion removal process uses simple linear regression, the total computational cost is approximately equal to that of LRC and much lower than that of sparse representation-based classification (SRC) and extended SRC (eSRC). Experimental results on both the AR face database and the extended Yale B face database demonstrate the effectiveness of the proposed method for partially occluded face recognition. Compared with conventional methods (eigenface+NN, fisherfaces+NN) and state-of-the-art methods (LRC, SRC, and eSRC), the proposed method shows better performance and robustness.
https://doi.org/10.3837/tiis.2015.07.019
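
The block-wise decision described in the occluded face recognition abstract above can be sketched as follows. This is one plausible reading, not the paper's Distance-based Evidence Fusion rule: it assumes that each block already has a regression distance to every class and a clean/occluded flag, averages the distances over clean blocks only, and picks the class with the smallest mean. The function name and input layout are hypothetical.

```python
def classify_from_blocks(distances, clean):
    """Decide a class from per-block distances, ignoring occluded blocks.

    distances[b][c]: distance (regression residual) of block b to class c.
    clean[b]: True if block b was judged unoccluded.
    Returns the class index with the smallest mean distance over clean blocks.
    """
    kept = [row for row, ok in zip(distances, clean) if ok]
    if not kept:
        raise ValueError("no clean blocks to decide from")
    n_classes = len(kept[0])
    mean_dist = [
        sum(row[c] for row in kept) / len(kept) for c in range(n_classes)
    ]
    return min(range(n_classes), key=mean_dist.__getitem__)
```

In a case where one occluded block has a very large distance to the true class, dropping that block keeps it from dominating the average, which is the motivation the abstract gives for using only uncontaminated blocks.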

내부 객체 정보를 이용한 온톨로지 기반의 객체 영상 인식 (Ontology-based Object-Image Recognition by Using Information on Inner-Objects)

이인근;서석태;석지권;권순학
- Journal of the Korean Institute of Intelligent Systems
- Vol. 19, No. 6
- pp. 760-765
- 2009
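Features such as color and shape in an object image do not clearly express the characteristics of the object. Such limited feature information therefore causes ambiguity in object-image recognition. Recently, image recognition based on knowledge bases has been studied to reduce this ambiguity. However, images are represented as numerical information while knowledge bases are represented as conceptual information, so combining the two is not easy. In this paper, we construct the knowledge base using an ontology to narrow the gap between the information representations of images and knowledge bases. We also propose an object-image recognition method that reduces ambiguity in the recognition process by using information on inner objects. Finally, we confirm the usefulness of the proposed method through object-image recognition experiments in the fruit domain.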
https://doi.org/10.5391/JKIIS.2009.19.6.760

Ship Number Recognition Method Based on an Improved CRNN Model

Wenqi Xu;Yuesheng Liu;Ziyang Zhong;Yang Chen;Jinfeng Xia;Yunjie Chen
- KSII Transactions on Internet and Information Systems (TIIS)
- Vol. 17, No. 3
- pp. 740-753
- 2023
Text recognition in natural scene images is a challenging problem in computer vision. Accurate identification of ship number characters can effectively improve ship traffic management. However, because of blurring caused by motion and because of text occlusion, the accuracy of ship number recognition often falls short of practical requirements. To solve these problems, this paper proposes a dual-branch network based on the CRNN recognition network. The network couples image restoration and character recognition. A CycleGAN module is used for the blur restoration branch, and a Pix2pix module is used for the character occlusion branch. The two are coupled to reduce the impact of image blur and occlusion. The recovered image is then fed into the text recognition branch to improve recognition accuracy. Extensive experiments show that the model is robust and easy to train. Experiments on CTW datasets and real ship images show that the method produces more accurate results.
https://doi.org/10.3837/tiis.2023.03.004
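
The text recognition branch mentioned in the ship number abstract above builds on CRNN, whose original formulation turns per-frame scores into a string with CTC decoding. The sketch below shows greedy CTC decoding under the assumption that the improved model keeps that output format, which the abstract does not state. Index 0 is assumed to be the blank label, and the function name and alphabet are illustrative.

```python
def ctc_greedy_decode(frame_scores, alphabet):
    """Greedy CTC decoding: best label per frame, collapse repeats, drop blanks.

    frame_scores: one score list per time step; index 0 is the blank label
                  and index k (k >= 1) corresponds to alphabet[k - 1].
    """
    blank = 0
    # Best-scoring label index at each time step.
    best = [max(range(len(f)), key=f.__getitem__) for f in frame_scores]
    chars = []
    prev = blank
    for idx in best:
        # Emit a character only when it is not blank and not a repeat
        # of the label chosen at the previous time step.
        if idx != blank and idx != prev:
            chars.append(alphabet[idx - 1])
        prev = idx
    return "".join(chars)
```

A blank between two identical labels is what allows a doubled character, such as a repeated digit in a ship number, to survive the collapsing step.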

시 공간 정규화를 통한 딥 러닝 기반의 3D 제스처 인식 (Deep Learning Based 3D Gesture Recognition Using Spatio-Temporal Normalization)

채지훈;강수명;김해성;이준재
- Journal of Korea Multimedia Society
- Vol. 21, No. 5
- pp. 626-637
- 2018
Humans exchange information not only through words but also through body and hand gestures, which can be used to build effective interfaces for mobile, virtual reality, and augmented reality applications. Past 2D gesture recognition research suffered from information loss caused by projecting 3D information into 2D. Because the recognition range in 3D space is larger than in 2D, however, the complexity of gesture recognition increases. In this paper, we propose a real-time deep learning model for gesture recognition in 3D space, together with an application. First, to recognize gestures in 3D space, data are constructed and collected using the Unity game engine. Second, input vectors are normalized for training the deep learning-based 3D gesture recognition model. Third, the SELU (Scaled Exponential Linear Unit) function is used as the neural network's activation function for faster learning and better recognition performance. The proposed system is expected to be applicable to various fields such as rehabilitation care, game applications, and virtual reality.
https://doi.org/10.9717/kmms.2018.21.5.626
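
The SELU activation named in the gesture recognition abstract above has a standard closed form, sketched here in plain Python. The constants are the usual published values for SELU; how the authors' network applies the function is not described in the abstract, so this shows only the function itself.

```python
import math

# Standard SELU constants (alpha and scale).
SELU_ALPHA = 1.6732632423543772
SELU_SCALE = 1.0507009873554805


def selu(x):
    """Scaled Exponential Linear Unit.

    scale * x                      for x > 0
    scale * alpha * (exp(x) - 1)   for x <= 0
    """
    if x > 0:
        return SELU_SCALE * x
    return SELU_SCALE * SELU_ALPHA * (math.exp(x) - 1.0)
```

For large negative inputs the output saturates near `-SELU_SCALE * SELU_ALPHA`, while positive inputs are scaled linearly.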
