• Title/Summary/Keyword: OCR Technology

Search Result 130, Processing Time 0.025 seconds

Vehicle Identification Number Recognition using Edge Projection and PCA (에지 투영과 PCA를 이용한 차대 번호 인식)

  • Ahn, In-Mo;Ha, Jong-Eun
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.17 no.5
    • /
    • pp.479-483
    • /
    • 2011
  • The automation of production process is actively expanding for the purpose of the cost reduction and quality assurance. Among these, automatic tracking of the product along the whole process of the production is also important topic. Typically this is done by adopting OCR technology. Conventional OCR technology operates well on the rather good quality of the image like as printed characters on the paper. In industrial application, IDs are marked on the metal surface, and this cause the height difference between background material and character. Illumination systems that guarantee an image with good quality may be a solution, but it is rather difficult to design such an illumination system. This paper proposes an algorithm for the recognition of vehicle's ID characters using edge projection and PCA (Principal Component Analysis). Proposed algorithm robustly operates under illumination change using the same parameters. Experimental results show the feasibility of the proposed algorithm.

A Study on Construction of Technical Reports Management System Using Optical Technology (광기술을 이용한 연구보고서 관리시스템 구축)

  • 이상헌;김익철
    • Journal of the Korean Society for information Management
    • /
    • v.9 no.1
    • /
    • pp.131-164
    • /
    • 1992
  • In this study. a technical report management system using optical technology is described in detail. This management system is designed for both bibliographic (character) and full-text (image) information. Several optical filing systems already on the Korean market are scrutinized and compared with standard functions in order to build a more efficient management system for technical reports which can be easily integrated into existing KRISS library automation system. For that purpose, up-to-date technologies (i.e., digital image PI-ocessing (DIP), MARC standards, and optical character recognition (OCR), etc.) are applied to this system.

  • PDF

Effect of Yellow Clay on the Oxygen Consumption Rate of Korean rockfish, Sebastes schlegelii

  • Lee, Chang-Kyu;Kim, Wan-Soo;Park, Young-Tae;Jo, Q-Tae
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.19 no.3
    • /
    • pp.241-247
    • /
    • 2013
  • Yellow clay dispersion has been applied to minimize fisheries impact by the red tide Cochlodinium polykrikoides blooms in Korean coasts since 1995. The present preliminary study documents the effect of yellow clay on Korean rockfish, Sebastes schlegelii, in terms of oxygen consumption rate (OCR). The OCR in the low clay suspension (0.05 and 0.23 %, w/w) showed normal level compared to the control. In contrast, the OCR for each one of three replicates in the high clay suspension (1.16 and 5.58 %, w/w) was not returned to the previous level that clay was not treated, indicating that high clay suspension (${\geq}1.16%$, w/w) might give negative effect on Korean rockfish. Overall, this result suggests that field application of clay to control Harmful Algal Blooms (HABs) may not give impact on Korean rockfish once the clay is dispersed in a low concentration (${\leq}0.23%$). In order to understand the changes of OCR in the repeated exposure to clay, it is required to do further studies on the changes of OCR when the fish is exposed to clay repeatedly after recovery in the normal seawater.

An Autonomous Optimal Coordination Scheme in a Protection System of a Power Distribution Network by using a Multi-Agent Concept

  • Hyun, Seung-Ho;Min, Byung-Woon;Jung, Kwang-Ho;Lee, Seung-Jae;Park, Myeon-Song;Kang, Sang-Hee
    • KIEE International Transactions on Power Engineering
    • /
    • v.2A no.3
    • /
    • pp.89-94
    • /
    • 2002
  • In this paper, a protection system using a Multi-Agent concept for power distribution networks is proposed. Every digital over current relay(OCR) is developed as an agent by adding its own intelligence, self-tuning and communication ability. The main advantage of the Multi-Agent concept is that a group of agents work together to achieve a global goal which is beyond the ability of each individual agent. In order to cope with frequent changes in the network operation condition and faults, an OCR agent, suggested in this paper, is able to detect a fault or a change in the network and find its optimal parameters for protection in an autonomous manner considering information of the whole network obtained by communication between other agents. Through this kind of coordination and information exchanges, not only a local but also a global protective scheme is completed. Simulations in a simple distribution network show the effectiveness of the suggested protection system.

Building Database using Character Recognition Technology (문자 인식 기술을 이용한 데이터베이스 구축)

  • Han, Seon-Hwa;Lee, Chung-Sik;Lee, Jun-Ho;Kim, Jin-Hyeong
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.7
    • /
    • pp.1713-1723
    • /
    • 1999
  • Optical character recognition(OCR) might be the most plausible method in building database out of printed matters. This paper describes the points to be considered when one selects an OCR system in order to build database. Based on the considerations, we evaluated four commercial OCR systems, and chose one which shows the best recognition rate to build OCT-text database. The subject text, the KT-test collection, is a set of abstracts from proceedings of different printing quality, fonts, and formats. KT-test collection is also provided with typed text database. Recognition rate was calculated by comparing the recognition result with the typed text. No preprocessing such as learning and slant correction was applied to the recognition process in order to simulate a practical environment. The result shows 90.5% of character recognition rate over 970 abstracts. This recognition rate is still insufficient for practical use. The errors in OCR texts are different from those of manually typed texts. In this paper, we classify the errors in OCR texts for the further research.

  • PDF

Spam Image Detection Model based on Deep Learning for Improving Spam Filter

  • Seong-Guk Nam;Dong-Gun Lee;Yeong-Seok Seo
    • Journal of Information Processing Systems
    • /
    • v.19 no.3
    • /
    • pp.289-301
    • /
    • 2023
  • Due to the development and dissemination of modern technology, anyone can easily communicate using services such as social network service (SNS) through a personal computer (PC) or smartphone. The development of these technologies has caused many beneficial effects. At the same time, bad effects also occurred, one of which was the spam problem. Spam refers to unwanted or rejected information received by unspecified users. The continuous exposure of such information to service users creates inconvenience in the user's use of the service, and if filtering is not performed correctly, the quality of service deteriorates. Recently, spammers are creating more malicious spam by distorting the image of spam text so that optical character recognition (OCR)-based spam filters cannot easily detect it. Fortunately, the level of transformation of image spam circulated on social media is not serious yet. However, in the mail system, spammers (the person who sends spam) showed various modifications to the spam image for neutralizing OCR, and therefore, the same situation can happen with spam images on social media. Spammers have been shown to interfere with OCR reading through geometric transformations such as image distortion, noise addition, and blurring. Various techniques have been studied to filter image spam, but at the same time, methods of interfering with image spam identification using obfuscated images are also continuously developing. In this paper, we propose a deep learning-based spam image detection model to improve the existing OCR-based spam image detection performance and compensate for vulnerabilities. The proposed model extracts text features and image features from the image using four sub-models. First, the OCR-based text model extracts the text-related features, whether the image contains spam words, and the word embedding vector from the input image. Then, the convolution neural network-based image model extracts image obfuscation and image feature vectors from the input image. The extracted feature is determined whether it is a spam image by the final spam image classifier. As a result of evaluating the F1-score of the proposed model, the performance was about 14 points higher than the OCR-based spam image detection performance.

Digital Library and Information Management (디지털 도서관(圖書館)과 정보관리)

  • Kim, Soon-Ja
    • Journal of Information Management
    • /
    • v.26 no.1
    • /
    • pp.16-51
    • /
    • 1995
  • Information management area faced new challenge arised from the developments of the computer and the information network, and the advent of information super highway. With deep perception of importance of the information, developments of information technologies, and change of the users' environment, we came to envision the digital library. This paper intends to describe the concept and function of the digital library, and to examine some of information technologies such as CD-ROM, OCR technology and image scanning, hypertext, hypermedia and multimedia. And it also considers the strategies for electronic information services and the applicability of the current information technology for digitalization by case studies of the existing database systems.

  • PDF

Recognition of Characters Printed on PCB Components Using Deep Neural Networks (심층신경망을 이용한 PCB 부품의 인쇄문자 인식)

  • Cho, Tai-Hoon
    • Journal of the Semiconductor & Display Technology
    • /
    • v.20 no.3
    • /
    • pp.6-10
    • /
    • 2021
  • Recognition of characters printed or marked on the PCB components from images captured using cameras is an important task in PCB components inspection systems. Previous optical character recognition (OCR) of PCB components typically consists of two stages: character segmentation and classification of each segmented character. However, character segmentation often fails due to corrupted characters, low image contrast, etc. Thus, OCR without character segmentation is desirable and increasingly used via deep neural networks. Typical implementation based on deep neural nets without character segmentation includes convolutional neural network followed by recurrent neural network (RNN). However, one disadvantage of this approach is slow execution due to RNN layers. LPRNet is a segmentation-free character recognition network with excellent accuracy proved in license plate recognition. LPRNet uses a wide convolution instead of RNN, thus enabling fast inference. In this paper, LPRNet was adapted for recognizing characters printed on PCB components with fast execution and high accuracy. Initial training with synthetic images followed by fine-tuning on real text images yielded accurate recognition. This net can be further optimized on Intel CPU using OpenVINO tool kit. The optimized version of the network can be run in real-time faster than even GPU.

Patent Document Similarity Based on Image Analysis Using the SIFT-Algorithm and OCR-Text

  • Park, Jeong Beom;Mandl, Thomas;Kim, Do Wan
    • International Journal of Contents
    • /
    • v.13 no.4
    • /
    • pp.70-79
    • /
    • 2017
  • Images are an important element in patents and many experts use images to analyze a patent or to check differences between patents. However, there is little research on image analysis for patents partly because image processing is an advanced technology and typically patent images consist of visual parts as well as of text and numbers. This study suggests two methods for using image processing; the Scale Invariant Feature Transform(SIFT) algorithm and Optical Character Recognition(OCR). The first method which works with SIFT uses image feature points. Through feature matching, it can be applied to calculate the similarity between documents containing these images. And in the second method, OCR is used to extract text from the images. By using numbers which are extracted from an image, it is possible to extract the corresponding related text within the text passages. Subsequently, document similarity can be calculated based on the extracted text. Through comparing the suggested methods and an existing method based only on text for calculating the similarity, the feasibility is achieved. Additionally, the correlation between both the similarity measures is low which shows that they capture different aspects of the patent content.

A Study on communication method between OverCurrent Relay Agents using DNP 3.0 (DNP3.0을 이용한 과전류 계전기 에이전트의 통신 방안 연구)

  • Lee, H.W.;Jung, K.H.;Lim, S.I.;Hyun, S.H.;Choi, M.S.;Lee, S.J.
    • Proceedings of the KIEE Conference
    • /
    • 2002.11b
    • /
    • pp.265-267
    • /
    • 2002
  • In this paper, the communication between Over-Current Relay agents is realized using DNP(Distributed Network Protocol), which is the standard communication protocol of distribution automation system in KEPKO. The key words in OCR agent communication are defined and represented by use of DNP application function code. And the DNP index for OCR agent is defined. The proposed communication scheme is tested by use of Communication Test Harness, a test tool for DNP protocol to show its soundness.

  • PDF