• Title/Summary/Keyword: character classifier

Search Result 49, Processing Time 0.026 seconds

Front Classification using Back Propagation Algorithm (오류 역전파 알고리즘을 이용한 영문자의 폰트 분류 방법에 관한 연구)

  • Jung Minchul
    • Journal of Intelligence and Information Systems
    • /
    • v.10 no.2
    • /
    • pp.65-77
    • /
    • 2004
  • This paper presents a priori and the local font classification method. The font classification uses ascenders, descenders, and serifs extracted from a word image. The gradient features of those sub-images are extracted, and used as an input to a neural network classifier to produce font classification results. The font classification determines 2 font styles (upright or slant), 3 font groups (serif sans-serif or typewriter), and 7-font names (Postscript fonts such as Avant Garde, Helvetica, Bookman, New Century Schoolbook, Palatine, Times, and Courier). The proposed a priori and local font classification method allows an OCR system consisting of various font-specific character segmentation tools and various mono-font character recognizers. Experiments have shown font classification accuracies reach high performance levels of about 95.4 percent even with severely touching characters. The technique developed for tile selected 7 fonts in this paper can be applied to any other fonts.

  • PDF

Recognition of Korean Text in Outdoor Signboard Images Using Directional Feature and Fisher Measure (방향성분 특징과 Fisher Measure를 이용한 간판영상 한글인식)

  • Lim, Jun-Sik;Kim, Soo-Hyung;Lee, Guee-Sang;Yang, Hyung-Jung;Lee, Myung-Eun
    • The KIPS Transactions:PartB
    • /
    • v.16B no.3
    • /
    • pp.239-246
    • /
    • 2009
  • In this paper, we propose a Korean character recognition method from outboard signboard images. We have chosen 808 classes of Korean characters by an analysis of frequencies of appearance in a dictionary of signboard names. The proposed method mainly consists of three steps: feature extraction, rough classification, and coarse classification. The first step is to extract a nonlinear directional segments feature, which is immune to the distortion of character shapes. The second step computes an ordered set of 10 recognition candidates using a minimum distance classifier. The last step reorders the recognition candidates using a Fisher discriminant measure. As experimental results, the recognition accuracy is 80.45% for the first choice, and 93.51% for the top five choices.

Character Recognition of Vehicle Number Plate using Modular Neural Network (모듈라 신경망을 이용한 자동차 번호판 문자인식)

  • Park, Chang-Seok;Kim, Byeong-Man;Seo, Byung-Hoon;Lee, Kwang-Ho
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.13 no.4
    • /
    • pp.409-415
    • /
    • 2003
  • Recently, the modular learning are very popular and receive much attention for pattern classification. The modular learning method based on the "divide and conquer" strategy can not only solve the complex problems, but also reach a better result than a single classifier′s on the learning quality and speed. In the neural network area, some researches that take the modular learning approach also have been made to improve classification performance. In this paper, we propose a simple modular neural network for characters recognition of vehicle number plate and evaluate its performance on the clustering methods of feature vectors used in constructing subnetworks. We implement two clustering method, one is grouping similar feature vectors by K-means clustering algorithm, the other grouping unsimilar feature vectors by our proposed algorithm. The experiment result shows that our algorithm achieves much better performance.

Optical Recognition of Credit Card Numbers (신용카드 번호의 광학적 인식)

  • Jung, Min Chul
    • Journal of the Semiconductor & Display Technology
    • /
    • v.13 no.1
    • /
    • pp.57-62
    • /
    • 2014
  • This paper proposes a new optical recognition method of credit card numbers. Firstly, the proposed method segments numbers from the input image of a credit card. It uses the significant differences of standard deviations between the foreground numbers and the background. Secondly, the method extracts gradient features from the segmented numbers. The gradient features are defined as four directions of grayscale pixels for 16 regions of an input number. Finally, it utilizes an artificial neural network classifier that uses an error back-propagation algorithm. The proposed method is implemented using C language in an embedded Linux system for a high-speed real-time image processing. Experiments were conducted by using real credit card images. The results show that the proposed algorithm is quite successful for most credit cards. However, the method fails in some credit cards with strong background patterns.

A Dataset of Online Handwritten Assamese Characters

  • Baruah, Udayan;Hazarika, Shyamanta M.
    • Journal of Information Processing Systems
    • /
    • v.11 no.3
    • /
    • pp.325-341
    • /
    • 2015
  • This paper describes the Tezpur University dataset of online handwritten Assamese characters. The online data acquisition process involves the capturing of data as the text is written on a digitizer with an electronic pen. A sensor picks up the pen-tip movements, as well as pen-up/pen-down switching. The dataset contains 8,235 isolated online handwritten Assamese characters. Preliminary results on the classification of online handwritten Assamese characters using the above dataset are presented in this paper. The use of the support vector machine classifier and the classification accuracy for three different feature vectors are explored in our research.

An Efficient Block Segmentation and Classification Method for Document Image Analysis Using SGLDM and BP (공간의존행렬과 신경망을 이용한 문서영상의 효과적인 블록분할과 유형분류)

  • Kim, Jung-Su;Lee, Jeong-Hwan;Choe, Heung-Mun
    • The Transactions of the Korea Information Processing Society
    • /
    • v.2 no.6
    • /
    • pp.937-946
    • /
    • 1995
  • We proposed and efficient block segmentation and classification method for the document analysis using SGLDM(spatial gray level dependence matrix) and BP (back Propagation) neural network. Seven texture features are extracted directly from the SGLDM of each gray-level block image, and by using the nonlinear classifier of neural network BP, we can classify document blocks into 9 categories. The proposed method classifies the equation block, the table block and the flow chart block, which are mostly composed of the characters, out of the blocks that are conventionally classified as non-character blocks. By applying Sobel operator on the gray-level document image beforebinarization, we can reduce the effect of the background noises, and by using the additional horizontal-vertical smoothing as well as the vertical-horizontal smoothing of images, we can obtain an effective block segmentation that does not lead to the segmentation into small pieces. The result of experiment shows that a document can be segmented and classified into the character blocks of large fonts, small fonts, the character recognigible candidates of tables, flow charts, equations, and the non-character blocks of photos, figures, and graphs.

  • PDF

Font Classification of English Printed Character using Non-negative Matrix Factorization (NMF를 이용한 영문자 활자체 폰트 분류)

  • Lee, Chang-Woo;Kang, Hyun;Jung, Kee-Chul;Kim, Hang-Joon
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.41 no.2
    • /
    • pp.65-76
    • /
    • 2004
  • Today, most documents are electronically produced and their paleography is digitalized by imaging, resulting in a tremendous number of electronic documents in the shape of images. Therefore, to process these document images, many methods of document structure analysis and recognition have already been proposed, including font classification. Accordingly, the current paper proposes a font classification method for document images that uses non-negative matrix factorization (NMF), which is able to learn part-based representations of objects. In the proposed method, spatially total features of font images are automatically extracted using NMF, then the appropriateness of the features specifying each font is investigated. The proposed method is expected to improve the performance of optical character recognition (OCR), document indexing, and retrieval systems, when such systems adopt a font classifier as a preprocessor.

A Study On The Improvement Of Vehicle Plate Recognition (차량 번호판 인식 효율 향상을 위한 연구)

  • Kong, Yong-Hae;Kwon, Chun-Ki;Kim, Myung-Sook
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.10 no.8
    • /
    • pp.1947-1954
    • /
    • 2009
  • Camera-captured car plate images contain much variation and noise and the character images in a plate are typically very small. We attempted to improve the plate identification efficiency suitable for this undesirable condition. We experimented various image preprocessing and feature extracting methods and the very effective features that can compensate one feature's limitation is determined through extensive experiments. Finally two very effective features that can complement the limitations of each other feature(classifier) are determined and the efficiency is proved by recognition experiments. This approach is very necessary when handling plate character images which are typically small, various, and noisy. Individual classification result, confidence factor, region name relation and feedback verification are comprehensively considered to enhance the overall recognition efficiency. The efficiency of our method is verified by a recognition experiment using real car plate images taken from traffic roads.

Real-Time Vehicle License Plate Detection Based on Background Subtraction and Cascade of Boosted Classifiers

  • Sarker, Md. Mostafa Kamal;Song, Moon Kyou
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.39C no.10
    • /
    • pp.909-919
    • /
    • 2014
  • License plate (LP) detection is the most imperative part of an automatic LP recognition (LPR) system. Typical LPR contains two steps, namely LP detection (LPD) and character recognition. In this paper, we propose an efficient Vehicle-to-LP detection framework which combines with an adaptive GMM (Gaussian Mixture Model) and a cascade of boosted classifiers to make a faster vehicle LP detector. To develop a background model by using a GMM is possible in the circumstance of a fixed camera and extracts the motions using background subtraction. Firstly, an adaptive GMM is used to find the region of interest (ROI) on which motion detectors are running to detect the vehicle area as blobs ROIs. Secondly, a cascade of boosted classifiers is executed on the blobs ROIs to detect a LP. The experimental results on our test video with the resolution of $720{\times}576$ show that the LPD rate of the proposed system is 99.14% and the average computational time is approximately 42ms.

A Feature Selection Technique for Multi-lingual Character Recognition (TV 제어 메뉴의 다국적 언어 인식을 위한 특징 선정 기법)

  • Kang, Keun-Seok;Park, Hyun-Jung;Kim, Ho-Joon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2005.11a
    • /
    • pp.199-202
    • /
    • 2005
  • TV OSD(On Screen Display) 메뉴 자동검증 시스템에서 다국적 언어의 문자 인식은 표준패턴의 구조적 분석이 쉽지 않을 뿐만 아니라 학습패턴 집합의 규모와 특징의 수가 증가함으로 인하여 특징추출 및 인식 과정에서 방대한 계산량이 요구된다. 이에 본 연구에서는 학습 데이터에 포함되는 다량의 특징 집합으로부터 인식에 필요한 효과적인 특징을 선별함으로써 패턴 분류기의 효율성을 개선하기 위한 방법론을 고찰한다. 이를 위하여 수정된 형태의 Adaboost 기법을 제안하고 이를 적용한 실험 결과로부터 그 유용성을 고찰한다. 제안된 알고리즘은 초기의 특징 집합을 취약한 성능을 갖는 다수의 분류기(classifier)로서 고려하며, 이로부터 반복학습을 통하여 개선된 분류기를 점진적으로 선별해 나가게 된다. 학습의 원리는 주어진 학습패턴 집합에 기초하여 일종의 교사학습(supervised learning) 방식으로 이루어진다. 각 패턴에 할당된 가중치 값은 각 단계에서 산출되는 분류결과에 따라 적응적으로 수정되어 반복학습이 진행됨에 따라 점차 보완적 성능을 갖는 분류기를 선택할 수 있게 한다. 즉, 주어진 각 학습패턴에 대하여 초기에 균등한 가중치가 부여되며, 반복학습의 각 단계에서 적용되는 분류기의 출력을 분석하여 오분류된 패턴의 가중치 분포를 증가시켜 나간다. 본 연구에서는 실제 응용으로서 OSD 메뉴검증 시스템을 대상으로 제안된 이론을 적용하고 그 타당성을 평가한다.

  • PDF