• Title/Summary/Keyword: Character Extraction

Search Result 304, Processing Time 0.024 seconds

A Study on Extraction of Character String in Document Image Using Morphology (Morphology를 이용한 문서화상내의 문자열 추출에 관한 연구)

  • 장희돈;김동현;김석태;남궁재찬
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.18 no.1
    • /
    • pp.123-132
    • /
    • 1993
  • This paper presents the segmentation of sentence area and diagram area from docwnent image. For extracting the sentence area, we perform the Dilation, basic operation of Morphology, to the document image and obtain the smeared document image. After the smeared docwnent image is blocked, we determine the writing form by the vertical and horizontal characteristics of the document image and calculate the skew from it. And then, we relocate the document image and extract the chatacter string from the relocated docwnent. 11 document images of three classes are considered and the character string has been well extracting from 11 document images.

  • PDF

Image Comparison Using Directional Expansion Operation

  • Yoo, Suk Won
    • International Journal of Advanced Culture Technology
    • /
    • v.6 no.3
    • /
    • pp.173-177
    • /
    • 2018
  • Masks are generated by adding different fonts of learning data characters in pixel unit, and pixel values belonging to each of the masks are divided into 3 groups. Using the directional expansion operators, we expand the text area of the test data character into 4 diagonal directions in order to create the boundary areas to distinguish it from the background area. A mask with a minimum average discordance is selected as the final recognition result by calculating the degree of discordance between the expanded test data and the masks. Image comparison using directional expansion operations more accurately recognizes test data through 4 subdivided recognition processes. It is also possible to expand the ranges of 3 groups of pixel values of masks more evenly such that new fonts can easily be added to the given learning data.

A Study on Vector-based Automatic Caricature Generation (벡터기반의 캐리커처 자동생성에 관한 연구)

  • Park, Yeon-Chool;Oh, Hae-Seok
    • The KIPS Transactions:PartB
    • /
    • v.10B no.6
    • /
    • pp.647-656
    • /
    • 2003
  • This paper proposes the system to generate caricature (character's face) resembling human face using extracted facial features automatically. Since this system is vector-based, the generated character's face has no size limit and constraint. So it is available to transform the shape freely and to apply various facial expressions to 2D face. Moreover, owing to the vector file's advantage, it can be used in mobile environment as small file site.

A Study on Extracting Car License Plate Numbers Using Image Segmentation Patterns

  • Jang, Eun-Gyeom
    • Journal of the Korea Society of Computer and Information
    • /
    • v.23 no.10
    • /
    • pp.87-94
    • /
    • 2018
  • This paper proposes a method of detecting the license plates of vehicles. The proposed technology applicable to different formats of license plates detects the numbers by standardizing the images at edge points. Specifically, in accordance with the format of each license plate, the technology captures the image in the character segment, and compares it against the sample model to derive their similarity and identify the numbers. Characters with high similarities are used to form a group of candidates and to extract the final characters. Analyzing the experimental results found the similarity of the extracted characters exceeded 90%, whereas that of less identifiable numbers was markedly lower. Still, the accuracy of the extracted characters with the highest similarity was over 80%. The proposed technology is applicable to extracting the character patterns of certain formats in diverse and useful ways.

A Study on Grapheme and Grapheme Recognition Using Connected Components Grapheme for Machine-Printed Korean Character Recognition

  • Lee, Kyong-Ho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.9
    • /
    • pp.27-36
    • /
    • 2016
  • Recognition of grapheme is a very important process in the recognition within 'Hangul(Korean written language)' letters using phoneme recognition. It is because the success or failure in the recognition of phoneme greatly affects the recognition of letters. For this reason, it is reported that separation of phonemes is the biggest difficulty in the phoneme recognition study. The current study separates and suggests the new phonemes that used the connective elements that are helpful for dividing phonemes, recommends the features for recognition of such suggested phonemes, databases this, and carried out a set of experiments of recognizing phonemes using the suggested features. The current study used 350 letters in the experiment of phoneme separation and recognition. In this particular kind of letters, there were 1,125 phonemes suggested. In the phoneme separation experiment, the phonemes were divided in the rate of 100%, and the phoneme recognition experiment showed the recognition rate of 98% in recognizing only 14 phonemes into different ones.

Development of Character Input System using Facial Muscle Signal and Minimum List Keyboard (안면근 신호를 이용한 최소 자판 문자 입력 시스템의 개발)

  • Kim, Hong-Hyun;Park, Hyun-Seok;Kim, Eung-Soo
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2009.10a
    • /
    • pp.289-292
    • /
    • 2009
  • A person does communication between each other using language. But In the case of disabled person can not communication own idea to use writing and gesture. Therefore, In this paper, we embodied communication system using the facial muscle signals so that disabled person can do communication. Especially, After feature extraction of the EEG included facial muscle, it is converted the facial muscle into control signal, and then select character and communicate using a minimum list keyboard.

  • PDF

Manchu Script Letters Dataset Creation and Labeling

  • Aaron Daniel Snowberger;Choong Ho Lee
    • Journal of information and communication convergence engineering
    • /
    • v.22 no.1
    • /
    • pp.80-87
    • /
    • 2024
  • The Manchu language holds historical significance, but a complete dataset of Manchu script letters for training optical character recognition machine-learning models is currently unavailable. Therefore, this paper describes the process of creating a robust dataset of extracted Manchu script letters. Rather than performing automatic letter segmentation based on whitespace or the thickness of the central word stem, an image of the Manchu script was manually inspected, and one copy of the desired letter was selected as a region of interest. This selected region of interest was used as a template to match all other occurrences of the same letter within the Manchu script image. Although the dataset in this study contained only 4,000 images of five Manchu script letters, these letters were collected from twenty-eight writing styles. A full dataset of Manchu letters is expected to be obtained through this process. The collected dataset was normalized and trained using a simple convolutional neural network to verify its effectiveness.

An Adaptive Multi-Level Thresholding and Dynamic Matching Unit Selection for IC Package Marking Inspection (IC 패키지 마킹검사를 위한 적응적 다단계 이진화와 정합단위의 동적 선택)

  • Kim, Min-Ki
    • The KIPS Transactions:PartB
    • /
    • v.9B no.2
    • /
    • pp.245-254
    • /
    • 2002
  • IC package marking inspection system using machine vision locates and identifies the target elements from input image, and decides the quality of marking by comparing the extracted target elements with the standard patterns. This paper proposes an adaptive multi-level thresholding (AMLT) method which is suitable for a series of operations such as locating the target IC package, extracting the characters, and detecting the Pinl dimple. It also proposes a dynamic matching unit selection (DMUS) method which is robust to noises as well as effective to catch out the local marking errors. The main idea of the AMLT method is to restrict the inputs of Otsu's thresholding algorithm within a specified area and a partial range of gray values. Doing so, it can adapt to the specific domain. The DMUS method dynamically selects the matching unit according to the result of character extraction and layout analysis. Therefore, in spite of the various erroneous situation occurred in the process of character extraction and layout analysis, it can select minimal matching unit in any environment. In an experiment with 280 IC package images of eight types, the correct extracting rate of IC package and Pinl dimple was 100% and the correct decision rate of marking quality was 98.8%. This result shows that the proposed methods are effective to IC package marking inspection.

An Extraction Method of Number Plates for Various Vehicles Using Digital Signal Analysis Processing Techniques (디지털 신호 분석 기법을 이용한 다양한 번호판 추출 방법)

  • Yang, Sun-Ok;Jun, Young-Min;Jung, Ji-Sang;Ryu, Sang-Hwan
    • Journal of the Institute of Electronics Engineers of Korea SC
    • /
    • v.45 no.3
    • /
    • pp.12-19
    • /
    • 2008
  • Detection of a number plate consists of three stages; division of a number plate, extraction of each character from the plate, recognition of the characters. Among of these three states, division stage of a number plate is the most important part and also the most time-consuming state. This paper suggests an effective region extraction method of a number plate for various images obtained from unmanned inspection systems of illegal parking violation, especially when we have to consider the diverse surrounding environments of roads. Our approaching method detects each region by investigating the characteristics in changes of brightness and intensity between the background part and character part, and the characteristics on character parts such as the sizes, heights, widths, and distance in between two characters. The method also divides a number plate into different types of the plate. This research can solve the number plate region detection failure problems caused by plate edge damages not only for Korean domestic number plates but also for new European style number plates. The method also reduces the time consumption by processing the detection in real-time, therefore, it can be used as a practical solution.

Methods for Video Caption Extraction and Extracted Caption Image Enhancement (영화 비디오 자막 추출 및 추출된 자막 이미지 향상 방법)

  • Kim, So-Myung;Kwak, Sang-Shin;Choi, Yeong-Woo;Chung, Kyu-Sik
    • Journal of KIISE:Software and Applications
    • /
    • v.29 no.4
    • /
    • pp.235-247
    • /
    • 2002
  • For an efficient indexing and retrieval of digital video data, research on video caption extraction and recognition is required. This paper proposes methods for extracting artificial captions from video data and enhancing their image quality for an accurate Hangul and English character recognition. In the proposed methods, we first find locations of beginning and ending frames of the same caption contents and combine those multiple frames in each group by logical operation to remove background noises. During this process an evaluation is performed for detecting the integrated results with different caption images. After the multiple video frames are integrated, four different image enhancement techniques are applied to the image: resolution enhancement, contrast enhancement, stroke-based binarization, and morphological smoothing operations. By applying these operations to the video frames we can even improve the image quality of phonemes with complex strokes. Finding the beginning and ending locations of the frames with the same caption contents can be effectively used for the digital video indexing and browsing. We have tested the proposed methods with the video caption images containing both Hangul and English characters from cinema, and obtained the improved results of the character recognition.