• Title/Summary/Keyword: text image

Search Result 981, Processing Time 0.027 seconds

Document Image Layout Analysis Using Image Filters and Constrained Conditions (이미지 필터와 제한조건을 이용한 문서영상 구조분석)

  • Jang, Dae-Geun;Hwang, Chan-Sik
    • The KIPS Transactions:PartB
    • /
    • v.9B no.3
    • /
    • pp.311-318
    • /
    • 2002
  • Document image layout analysis contains the process to segment document image into detailed regions and the process to classify the segmented regions into text, picture, table or etc. In the region classification process, the size of a region, the density of black pixels, and the complexity of pixel distribution are the bases of region classification. But in case of picture, the ranges of these bases are so wide that it's difficult to decide the classification threshold between picture and others. As a result, the picture has a higher region classification error than others. In this paper, we propose document image layout analysis method which has a better performance for the picture and text region classification than that of previous methods including commercial softwares. In the picture and text region classification, median filter is used in order to reduce the influence of the size of a region, the density of black pixels, and the complexity of pixel distribution. Futhermore the classification error is corrected by the use of region expanding filter and constrained conditions.

A Study on the Integrated Coding of Image and Document Data (영상과 문자정보의 통합 부호화에 관한 연구)

  • Lee, Huen-Joo;Park, Goo-Man;Park, Kyu-Tae
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.26 no.7
    • /
    • pp.42-49
    • /
    • 1989
  • A new integrated coding method is proposed in this study for embedding the text information including Hangul into an image. A monochrome analog image may be quantized to a few leveled digital image and be displayed on bi-leveled output devices by using halftone processing techniques. Text data are embedded on each micro pattern. Based on this concept, the encoding and the decoding algorithm are implemented and experiments are performed. As a result, the average amount of the embedded text information is more than 8 bpp (bits per pixer) in this halftone processed image converted form a $64{\times}64$ image, i.e, corresponding to 2000 characters in Hangul, or 4000 characters in alphanumeral. using this algorithm, the integrated personal record management system is implemented.

  • PDF

An Analysis of Tourism Experience and Color Relationships Using Landmark Air Photos (랜드마크 항공 사진을 이용한 관광 경험과 색채 연관성 분석)

  • Yoon, Seungsik;Do, Jinwoo;Kang, Juyoung
    • The Journal of Bigdata
    • /
    • v.3 no.2
    • /
    • pp.51-57
    • /
    • 2018
  • The purpose of this study is to find a valid link between color and tourism experience. We analyzed color that extracted by Aerial photo by IRI Image Scale to find color image. As an indicator of the experience of tourism, a review of the Tripadvisor was selected and analyzed through text mining. Results using text mining results and IRI image scales were generally inconsistent. To identify problems with aerial photo, the results of the analysis using the representative photographs provided by the Tripadvisor in the same way were the same as before. This indicate that details are key of tourism than the image of the overall background. This study presents new research directions by combining color analysis studies with text mining.

Character Recognition Algorithm in Low-Quality Legacy Contents Based on Alternative End-to-End Learning (대안적 통째학습 기반 저품질 레거시 콘텐츠에서의 문자 인식 알고리즘)

  • Lee, Sung-Jin;Yun, Jun-Seok;Park, Seon-hoo;Yoo, Seok Bong
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.11
    • /
    • pp.1486-1494
    • /
    • 2021
  • Character recognition is a technology required in various platforms, such as smart parking and text to speech, and many studies are being conducted to improve its performance through new attempts. However, with low-quality image used for character recognition, a difference in resolution of the training image and test image for character recognition occurs, resulting in poor accuracy. To solve this problem, this paper designed an end-to-end learning neural network that combines image super-resolution and character recognition so that the character recognition model performance is robust against various quality data, and implemented an alternative whole learning algorithm to learn the whole neural network. An alternative end-to-end learning and recognition performance test was conducted using the license plate image among various text images, and the effectiveness of the proposed algorithm was verified with the performance test.

Revival of Text Document Image Contents (텍스트 문서 영상 컨텐츠의 부활)

  • 오일석
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2003.11a
    • /
    • pp.96-102
    • /
    • 2003
  • The human knowledge has been integrated mainly through the text documents. The computer technologies changed the way of production and deliverly of the documents from analog to digital. During the paradigm shift, a serious problem must occur due to a large gap between the old contents and newly generated contents. This paper reviews some methods to reduce the gap for the text document image contents.

  • PDF

An Algorithm for Text Image Watermarking based on Word Classification (단어 분류에 기반한 텍스트 영상 워터마킹 알고리즘)

  • Kim Young-Won;Oh Il-Seok
    • Journal of KIISE:Software and Applications
    • /
    • v.32 no.8
    • /
    • pp.742-751
    • /
    • 2005
  • This paper proposes a novel text image watermarking algorithm based on word classification. The words are classified into K classes using simple features. Several adjacent words are grouped into a segment. and the segments are also classified using the word class information. The same amount of information is inserted into each of the segment classes. The signal is encoded by modifying some inter-word spaces statistics of segment classes. Subjective comparisons with conventional word-shift algorithms are presented under several criteria.

A Hangul Document Image Retrieval System Using Rank-based Recognition (웨이브렛 특징과 순위 기반 인식을 이용한 한글 문서 영상 검색 시스템)

  • Lee Duk-Ryong;Kim Woo-Youn;Oh Il-Seok
    • The Journal of the Korea Contents Association
    • /
    • v.5 no.2
    • /
    • pp.229-242
    • /
    • 2005
  • We constructed a full-text retrieval system for the scanned Hangul document images. The system consists of three parts; preprocessing, recognition, and retrieval components. The retrieval algorithm uses recognition results up to k-ranks. The algorithm is not only insensitive to the recognition errors, but also has the advantage of user-controllable recall and precision. For the objective performance evaluation, we used the scanned images of the Journal of Korea Information Science Society provided by KISTI. The system was shown to be practical through theevaluationofrecognitionandretrievalrates.

  • PDF

A study on the Interactive Expression of Human Emotions in Typography

  • Lim, Sooyeon
    • International Journal of Advanced Culture Technology
    • /
    • v.10 no.1
    • /
    • pp.122-130
    • /
    • 2022
  • In modern times, text has become an image, and typography is a style that is a combination of image and text that can be easily encountered in everyday life. It is developing not only for the purpose of conveying meaningful communication, but also to bring joy and beauty to our lives as a medium with aesthetic format. This study shows through case analysis that typography is a tool for expressing human emotions, and investigates its characteristics that change along with the media. In particular, interactive communication tools and methods used by interactive typography to express viewers' emotions are described in detail. We created interactive typography using the inputted text, the selected music by the viewer and the viewer's movement. As a result of applying it to the exhibition, we could confirm that interactive typography can function as an effective communication medium that shows the utility of both the iconography of letter signs and the cognitive function when combined with the audience's intentional motion.

On the Study of Textual Classics and Artistic Creation - Taking Buddhist Art Dunhuang Grottoes as an Example

  • Liu Tingting
    • International Journal of Advanced Culture Technology
    • /
    • v.11 no.3
    • /
    • pp.205-210
    • /
    • 2023
  • Stone cave paintings are continuous interactions as independent mediums in places such as text, images and stone cave architecture. Unlike Buddha statues, the narrative of the text always fascinates and guides the viewer to the timeliness of the image, that is, the narrative. In particular, in Buddhist art, Buddha statues are never simple images, and murals are never simple paintings. Before the Tang Dynasty, most unknown artists were artisans, and many artists still worked on murals in temples and palaces, and independent paintings such as scrolls and sides became an important form of painting after the Tang Dynasty, changing the mechanism of painting creation. In this paper, the graphic creation process prioritizes dedication and service, but we can still feel the creativity of the painters strongly. The historical resources of how to paint these paintings, the clues to the copies, and the precursor to the foreground, encourage the painters to constantly try to resemble each other and discover problems...Therefore, in this paper, it was confirmed that reinvention and creativity are very important, and that Dunhuang Buddhist art is the basis for artists' creation and the source of vitality.

Gradation Image Processing for Text Recognition in Road Signs Using Image Division and Merging

  • Chong, Kyusoo
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.13 no.2
    • /
    • pp.27-33
    • /
    • 2014
  • This paper proposes a gradation image processing method for the development of a Road Sign Recognition Platform (RReP), which aims to facilitate the rapid and accurate management and surveying of approximately 160,000 road signs installed along the highways, national roadways, and local roads in the cities, districts (gun), and provinces (do) of Korea. RReP is based on GPS(Global Positioning System), IMU(Inertial Measurement Unit), INS(Inertial Navigation System), DMI(Distance Measurement Instrument), and lasers, and uses an imagery information collection/classification module to allow the automatic recognition of signs, the collection of shapes, pole locations, and sign-type data, and the creation of road sign registers, by extracting basic data related to the shape and sign content, and automated database design. Image division and merging, which were applied in this study, produce superior results compared with local binarization method in terms of speed. At the results, larger texts area were found in images, the accuracy of text recognition was improved when images had been gradated. Multi-threshold values of natural scene images are used to improve the extraction rate of texts and figures based on pattern recognition.