Search | Korea Science

Text Line Segmentation of Handwritten Documents by Area Mapping

Boragule, Abhijeet;Lee, GueeSang
- Smart Media Journal / v.4 no.3 / pp.44-49 / 2015
Text line segmentation is a preprocessing step in OCR that can significantly influence the accuracy of document analysis applications. This paper proposes a novel methodology for the text line segmentation of handwritten documents. First, the average width of the connected components is used to form a 1-D Gaussian kernel, and a smoothing operation is then applied to the input binary image. Adaptive binarization of the smoothed image forms the final text lines. The segmentation method involves two stages. In the first stage, the large connected components are each labelled with a unique text line using text line area mapping. In the second stage, the segmentation is refined using the Euclidean distance between the text lines and the small connected components. The group of uniquely labelled text candidates achieves promising segmentation results. The proposed approach works well on Korean and English handwritten documents captured with a camera.
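The smoothing step described in this abstract can be illustrated with a minimal sketch. It assumes the kernel radius and standard deviation are derived from the average component width; the `sigma_scale` parameter, the zero padding at the borders, and the function names are hypothetical choices, not details taken from the paper.

```python
import math


def gaussian_kernel_1d(avg_width, sigma_scale=0.5):
    """Build a normalized 1-D Gaussian kernel sized from avg_width.

    sigma_scale is an assumed tuning knob, not a value from the paper.
    """
    sigma = max(avg_width * sigma_scale, 1e-6)
    radius = max(int(round(avg_width)), 1)
    weights = [math.exp(-(x * x) / (2.0 * sigma * sigma))
               for x in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def smooth_rows(binary_image, kernel):
    """Convolve every row with the kernel, using zero padding at the borders."""
    radius = len(kernel) // 2
    smoothed = []
    for row in binary_image:
        width = len(row)
        out = []
        for x in range(width):
            acc = 0.0
            for k, w in enumerate(kernel):
                src = x + k - radius
                if 0 <= src < width:
                    acc += w * row[src]
            out.append(acc)
        smoothed.append(out)
    return smoothed


# Two small blobs on the first row, separated by a gap; second row is empty.
image = [[1, 1, 0, 0, 1, 1],
         [0, 0, 0, 0, 0, 0]]
kernel = gaussian_kernel_1d(avg_width=2)
smoothed = smooth_rows(image, kernel)
```

After smoothing, the gap between the two blobs carries a nonzero response, so a later binarization step can merge neighbouring components on the same line into one area, while the empty row stays at zero.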

Image Semantic Segmentation Using Improved ENet Network

Dong, Chaoxian
- Journal of Information Processing Systems / v.17 no.5 / pp.892-904 / 2021
An image semantic segmentation model based on an improved ENet network is proposed to address the low accuracy of image semantic segmentation in complex environments. First, pruning and convolution optimization are applied to the ENet network: the network structure is adjusted for better segmentation results by reducing the convolution operations in the decoder and introducing a bottleneck convolution structure. A squeeze-and-excitation (SE) module is then integrated into the optimized ENet network, which improves segmentation accuracy for small-scale targets by automatically learning the importance of each feature channel. Finally, the method is evaluated experimentally on a public dataset. It outperforms the existing comparison methods in mean pixel accuracy (MPA) and mean intersection over union (MIoU), and it maintains both segmentation accuracy and operating efficiency within a short running time.
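The two metrics named in this abstract can be computed from a confusion matrix. The following is a minimal sketch using the common definitions of MPA and MIoU; the function names and the toy labels are illustrative assumptions, and the paper's exact evaluation protocol is not given in the abstract.

```python
def confusion_matrix(truth, pred, num_classes):
    """Count pixels: rows are ground-truth classes, columns are predictions."""
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(truth, pred):
        matrix[t][p] += 1
    return matrix


def mean_pixel_accuracy(matrix):
    """Average the per-class recall over classes present in the ground truth."""
    accs = []
    for c, row in enumerate(matrix):
        total = sum(row)
        if total:
            accs.append(row[c] / total)
    return sum(accs) / len(accs) if accs else 0.0


def mean_iou(matrix):
    """Average TP / (TP + FP + FN) over classes with a nonempty union."""
    n = len(matrix)
    ious = []
    for c in range(n):
        tp = matrix[c][c]
        fn = sum(matrix[c]) - tp
        fp = sum(matrix[r][c] for r in range(n)) - tp
        union = tp + fp + fn
        if union:
            ious.append(tp / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example: four pixels, two classes (flattened label maps).
truth = [0, 0, 1, 1]
pred = [0, 1, 1, 1]
cm = confusion_matrix(truth, pred, num_classes=2)
mpa = mean_pixel_accuracy(cm)   # (1/2 + 2/2) / 2 = 0.75
miou = mean_iou(cm)             # (1/2 + 2/3) / 2
```

MIoU penalizes false positives as well as false negatives, which is why it is lower than MPA in this toy case.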
https://doi.org/10.3745/JIPS.02.0164

A Moving Picture Coding Method Based on Region Segmentation Using Genetic Algorithm (유전적 알고리즘을 이용한 동화상의 영역분할 부호화 방법)

Jung, Nam-Chae
- Journal of the Institute of Convergence Signal Processing / v.10 no.1 / pp.32-39 / 2009
In this paper, a region segmentation method using a genetic algorithm is proposed to improve the efficiency of moving picture coding. A genetic algorithm searches a large search space for an optimal combination, iteratively and using only function values. By performing motion estimation and region segmentation simultaneously, we can assign motion vectors in an image to small blocks or individual pixels, and recast the trade-off between the amount of coded data and the signal-to-noise ratio as an optimization problem. In motion-compensated predictive coding, region segmentation and motion estimation are closely correlated. The goal is to optimize the amount of coded data and the S/N ratio, and to assign the motion vector of each block of the picture according to the result of the optimization. We therefore examine both the data representation used by the genetic algorithm and the data processing method needed to obtain an optimal region segmentation. Computer simulations on test pictures confirm the validity of the proposed method.

Document Image Segmentation and Classification using Texture Features and Structural Information (텍스쳐 특징과 구조적인 정보를 이용한 문서 영상의 분할 및 분류)

Park, Kun-Hye;Kim, Bo-Ram;Kim, Wook-Hyun
- Journal of the Institute of Convergence Signal Processing / v.11 no.3 / pp.215-220 / 2010
In this paper, we propose a new texture-based page segmentation and classification method that automatically identifies the table, background, image, and text regions in a given document image. The proposed method consists of two stages: document segmentation and content classification. In the first stage, we segment the document image; in the second stage, we classify the contents of the document. The proposed classification method is based on texture analysis. Each type of content in the document is treated as a region with a distinct texture, so the problem of classifying document contents can be posed as a texture segmentation and analysis problem. Two-dimensional Gabor filters are used to extract texture features for each of these regions. Our method does not assume any a priori knowledge about the content or language of the document. Experimental results show that our method performs well in document segmentation and content classification. The proposed system is expected to be applicable to tasks such as multimedia data search and real-time image processing.

Semantic Object Segmentation Using Conditional Generative Adversarial Network with Residual Connections (잔차 연결의 조건부 생성적 적대 신경망을 사용한 시맨틱 객체 분할)

Ibrahem, Hatem;Salem, Ahmed;Yagoub, Bilel;Kang, Hyun Su;Suh, Jae-Won
- Journal of the Korea Institute of Information and Communication Engineering / v.26 no.12 / pp.1919-1925 / 2022
In this paper, we propose an image-to-image translation approach based on the conditional generative adversarial network for semantic segmentation. Semantic segmentation is the task of clustering together the parts of an image that belong to the same object class. Unlike the traditional pixel-wise classification approach, the proposed method parses an input RGB image into its corresponding semantic segmentation mask using a pixel regression approach. The proposed method is based on the Pix2Pix image synthesis method. We employ convolutional neural network architectures with residual connections for both the generator and the discriminator, as the residual connections speed up the training process and generate more accurate results. The proposed method was trained and tested on the NYU-depthV2 dataset and achieves an mIoU of 49.5%. We also compare the proposed approach with current semantic segmentation methods, showing that it outperforms most of them.
https://doi.org/10.6109/jkiice.2022.26.12.1919

Document Layout Analysis Using Coarse/Fine Strategy (Coarse/fine 전략을 이용한 문서 구조 분석)

박동열;곽희규;김수형
- Proceedings of the IEEK Conference / 2000.06d / pp.198-201 / 2000
We propose a method for analyzing document structure. The method consists of two processes: segmentation and classification. The segmentation first divides a low-resolution image and then finely splits the original document image using projection profiles. The classification labels each segmented region as text, line, table, or image. An experiment with 238 document images shows a segmentation accuracy of 99.1% and a classification accuracy of 97.3%.
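As a rough illustration of splitting with projection profiles, the sketch below sums foreground pixels per row and cuts the page at empty rows. It covers only the profile-based splitting, not the coarse low-resolution pass; the `min_gap` parameter, the function names, and the toy page are assumptions made for the example.

```python
def horizontal_profile(image):
    """Sum the foreground pixels (1s) in every row of a binary image."""
    return [sum(row) for row in image]


def split_on_gaps(profile, min_gap=1):
    """Return (start, end) runs of nonzero profile values, end exclusive.

    Runs separated by fewer than min_gap empty positions are merged.
    """
    runs = []
    start = None
    for i, value in enumerate(profile):
        if value > 0 and start is None:
            start = i
        elif value == 0 and start is not None:
            runs.append((start, i))
            start = None
    if start is not None:
        runs.append((start, len(profile)))

    merged = []
    for run in runs:
        if merged and run[0] - merged[-1][1] < min_gap:
            merged[-1] = (merged[-1][0], run[1])
        else:
            merged.append(run)
    return merged


# Toy binary page: two blocks of ink separated by one empty row.
page = [[0, 0, 0, 0],
        [1, 1, 0, 1],
        [1, 0, 0, 1],
        [0, 0, 0, 0],
        [0, 1, 1, 0],
        [0, 0, 0, 0]]
profile = horizontal_profile(page)
regions = split_on_gaps(profile)                # splits at the empty row
coarse = split_on_gaps(profile, min_gap=2)      # a 1-row gap is too small to split
```

Applying the same routine to column sums inside each row band gives the vertical cuts, so the two profiles can be alternated to split a page recursively.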

A Study on the Implementation of the Picture segmentation for a Real-Time Automatic Video Tracker System (실시간 자동영상 추적기를 위한 영상영역화의 구현에 관한 연구)

문종환;김경수;김재희
- Proceedings of the Korean Institute of Communication Sciences Conference / 1986.10a / pp.186-190 / 1986
This paper describes a way of implementing the segmentation of 128×128 pixel images to be used as inputs to a real-time automatic video tracker. The suggested method uses the lowest valley value of the computed 16-level intensity histogram. This method improves smoothing effects and also significantly reduces hardware requirements. The entire segmentation process is carried out in 10 msec, making real-time application possible.
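One possible software reading of the histogram-valley idea is sketched below. It assumes 8-bit intensities quantized into 16 levels and treats a "valley" as an interior bin with a higher count somewhere on each side; this interpretation, the function names, and the toy data are assumptions, and the paper itself describes a hardware implementation.

```python
def bin_index(pixel, max_value=255):
    """Quantize an intensity in [0, max_value] to one of 16 levels."""
    return min(pixel * 16 // (max_value + 1), 15)


def histogram_16(pixels, max_value=255):
    """Build the 16-level intensity histogram."""
    bins = [0] * 16
    for p in pixels:
        bins[bin_index(p, max_value)] += 1
    return bins


def lowest_valley(bins):
    """Return the interior bin with the smallest count that lies between
    two higher bins, or None if the histogram has no such valley."""
    best = None
    for i in range(1, len(bins) - 1):
        if max(bins[:i]) > bins[i] and max(bins[i + 1:]) > bins[i]:
            if best is None or bins[i] < bins[best]:
                best = i
    return best


def segment(pixels, max_value=255):
    """Label pixels above the valley level as 1 and the rest as 0."""
    valley = lowest_valley(histogram_16(pixels, max_value))
    if valley is None:
        return [0] * len(pixels)
    return [1 if bin_index(p, max_value) > valley else 0 for p in pixels]


# Toy data: four dark pixels and four bright pixels.
pixels = [10, 12, 20, 30, 200, 210, 220, 230]
valley = lowest_valley(histogram_16(pixels))
labels = segment(pixels)
```

Quantizing to 16 levels keeps the histogram small and suppresses minor intensity fluctuations, which fits the abstract's remarks on smoothing and reduced hardware requirements.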

Slant Correction and Character String Segmentation using Vertical Transition (수직 천이점 검출을 통한 인쇄체 우편 영상에서의 회전각 보정 및 문자열 추출)

이재용;오현화;장승익;진성일
- Proceedings of the IEEK Conference / 2003.11a / pp.469-472 / 2003
Skew inevitably occurs in scanned document images, and character recognition systems are generally very sensitive to the skew angle. In this paper, we propose a robust slant correction algorithm based on dithering and vertical transition estimation. Character strings are segmented by projecting the vertical transition points and the slant-corrected image. The segmentation method using the vertical transition points can effectively split character strings that touch each other vertically. Experimental results show that the proposed method achieves robust slant correction and good character string segmentation performance.

Image Segmentation Using Bi-directional Distribution Functions of Histogram (히스토그램의 양방향 분포함수를 이용한 영상분할)

남윤석;하영호;김수중
- Journal of the Korean Institute of Telematics and Electronics / v.24 no.6 / pp.1020-1024 / 1987
An image segmentation method based on the curvature of the bi-directional distribution functions of a histogram, requiring no mode information, is proposed. The curvature is an oscillating function and can be approximated by a polynomial using a least-squares fit with the Chebyshev basis. The resulting nonhomogeneous linear equations are solved by Gaussian elimination. In the proposed algorithm, critical points of the curvature are obtained in each direction to compensate for segmentation parameters that can be overlooked when only a one-directional histogram is used.

Shape Segmentation by Watersheds (Watershed에 의한 형태분할)

김태진;김주영;고광식
- Proceedings of the IEEK Conference / 1999.11a / pp.573-576 / 1999
This paper presents a new shape segmentation algorithm. The procedure for achieving complete segmentation consists of two steps: the first step maps the shape into two dimensions using the distance transform, and the second step partitions the region using the watershed algorithm. As an application of the proposed algorithm, we perform matching experiments on several objects using the segmented regions. Simulation results demonstrate the efficiency of the proposed method, which is invariant to scale, rotation, and shift.

Search Result 2,150, Processing Time 0.028 seconds
