Search | Korea Science

Character Segmentation in Chinese Handwritten Text Based on Gap and Character Construction Estimation

Zhang, Cheng Dong;Lee, Guee-Sang
- International Journal of Contents / v.8 no.1 / pp.39-46 / 2012
Character segmentation is a preprocessing step in many offline handwriting recognition systems. In this paper, Chinese characters are categorized into seven different structures. For each structure, the character size and its range of variation are estimated from typical handwritten samples. Component removal and merge criteria are presented to remove punctuation symbols and to merge small components that are part of a character. Finally, criteria are proposed for segmenting adjacent characters that touch or overlap each other.
https://doi.org/10.5392/IJoC.2012.8.1.039

A Study on Stroke Extraction for Handwritten Korean Character Recognition (필기체 한글 문자 인식을 위한 획 추출에 관한 연구)

Choi, Young-Kyoo;Rhee, Sang-Burm
- The KIPS Transactions:PartB / v.9B no.3 / pp.375-382 / 2002
Handwritten character recognition is classified into on-line and off-line recognition. On-line handwritten character recognition has achieved remarkable results compared with off-line handwritten character recognition, because it can acquire dynamic writing information, such as the writing order and the position of a stroke, by means of a pen-based electronic input device such as a tablet. In contrast, no dynamic information can be acquired in off-line handwritten character recognition, and the images show extreme overlapping between consonants and vowels and heavy noise between strokes, so recognition performance changes with the result of the preprocessing. This paper proposes a method that effectively extracts strokes, including the dynamic information of characters, for off-line Korean handwritten character recognition. First, the method enhances and binarizes the input handwritten character image as a preprocessing step using the watershed algorithm. Next, the skeleton is extracted using a modified version of Lu and Wang's thinning algorithm, and segment pixel arrays are extracted by detecting the feature points of the characters. Vectorization is then performed with a maximum permissible error method. When several strokes are bound in one segment, the segment pixel array is divided into two or more segment vectors. To reconstruct complete strokes from the extracted segment vectors, the directional component of each vector is modified using a right-hand writing coordinate system. By combining adjacent segment vectors that can be merged, complete strokes suitable for character recognition are reconstructed. Experiments verify that the proposed method is suitable for handwritten Korean character recognition.
https://doi.org/10.3745/KIPSTB.2002.9B.3.375

Design of Large-set Off-line Handwritten Hangul Character Image Database (대용량 오프라인 한글 글씨 영상 데이타 베이스의 설계)

김두식;김대인
- Korean Journal of Cognitive Science / v.7 no.4 / pp.63-74 / 1996

A Study of Construction of Character Image Data for Recognition Handwritten Text (필기체 문자 인식을 위한 문자 영상 데이터 구축에 관한 연구)

Lee, H.R.;Ko, K.C.;Lee, M.R.
- Annual Conference on Human and Language Technology / 2000.10d / pp.63-67 / 2000
To develop a character recognition system, gathering standard image data is an essential preliminary task. For this purpose, digitized images of handwritten characters were collected. The collected image data include Korean characters, Chinese characters, numerals, English characters, special characters, and so on. This paper deals with a handwritten character image database; an image database that differs from the general storage structure of large-capacity multimedia was designed and built.

Implementation and Design of Handwritten Character Recognition Algorithm Using Touch Screen (터치스크린을 이용한 필기체 문자 인식 알고리즘 설계 및 구현)

Park, Sang-Bong
- The Journal of the Institute of Internet, Broadcasting and Communication / v.14 no.2 / pp.141-146 / 2014
This paper describes the design and implementation of a handwritten character recognition algorithm using a mobile touch screen. The system consists of a PXA320 processor, a capacitive touch panel, and a QT4 interface. The proposed algorithm extracts pattern characteristics (straight line, left circle, right circle) from the input character. The character is identified by a 3-way tree search. The performance of the proposed algorithm is verified using alphabet characters. Because the algorithm is simple, it is suitable for mobile touch screens.
https://doi.org/10.7236/JIIBC.2014.14.2.141

A Unicode based Deep Handwritten Character Recognition model for Telugu to English Language Translation

BV Subba Rao;J. Nageswara Rao;Bandi Vamsi;Venkata Nagaraju Thatha;Katta Subba Rao
- International Journal of Computer Science & Network Security / v.24 no.2 / pp.101-112 / 2024
Telugu is considered the fourth most used language in India, especially in regions such as Andhra Pradesh, Telangana, and Karnataka. It is also a widely growing spoken language internationally. The language comprises various dependent and independent vowels, consonants, and digits. Despite this, Telugu Handwritten Character Recognition (HCR) has not been widely advanced. HCR is a neural network technique for converting a document image into editable text, which can be used in many other applications. This saves time and effort by avoiding starting over from the beginning every time. In this work, a Unicode-based Handwritten Character Recognition (U-HCR) model is developed for translating handwritten Telugu characters into English. Using the Centre of Gravity (CG) in the model, a compound character can easily be divided into individual characters with the help of Unicode values. To train the model, both online and offline Telugu character datasets were used. To extract features from the scanned image, a convolutional neural network (CNN) was used along with machine learning classifiers such as Random Forest and Support Vector Machine. Stochastic Gradient Descent (SGD), Root Mean Square Propagation (RMS-P), and Adaptive Moment Estimation (ADAM) optimizers are used to enhance the performance of U-HCR and to reduce the loss function value. This loss reduction is made possible by applying the optimizers to the CNN. On both online and offline datasets, the proposed model showed promising results, with accuracies of 90.28% for SGD, 96.97% for RMS-P, and 93.57% for ADAM.
https://doi.org/10.22937/IJCSNS.2024.24.2.12

Mass-Spring-Damper Model for Offline Handwritten Character Distortion Analysis

Cho, Beom-Joon
- Journal of Korea Multimedia Society / v.14 no.5 / pp.642-649 / 2011
Among the various aspects of offline handwritten character patterns, it is the great variety of writing styles and variations that makes computer recognition very hard. The immense variety of character shapes has been recognized but rarely studied during the past decades of research. This paper addresses the problem of measuring the distortion of images and handwritten character patterns with respect to reference patterns. The work is based on a mass-spring mesh model, with simulated electric charge introduced as a source of external force that can aid in decoding the shape distortion. Given an input image and a reference image, the charge is defined, and a relaxation procedure then finds the optimum configuration of shape or pattern with the least potential. The relaxation process is based on the fourth-order Runge-Kutta algorithm, which is well known for numerical integration. The proposed modeling method is mathematically rigorous and leads to interesting results. An additional feature of the method is a global affine transformation that helps analyze distortion and find a good match by removing large-scale linear disparity between the two images.
https://doi.org/10.9717/kmms.2011.14.5.642
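
The fourth-order Runge-Kutta relaxation mentioned in this abstract can be illustrated on the simplest possible case. The sketch below is not the paper's mesh model: it assumes a single mass on one spring with a damper and a constant external force, and the names `rk4_step` and `simulate_mass_spring_damper` are hypothetical.

```python
def rk4_step(deriv, t, state, dt):
    """Advance `state` (a list of floats) by one classical RK4 step."""
    k1 = deriv(t, state)
    k2 = deriv(t + dt / 2, [s + dt / 2 * k for s, k in zip(state, k1)])
    k3 = deriv(t + dt / 2, [s + dt / 2 * k for s, k in zip(state, k2)])
    k4 = deriv(t + dt, [s + dt * k for s, k in zip(state, k3)])
    return [
        s + dt / 6 * (a + 2 * b + 2 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    ]


def simulate_mass_spring_damper(mass, damping, stiffness, force,
                                x0, v0, dt, steps):
    """Integrate mass*x'' + damping*x' + stiffness*x = force.

    Illustrative single-mass system (an assumption, not the paper's mesh).
    Returns the final [position, velocity].
    """
    def deriv(_t, state):
        x, v = state
        return [v, (force - damping * v - stiffness * x) / mass]

    t, state = 0.0, [x0, v0]
    for _ in range(steps):
        state = rk4_step(deriv, t, state, dt)
        t += dt
    return state
```

With damping present, the mass relaxes toward the rest position `force / stiffness`, which is the single-mass analogue of the least-potential configuration the abstract describes.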

Efficient Handwritten Character Verification Using an Improved Dynamic Time Warping Algorithm (개선된 동적 타임 워핑 알고리즘을 이용한 효율적인 필기문자 감정)

Jang, Seok-Woo;Park, Young-Jae;Kim, Gye-Young
- Journal of the Korea Society of Computer and Information / v.15 no.7 / pp.19-26 / 2010
In this paper, we suggest an efficient handwritten character verification method for on-line environments that automatically analyzes two input character strings and computes their degree of similarity. The proposed algorithm first applies the circular projection method to the input handwritten strings and extracts their representative features, including shape and direction. It then calculates the similarity between the two character strings using an improved dynamic time warping (DTW) algorithm. We improved the conventional DTW algorithm by adopting the branch-and-bound policy, which is well known to produce good results in various optimization problems. Experimental results show that the suggested verification method operates more efficiently than the existing DTW and DDTW algorithms in terms of speed.
https://doi.org/10.9708/jksci.2010.15.7.019
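
The abstract does not describe how the branch-and-bound policy is applied, so the following is only a generic sketch of bound-based pruning in DTW, not the authors' algorithm. It assumes 1-D numeric feature sequences and an absolute-difference local cost; `dtw_distance` and `upper_bound` are hypothetical names.

```python
import math


def dtw_distance(a, b, upper_bound=math.inf):
    """Dynamic time warping distance between two 1-D sequences.

    Pruning (an illustrative assumption): local costs are non-negative,
    so the smallest value in any row is a lower bound on the final
    distance. Once that minimum exceeds `upper_bound`, the computation
    is abandoned and math.inf is returned.
    """
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        return 0.0 if n == m else math.inf

    prev = [math.inf] * (m + 1)
    prev[0] = 0.0  # empty prefix aligned with empty prefix
    for i in range(1, n + 1):
        curr = [math.inf] * (m + 1)
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of: insertion, deletion, match.
            curr[j] = cost + min(prev[j], curr[j - 1], prev[j - 1])
        if min(curr[1:]) > upper_bound:
            return math.inf  # no warping path can beat the bound
        prev = curr
    return prev[m]
```

In a verification setting, `upper_bound` could be the best distance found so far among candidate references, so that clearly worse candidates are rejected without filling the whole table.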

A Comprehensive Approach for Tamil Handwritten Character Recognition with Feature Selection and Ensemble Learning

Manoj K;Iyapparaja M
- KSII Transactions on Internet and Information Systems (TIIS) / v.18 no.6 / pp.1540-1561 / 2024
This research proposes a novel approach for Tamil Handwritten Character Recognition (THCR) that combines feature selection and ensemble learning techniques. The Tamil script is complex and highly variable, requiring a robust and accurate recognition system. Feature selection is used to reduce dimensionality while preserving discriminative features, improving classification performance and reducing computational complexity. Several feature selection methods are compared, and individual classifiers (support vector machines, neural networks, and decision trees) are evaluated through extensive experiments. Ensemble learning techniques such as bagging and boosting are employed to leverage the strengths of multiple classifiers and enhance recognition accuracy. The proposed approach is evaluated on the HP Labs Dataset, achieving an impressive 95.56% accuracy using an ensemble learning framework based on support vector machines. The dataset consists of 82,928 samples with 247 distinct classes, contributed by 500 participants from Tamil Nadu. It includes 40,000 characters with 500 user variations. The results surpass or rival existing methods, demonstrating the effectiveness of the approach. The research also offers insights for developing advanced recognition systems for other complex scripts. Future investigations could explore the integration of deep learning techniques and the extension of the proposed approach to other Indic scripts and languages, advancing the field of handwritten character recognition.
https://doi.org/10.3837/tiis.2024.06.007

A Study on the Preprocessing Method Using Construction of Watershed for Character Image segmentation

Nam Sang Yep;Choi Young Kyoo;Kwon Yun Jung;Lee Sung Chang
- Proceedings of the IEEK Conference / 2004.08c / pp.814-818 / 2004
Off-line handwritten character recognition suffers from incomplete preprocessing because it lacks dynamic and timing information and must cope with varied handwriting, extreme overlap between consonants and vowels, and many erroneous stroke images. Consequently, off-line handwritten character recognition requires the study of various preprocessing methods such as binarization and thinning. This paper considers the running time of the watershed algorithm and the quality of the resulting image as preprocessing for off-line handwritten Korean character recognition. It proposes an effective application of the watershed algorithm for separating the character region from the background region in a gray-level character image, together with a segmentation function for binarization based on the extracted watershed image. It also proposes a thinning method that effectively extracts the skeleton through a conditional test mask, considering running time and skeleton quality, and evaluates the efficiency of existing methods and the proposed method in terms of running time and quality. The watershed image conversion uses the Prewitt operator for gradient image conversion and extracts local minima considering the 8-neighborhood of each pixel. A method based on the difference of mean values is used in the region merging step. The watershed image converted by this method effectively separates the character region from the background region when the segmentation function is applied. The average execution time was 2.16 seconds for the previous method and 1.72 seconds for the proposed method. The results show that the proposed method removes noise effectively in the presence of overlapping strokes, compared with the previous method.
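
As an illustration of the gradient step named in this abstract, the sketch below computes a Prewitt gradient magnitude on a small grayscale image stored as a list of rows. It covers only that one step, not the watershed transform or region merging, and the function name `prewitt_magnitude` and the choice to leave border pixels at zero are assumptions made for the example.

```python
def prewitt_magnitude(img):
    """Return the Prewitt gradient magnitude of a 2-D grayscale image.

    `img` is a list of equal-length rows of numbers. Border pixels are
    left at 0.0 because the 3x3 kernels do not fit there (an
    illustrative choice, not taken from the paper).
    """
    height, width = len(img), len(img[0])
    out = [[0.0] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            # Horizontal kernel: right column minus left column.
            gx = sum(img[y + dy][x + 1] - img[y + dy][x - 1]
                     for dy in (-1, 0, 1))
            # Vertical kernel: bottom row minus top row.
            gy = sum(img[y + 1][x + dx] - img[y - 1][x + dx]
                     for dx in (-1, 0, 1))
            out[y][x] = (gx * gx + gy * gy) ** 0.5
    return out
```

A watershed transform would then treat this magnitude image as a height map, so that strong edges between character and background form the ridges separating regions.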

Search Result 103, Processing Time 0.02 seconds
