Search | Korea Science

A Robust Marker Detection Algorithm Using Hybrid Features in Augmented Reality (증강현실 환경에서 복합특징 기반의 강인한 마커 검출 알고리즘)

Park, Gyu-Ho;Lee, Heng-Suk;Han, Kyu-Phil
- The KIPS Transactions: Part A, v.17A no.4, pp.189-196, 2010
This paper presents an improved marker detection algorithm that uses hybrid features: corners, line segments, regions, and adaptive threshold values. Augmented reality environments often involve marker occlusion and poor illumination, and the existing ARToolkit fails to recognize the marker in these situations, in particular when the user partially covers the marker, when illumination changes sharply, or when the scene is dim. To address these problems, an adaptive threshold technique is adopted to extract the marker region, and a corner extraction method based on line segments is presented to handle marker occlusion. In addition, because the inner marker region of a warped image is slightly distorted by corner displacement and warping, a compensation method that matches the marker size and center between the registered and extracted markers is proposed to increase template matching efficiency. Experimental results show that the proposed algorithm robustly detects the marker under severe illumination change and occlusion, and that it allows similar markers to be used because the matching efficiency increased by almost 30%.
https://doi.org/10.3745/KIPSTA.2010.17A.4.189
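
As a rough illustration of the adaptive threshold idea mentioned in the abstract above, the sketch below marks a pixel as foreground when it is darker than the mean of its local window. This is a generic local-mean threshold, not the authors' specific technique; the function name `adaptive_threshold` and the `window` and `offset` parameters are assumptions made for this example.

```python
def adaptive_threshold(img, window=3, offset=15):
    """Return a 0/1 mask: 1 where a pixel is darker than its local mean minus offset.

    img is a list of rows of grayscale values. Generic local-mean thresholding,
    shown only as an assumed stand-in for the paper's adaptive threshold step.
    """
    h, w = len(img), len(img[0])
    # Integral image with a zero border, so any window sum costs four lookups.
    integral = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            integral[y + 1][x + 1] = integral[y][x + 1] + row_sum

    r = window // 2
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        y0, y1 = max(0, y - r), min(h, y + r + 1)
        for x in range(w):
            x0, x1 = max(0, x - r), min(w, x + r + 1)
            total = (integral[y1][x1] - integral[y0][x1]
                     - integral[y1][x0] + integral[y0][x0])
            mean = total / ((y1 - y0) * (x1 - x0))
            out[y][x] = 1 if img[y][x] < mean - offset else 0
    return out


# Background fades from bright (200) to dim (80); two darker "marker" pixels.
image = [[200 - 20 * x for x in range(7)] for _ in range(5)]
image[2][1] = 120   # dark pixel in the bright area
image[2][5] = 40    # dark pixel in the dim area
mask = adaptive_threshold(image)
```

A single global threshold of 100 would miss the pixel at value 120 and wrongly flag the dim background at value 80, whereas the local comparison isolates both dark pixels.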

The Geometric Layout Analysis of the Document Image Using Connected Components Method and Median Filter (연결요소 방법과 메디안 필터를 이용한 문서영상 기하학적 구조분석)

Jang, Dae-Geun;Hwang, Chan-Sik
- The Journal of Korean Institute of Communications and Information Sciences, v.27 no.8A, pp.805-813, 2002
To convert paper documents into electronic documents automatically, a document image must be classified into detailed regions such as text, pictures, and tables through geometric layout analysis. However, the complexity of document layouts and the variety in picture size and density make this analysis difficult. In this paper, we propose a method that performs better at region segmentation and classification, and at line extraction in table regions, than commercial software and previous methods. The proposed method segments the document into detailed regions using the connected components method, even when the layout is complex. It also classifies text and pictures using a separable median filter, even when their size and density are diverse. In addition, it extracts lines from tables by applying a one-dimensional median filter in the horizontal and vertical directions, even when the lines are deformed or text is attached to them.

A Study on the Pitch Extraction Improvement Using LSP for the Synthesis of High Speech Quality (고음질 음성합성을 위한 LSP를 이용한 피치검출 성능향상에 관한 연구)

Seo, Ji-Ho;Kim, Jong-Kuk;Bae, Myung-Jin
- The Journal of the Acoustical Society of Korea, v.29 no.1, pp.69-75, 2010
In this paper, the pitch is detected after formant components are eliminated by flattening the spectrum in the frequency domain. To remove the influence of formants and transition frequencies from the signal spectrum, the frequency band is divided into several sub-bands based on the LSP, a point is detected in each sub-band, and a formant envelope is built by linear interpolation between these points; the speech spectrum is then compensated by the inverse of this linearly interpolated envelope. Experimental results show that the proposed method outperforms the LPC, cepstrum, and lifter methods. It reduced the gross error rate by 1.30% compared with the LPC method, which performed best among the other methods. The proposed method also showed a low error rate in noisy environments.
https://doi.org/10.7776/ASK.2010.29.1.069
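
The flattening step in the abstract above can be sketched as follows: pick one point per sub-band, join the points with straight lines to form an envelope, and divide the spectrum by that envelope. The sketch makes several assumptions that are not in the abstract: the sub-band edges are passed in directly as `band_edges` rather than derived from LSP, the point chosen in each sub-band is its peak, and the envelope is held constant outside the first and last points.

```python
def flatten_spectrum(mag, band_edges):
    """Divide a magnitude spectrum by a piecewise-linear envelope.

    mag: list of magnitudes per frequency bin.
    band_edges: assumed sub-band boundaries (bin indices); the paper derives
    its bands from LSP, which is not reproduced here.
    """
    # One anchor (the peak bin) per sub-band.
    anchors = []
    for lo, hi in zip(band_edges[:-1], band_edges[1:]):
        k = max(range(lo, hi), key=lambda i: mag[i])
        anchors.append((k, mag[k]))

    # Linear interpolation between anchors, constant beyond the ends.
    envelope = []
    for i in range(len(mag)):
        if i <= anchors[0][0]:
            envelope.append(anchors[0][1])
        elif i >= anchors[-1][0]:
            envelope.append(anchors[-1][1])
        else:
            for (k0, v0), (k1, v1) in zip(anchors[:-1], anchors[1:]):
                if k0 <= i <= k1:
                    t = (i - k0) / (k1 - k0)
                    envelope.append(v0 + t * (v1 - v0))
                    break

    # Compensate by the inverse of the envelope.
    return [m / e if e > 0 else 0.0 for m, e in zip(mag, envelope)]


# Harmonic peaks every 4 bins under a decaying (formant-like) envelope.
spectrum = [8, 1, 8, 1, 4, 1, 4, 1, 2, 1, 2, 1]
flat = flatten_spectrum(spectrum, band_edges=[0, 4, 8, 12])
```

After flattening, the peaks chosen as anchors have equal height, so a later pitch search is less biased toward bins that were boosted by the envelope.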

An Efficient Numeric Character Segmentation of Metering Devices for Remote Automatic Meter Reading (원격 자동 검침을 위한 효과적인 계량기 숫자 분할)

Toan, Vo Van;Chung, Sun-Tae;Cho, Seong-Won
- Journal of Korea Multimedia Society, v.15 no.6, pp.737-747, 2012
Image processing-based approaches that recognize the meter reading in captured meter images have recently attracted considerable research interest as a way to support automatic meter reading for conventional metering devices. Numeric character segmentation is a critical step for successful recognition. In this paper, we propose an efficient numeric character segmentation method that segments numeric characters well for any type of metering device under diverse illumination conditions. The proposed method consists of two consecutive stages: detection of the number area containing all digits as a tight ROI (Region of Interest), and segmentation of the numeric characters within the ROI. The tight ROI is detected in two steps: a rough ROI is extracted using horizontal line segments after illumination enhancement preprocessing, and the rough ROI is then tightened by clipping based on the vertical and horizontal projections of the binarized ROI. Numeric character segmentation in the detected ROI is achieved stably in two steps: vertical segmentation of each number region, and segmentation of the number within each vertically segmented region. Experiments on an in-house meter image database, containing images of various meter types with low contrast, low intensity, shadow, and saturation, show that the proposed method performs well for any type of metering device under diverse illumination conditions.
https://doi.org/10.9717/kmms.2012.15.6.737
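
The projection-based steps in the abstract above can be sketched on a binarized ROI as follows. This is a minimal illustration, not the authors' implementation: `clip_to_content` tightens the ROI to the rows and columns that contain foreground pixels, and `split_columns` cuts the ROI wherever the vertical projection drops to zero. Both function names and the zero-valued gap criterion are assumptions for this example.

```python
def clip_to_content(binary):
    """Return (top, bottom, left, right), inclusive, of the foreground pixels.

    Uses the horizontal projection (per-row sums) and the vertical projection
    (per-column sums); returns None when the image has no foreground.
    """
    row_profile = [sum(row) for row in binary]
    col_profile = [sum(col) for col in zip(*binary)]
    rows = [y for y, v in enumerate(row_profile) if v > 0]
    cols = [x for x, v in enumerate(col_profile) if v > 0]
    if not rows:
        return None
    return rows[0], rows[-1], cols[0], cols[-1]


def split_columns(binary):
    """Split into (start, end) column ranges separated by empty columns."""
    col_profile = [sum(col) for col in zip(*binary)]
    segments, start = [], None
    for x, v in enumerate(col_profile):
        if v > 0 and start is None:
            start = x                       # a character region begins
        elif v == 0 and start is not None:
            segments.append((start, x - 1))  # region ended at previous column
            start = None
    if start is not None:
        segments.append((start, len(col_profile) - 1))
    return segments


roi = [
    [0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 1, 0],
    [0, 1, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0, 0],
]
box = clip_to_content(roi)
digits = split_columns(roi)
```

Real meter images usually have touching or broken digits, so a practical version would need a tolerance on the gap criterion rather than requiring strictly empty columns.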

Multi-modal Image Processing for Improving Recognition Accuracy of Text Data in Images (이미지 내의 텍스트 데이터 인식 정확도 향상을 위한 멀티 모달 이미지 처리 프로세스)

Park, Jungeun;Joo, Gyeongdon;Kim, Chulyun
- Database Research, v.34 no.3, pp.148-158, 2018
Optical character recognition (OCR) is a technique for extracting and recognizing text from images. It is an important preprocessing step in data analysis, since much real-world text information is embedded in images. Many OCR engines achieve high recognition accuracy on images where the text is clearly separable from the background, such as black lettering on a white background. However, their accuracy is low on images where the text is not easily separable from a complex background. To improve accuracy on such images, the input image must be transformed to make the text more noticeable. In this paper, we propose a method that segments an input image into text lines so that OCR engines can recognize each line more efficiently, and that determines the final output by comparing the recognition rates of a CLAHE module and a Two-step module, both of which separate text from background regions using image processing techniques. Through experiments comparing our method with the well-known OCR engines Tesseract and Abbyy, we show that the proposed method has the best recognition accuracy on images with complex backgrounds.
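
The final selection step described above, choosing between the outputs of two preprocessing modules, could look roughly like the sketch below. It assumes each module returns the recognized text together with a numeric score; how the paper actually measures the recognition rate is not stated in the abstract. The function `pick_best_reading` and the two stand-in modules are hypothetical and do not call any real OCR engine.

```python
def pick_best_reading(image, modules):
    """Run every module on the image and keep the highest-scoring text.

    modules maps a name to a callable returning (text, score). The scoring
    scheme is an assumption; the abstract only says recognition rates are compared.
    """
    best_name, best_text, best_score = None, "", float("-inf")
    for name, run in modules.items():
        text, score = run(image)
        if score > best_score:
            best_name, best_text, best_score = name, text, score
    return best_name, best_text


# Stand-in modules for illustration only; real ones would preprocess the
# text-line image and pass it to an OCR engine.
def fake_clahe_module(image):
    return "HELL0", 0.71


def fake_two_step_module(image):
    return "HELLO", 0.93


choice = pick_best_reading(
    image=None,
    modules={"clahe": fake_clahe_module, "two_step": fake_two_step_module},
)
```

Applying the selection per text line, rather than per image, lets different lines of the same image use whichever module suits their local background.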


