Search | Korea Science

A Study on Effective Internet Data Extraction through Layout Detection

Sun Bok-Keun;Han Kwang-Rok
- International Journal of Contents
- /
- v.1 no.2
- /
- pp.5-9
- /
- 2005
Currently most Internet documents including data are made based on predefined templates, but templates are usually formed only for main data and are not helpful for information retrieval against indexes, advertisements, header data etc. Templates in such forms are not appropriate when Internet documents are used as data for information retrieval. In order to process Internet documents in various areas of information retrieval, it is necessary to detect additional information such as advertisements and page indexes. Thus this study proposes a method of detecting the layout of Web pages by identifying the characteristics and structure of block tags that affect the layout of Web pages and calculating distances between Web pages. This method is purposed to reduce the cost of Web document automatic processing and improve processing efficiency by providing information about the structure of Web pages using templates through applying the method to information retrieval such as data extraction.
PDF

Automatic Extraction of Blood Flow Area in Brachial Artery for Suspicious Hypertension Patients from Color Doppler Sonography with Fuzzy C-Means Clustering

Kim, Kwang Baek;Song, Doo Heon;Yun, Sang-Seok
- Journal of information and communication convergence engineering
- /
- v.16 no.4
- /
- pp.258-263
- /
- 2018
Color Doppler sonography is a useful tool for examining blood flow and related indices. However, it should be done by well-trained operator, that is, operator subjectivity exists. In this paper, we propose an automatic blood flow area extraction method from brachial artery that would be an essential building block of computer aided color Doppler analyzer. Specifically, our concern is to examine hypertension suspicious (prehypertension) patients who might develop their symptoms to established hypertension in the future. The proposed method uses fuzzy C-means clustering as quantization engine with careful seeding of the number of clusters from histogram analysis. The experiment verifies that the proposed method is feasible in that the successful extraction rates are 96% (successful in 48 out of 50 test cases) and demonstrated better performance than K-means based method in specificity and sensitivity analysis but the proposed method should be further refined as the retrospective analysis pointed out.
https://doi.org/10.6109/jicce.2018.16.4.258 인용 PDF KSCI HTML

A Knowledge-Based System for Address Block Location on Korean Envelope Images (우리나라 우편 봉투 영상에서의 주소 영역 추추을 위한 지식 기반 시스템)

김기철;이성환
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.31B no.8
- /
- pp.137-147
- /
- 1994
In this paper,we propose a knowledge-based system for locating Destination Address Block(DAB) by analyzing the structure of Korean envelope images. In the proposed system the preprocessing steps such as adaptive binarization connected component extraction and deskewing are carried out first for the effective structure analysis of the envelope image. Then DAB containing address name and zipcode parts of the input envelope image is extracted by an iterative procedure based on the knowledge acquired from the statistical feature analysis of the various envelope images. Most of the system for slocating address blocks on envelopes have extracted DAB by segmenting an envelope image into several candidate blocks followed by selecting one among the candidate blocks. Because it is very difficult to segment a Korean envelope image into several blocks due to the specific writing habits that the addresses on the envelope are written in close proximity to each other the proposed iterative procedure determines DAB by splitting or merging the connected components and verifies the determined DAB without segmentation and selection. Experiments with a great number of the live envelopes provided from Seoul Mail Center in Koorea were carried out. The results reveal that the proposed system is very effective for address block location on Korean envelopes.
PDF

Design of an Efficient VLSI Architecture and Verification using FPGA-implementation for HMM(Hidden Markov Model)-based Robust and Real-time Lip Reading (HMM(Hidden Markov Model) 기반의 견고한 실시간 립리딩을 위한 효율적인 VLSI 구조 설계 및 FPGA 구현을 이용한 검증)

Lee Chi-Geun;Kim Myung-Hun;Lee Sang-Seol;Jung Sung-Tae
- Journal of the Korea Society of Computer and Information
- /
- v.11 no.2 s.40
- /
- pp.159-167
- /
- 2006
Lipreading has been suggested as one of the methods to improve the performance of speech recognition in noisy environment. However, existing methods are developed and implemented only in software. This paper suggests a hardware design for real-time lipreading. For real-time processing and feasible implementation, we decompose the lipreading system into three parts; image acquisition module, feature vector extraction module, and recognition module. Image acquisition module capture input image by using CMOS image sensor. The feature vector extraction module extracts feature vector from the input image by using parallel block matching algorithm. The parallel block matching algorithm is coded and simulated for FPGA circuit. Recognition module uses HMM based recognition algorithm. The recognition algorithm is coded and simulated by using DSP chip. The simulation results show that a real-time lipreading system can be implemented in hardware.
PDF

Efficient Object Classification Scheme for Scanned Educational Book Image (교육용 도서 영상을 위한 효과적인 객체 자동 분류 기술)

Choi, Young-Ju;Kim, Ji-Hae;Lee, Young-Woon;Lee, Jong-Hyeok;Hong, Gwang-Soo;Kim, Byung-Gyu
- Journal of Digital Contents Society
- /
- v.18 no.7
- /
- pp.1323-1331
- /
- 2017
Despite the fact that the copyright has grown into a large-scale business, there are many constant problems especially in image copyright. In this study, we propose an automatic object extraction and classification system for the scanned educational book image by combining document image processing and intelligent information technology like deep learning. First, the proposed technology removes noise component and then performs a visual attention assessment-based region separation. Then we carry out grouping operation based on extracted block areas and categorize each block as a picture or a character area. Finally, the caption area is extracted by searching around the classified picture area. As a result of the performance evaluation, it can be seen an average accuracy of 83% in the extraction of the image and caption area. For only image region detection, up-to 97% of accuracy is verified.
https://doi.org/10.9728/dcs.2017.18.7.1323 인용 PDF KSCI

Face Image Compression Algorithm using Triangular Feature Extraction and GHA (삼각특징추출과 GHA를 이용한 얼굴영상 압축알고리즘)

Seo, Seok-Bae;Kim, Dae-Jin;Gang, Dae-Seong
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.38 no.1
- /
- pp.11-18
- /
- 2001
In this paper, we proposed the image compression algorithm using triangular feature based GHA. In feature extraction, the input images are divided into eight areas of triangular shape, that has positional information for face image compression. The proposed algorithm reduces blocking effects in image reconstruction and contains informations of face feature and shapes of face as input images are divided into eight. We used triangular feature extraction for positional information and GHA for shape information of face images. Simulation results show that the proposed algorithm has a better performance than the block based K-means and non-parsed image based GHA in PSNR at the same bpp.
PDF

Motion Segmentation based on Modified Hierarchical Block-based Motion Estimation and Contour Extraction (블록 기반 움직임 추정과 윤곽선 추출을 통한 움직임 분할)

장정진;김태용;최종수
- Proceedings of the IEEK Conference
- /
- 2001.09a
- /
- pp.333-336
- /
- 2001
본 논문에서는 영상 시퀀스 상에서 물체의 가려짐을 고려하여 상대적인 깊이 순서에 의해 정렬되는 계층을 분리하기 위한 새로운 움직임 분할 방법을 제안한다. 블록을 기반으로 한 움직임 추정 및 클러스터링 과정을 통하여 각 계층에 대한 블록영역을 구하고, 이 블록영역에 대하여 윤곽선 추출을 이용하여 각 계층에 대한 정확한 객체를 분리할 수 있다. 이러한 움직임 분할방법을 통한 동영상의 계층적인 표현은 영상에서 원하지 않는 물체, 전경, 배경의 제거나 기존의 영상을 이용한 새로운 영상의 합성에 이용될 수 있으며, 분할을 통해 얻어진 객체는 영상 압축, 영상 합성 등을 위한 데이터베이스에 저장되어 응용될 수 있다.
PDF

An algorithm for the multi-view image improvement with the restricted number of images in texture extraction (텍스쳐 추출시 제한된 수의 참여 영상을 이용한 multi-view 영상 개선 알고리즘)

김도현;양영일
- Proceedings of the IEEK Conference
- /
- 1998.06a
- /
- pp.773-776
- /
- 1998
In this paper, we propose an efficient multi-view images coding algorithm which finds the optimal texture from the restricted number of multi-view images. The X-Y plane of the normalized object space is divided into triangular patches. The depth value of the node is determined by applying the block based disparity compensation method and then the texture of the each patch is extracted by applying the affine transformation patch is extracted by applying the affine transformation based disparity compensation method to the multi-view images. We restricted the number of images contributed to determining the texture comapred to traditional methods which use all the multi-view images in the texture extraction. Experimental results show that the SNR of images encoded by the proposed algorithm is better than that of imaes encoded by the traditional method by the amount about 0.2dB for the test sets of multi-view images called dragon, kid, city and santa. The recovered images from the encoded data by the proposed method show the better visual images than the recovered images from the encoded data by the traditional methods.
PDF

Hierarchical stereo matching using feature extraction of an image

Kim, Tae-June;Yoo, Ji-Sang
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2009.01a
- /
- pp.99-102
- /
- 2009
In this paper a hierarchical stereo matching algorithm based on feature extraction is proposed. The boundary (edge) as feature point in an image is first obtained by segmenting an image into red, green, blue and white regions. With the obtained boundary information, disparities are extracted by matching window on the image boundary, and the initial disparity map is generated when assigned the same disparity to neighbor pixels. The final disparity map is created with the initial disparity. The regions with the same initial disparity are classified into the regions with the same color and we search the disparity again in each region with the same color by changing block size and search range. The experiment results are evaluated on the Middlebury data set and it show that the proposed algorithm performed better than a phase based algorithm in the sense that only about 14% of the disparities for the entire image are inaccurate in the final disparity map. Furthermore, it was verified that the boundary of each region with the same disparity was clearly distinguished.
PDF

Improving Cover Song Search Accuracy by Extracting Salient Chromagram Components (강인한 크로마그램 성분 추출을 통한 커버곡 검색 성능 개선)

Seo, Jin Soo
- Journal of Korea Multimedia Society
- /
- v.22 no.6
- /
- pp.639-645
- /
- 2019
This paper proposes a salient chromagram components extraction method based on the temporal discrete cosine transform of a chromagram block to improve cover song retrieval accuracy. The proposed salient chromagram emphasizes tonal contents of music, which are well-preserved between an original song and its cover version, while reducing the effects of timbre difference. We apply the proposed salient chromagram extraction method as a preprocessing step for the Fourier-transform based cover song matching. Experiments on two cover song datasets confirm that the proposed salient chromagram improves the cover song search accuracy.
https://doi.org/10.9717/kmms.2019.22.6.639 인용 PDF KSCI HTML

Search Result 132, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)