• Title/Summary/Keyword: Text Retrieval

Search Result 344, Processing Time 0.024 seconds

Efficient Content-Based Image Retrieval Method using Shape and Color feature (형태와 칼러성분을 이용한 효율적인 내용 기반의 이미지 검색 방법)

  • Youm, Sung-Ju;Kim, Woo-Saeng
    • The Transactions of the Korea Information Processing Society
    • /
    • v.3 no.4
    • /
    • pp.733-744
    • /
    • 1996
  • Content-based image retrieval(CBIR) is an image data retrieval methodology using characteristic values of image data those are generated by system automatically without any caption or text information. In this paper, we propose a content-based image data retrieval method using shape and color features of image data as characteristic values. For this, we present some image processing techniques used for feature extraction and indexing techniques based on trie and R tree for fast image data retrieval. In our approach, image query result is more reliable because both shape and color features are considered. Also, we how an image database which implemented according to our approaches and sample retrieval results which are selected by our system from 200 sample images, and an analysis about the result by considering the effect of characteristic values of shape and color.

  • PDF

A Novel Sub-image Retrieval Approach using Dot-Matrix (점 행렬을 이용한 새로운 부분 영상 검색 기법)

  • Kim, Jun-Ho;Kang, Kyoung-Min;Lee, Do-Hoon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.13 no.3
    • /
    • pp.1330-1336
    • /
    • 2012
  • The Image retrieval has been study different approaches which are text-based, contents-based, area-based method and sub-image finding. The sub-image retrieval is to find a query image in the target one. In this paper, we propose a novel sub-image retrieval algorithm by Dot-Matrix method to be used in the bioinformatics. Dot-Matrix is a method to evaluate similarity between two sequences and we redefine the problem for retrieval of sub-image to the finding similarity of two images. For the approach, the 2 dimensional array of image converts a the vector which has gray-scale value. The 2 converted images align by dot-matrix and the result shows candidate sub-images. We used 10 images as target and 5 queries: duplicated, small scaled, and large scaled images included x-axes and y-axes scaled one for experiment.

User Adaptation Using User Model in Intelligent Image Retrieval System (지능형 화상 검색 시스템에서의 사용자 모델을 이용한 사용자 적응)

  • Kim, Yong-Hwan;Rhee, Phill-Kyu
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.12
    • /
    • pp.3559-3568
    • /
    • 1999
  • The information overload with many information resources is an inevitable problem in modern electronic life. It is more difficult to search some information with user's information needs from an uncontrolled flood of many digital information resources, such as the internet which has been rapidly increased. So, many information retrieval systems have been researched and appeared. In text retrieval systems, they have met with user's information needs. While, in image retrieval systems, they have not properly dealt with user's information needs. In this paper, for resolving this problem, we proposed the intelligent user interface for image retrieval. It is based on HCOS(Human-Computer Symmetry) model which is a layed interaction model between a human and computer. Its' methodology is employed to reduce user's information overhead and semantic gap between user and systems. It is implemented with machine learning algorithms, decision tree and backpropagation neural network, for user adaptation capabilities of intelligent image retrieval system(IIRS).

  • PDF

Development of KTRIMS Using the Technology of Full Text DB Construction (전문(全文) DB 구축(構築)에 의한 한국통신연구정보관리(韓國通信硏究情報管理) 시스템 개발(開發))

  • Lee, Sang-Yeob;Ahn, Hyun-Soo;Lee, Yang-Ok
    • Journal of Information Management
    • /
    • v.24 no.1
    • /
    • pp.1-20
    • /
    • 1993
  • KTRC(Korea Telecom Research Center) has developed the KTRIMS(Korea Telecom Research Information Management System) to keep and share the full text of the various up-to-date research information which many research institutes in KT have produced. This paper has presented the structure and the features of the KTRIMS.

  • PDF

A Study of Main Contents Extraction from Web News Pages based on XPath Analysis

  • Sun, Bok-Keun
    • Journal of the Korea Society of Computer and Information
    • /
    • v.20 no.7
    • /
    • pp.1-7
    • /
    • 2015
  • Although data on the internet can be used in various fields such as source of data of IR(Information Retrieval), Data mining and knowledge information servece, and contains a lot of unnecessary information. The removal of the unnecessary data is a problem to be solved prior to the study of the knowledge-based information service that is based on the data of the web page, in this paper, we solve the problem through the implementation of XTractor(XPath Extractor). Since XPath is used to navigate the attribute data and the data elements in the XML document, the XPath analysis to be carried out through the XTractor. XTractor Extracts main text by html parsing, XPath grouping and detecting the XPath contains the main data. The result, the recognition and precision rate are showed in 97.9%, 93.9%, except for a few cases in a large amount of experimental data and it was confirmed that it is possible to properly extract the main text of the news.

Information Retrieval Model Using Anchor Text Information and Link Information (Anchor Text 정보와 링크 정보를 이용한 정보 검색 모델)

  • 한기덕;정성원;허희근;이교운;권혁철
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2004.04b
    • /
    • pp.916-918
    • /
    • 2004
  • 90년대 이전에 정보 검색에 대한 연구는 문서의 내용을 기반으로 한 연구가 주류였으며, 90년대에는 링크를 이용한 연구가 활발하였다. 90년대 말에 Page Rank와 HITS가 링크를 이용한 연구의 대표적 사례이며, 최근에는 문서의 내용과 링크 정보를 같이 이용하는 연구가 많이 발표되고 있다. 본 논문도 문서의 정보와 링크 정보를 이용한 새로운 검색 모델을 제시하고자 한다. 본 논문에서 사용하는 링크 정보는 수집된 문서에서 추출한 Page Rank의 가중치와 한 페이지를 가리키는 링크들의 목록이며, 상용하고자 하는 문서의 정보는 본문 내용과 Anchor Text이다. 링크 정보와 문서 정보를 이용하여 Anchor 벡터와 문서 벡터를 만들고, 각각 질의어 벡터와 Cosine Measure를 하여 값을 구한 후, 더한 값을 해당 문서의 가중치로 하여 검색에 이용한다.

  • PDF

Obscene Material Searching Method in WWW (WWW상에서 음란물 검색기법)

  • 노경택;김경우;이기영;김규호
    • Journal of the Korea Society of Computer and Information
    • /
    • v.4 no.2
    • /
    • pp.1-7
    • /
    • 1999
  • World-Wide Web(WWW) is a protocol for changing information exchanges which is central to text documents in the existing network to make a multimedia data exchanges. It is possible for a beginner to search and access data which he wants to find as data were stored in the form of hypertext. The easiness for searching and accessing the multimedia data in WWW makes a important role for obscene materials to be toward generalization and multimedia and occurs social problems for them to be commercialized, while other researchers have actively studied the way to block effectively the site providing obscene materials for solving such problems. This paper presents and implements the blocking method for the sites having obscene material as it effectively search them. The proposed model was based on Link-Based information retrieval method and proved that it accomplished more efficient retrieval of relevant documents than probabilistic model when compared the one with the other which is known to generate the most correct results. The improvements in the average recall and precision ratio were shown as 12% and 8% respectively. Especially, the retrieval capability of relevant documents which include non-text data and have a few links increased highly.

  • PDF

A Study on the Evaluation of the Mobile Web Accessibility of Public Library Services (공공도서관 모바일 웹 접근성 평가에 관한 연구)

  • Park, Ok Nam
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.48 no.1
    • /
    • pp.415-439
    • /
    • 2014
  • The purpose of this study is to analyze mobile web accessibility of public libraries. To achieve this, accessibility checklists were driven from mobile web and application guidelines. The study also conducted automatic and manual evaluations. As results, the study found that text alternatives for resource image, main menu, image for text contents were mainly missing. In addition, focus for retrieval, operating system accessibility for retrieval, and id & password label, color difference for library calendar and seat reservation, brightness difference for main manu and resource display, and space between controls for retrieval were mainly missing. The study also suggested focal missing areas for main library services. It will be utilized as checklists for development of mobile services.

Usability Evaluation of Text-based Search and Visual Search of a Multidisciplinary Library Database (상용 학술데이터베이스의 텍스트 기반 검색과 비주얼검색의 사용성에 관한 연구)

  • Kim, Jong-Ae
    • Journal of the Korean Society for information Management
    • /
    • v.26 no.3
    • /
    • pp.111-129
    • /
    • 2009
  • This study examined the usability of text-based search and visual search of a large multidisciplinary library database to provide an empirical analysis of the acceptability of visual systems in the information retrieval environment. It also examined if there are differences in the usability assessment based on experimental order. The results indicated that the text-based search supported users' search behaviors more efficiently than the visual search. Also the text-based search was rated higher than the visual search in terms of user perceptions of four usability factors.

Construction of Full-Text Database and Implementation of Service Environment for Electronic Theses and Dissertations (학위논문 전문데이터베이스 구축 및 서비스환경 구현)

  • Lee, Kyi-Ho;Kim, Jin-Suk;Yoon, Wha-Muk
    • The Transactions of the Korea Information Processing Society
    • /
    • v.7 no.1
    • /
    • pp.41-49
    • /
    • 2000
  • Form the middle of 199os, most universities in Korea have requested their students to submit not only the original text books but also their Electronic Theses and Dissertations(ETD) for masters degree and doctorates degree. The ETD submitted by the students are usually developed by various kinds of word processors such as MS-Word, LaTex, and HWP. Since there is no standard format for ETD to merge various different formats yet, it is difficult to construct the integrated database that provides full-tex service. In this paper, we transform three different ETD formats into a unified one, construct a full-text database, and implement the full-text retrieval system for effective search in the Internet environment.

  • PDF