• Title/Summary/Keyword: and Information Retrieval

Query Formulation for Heuristic Retrieval in Obfuscated and Translated Partially Derived Text

  • Kumar, Aarti;Das, Sujoy
    • Journal of Information Science Theory and Practice / v.3 no.1 / pp.24-39 / 2015
  • Pre-retrieval query formulation is an important step in identifying local text reuse. Local reuse involving high obfuscation, paraphrasing, or translation makes it challenging to find the reused text in a document. This paper studies three pre-retrieval query formulation strategies for heuristic retrieval of low-obfuscation, high-obfuscation, and translated text: (a) query formulation using proper nouns; (b) query formulation using unique words (hapax); and (c) query formulation using the most frequent words. Although keywords with hapax proved slightly more efficient for low and high obfuscation and simulated paraphrasing, initial results indicate that the simple strategy of query formulation using proper nouns gives promising results and may prove better at reducing the size of the corpus for post-processing when identifying local text reuse in obfuscated and translated text.
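
Strategies (b) and (c) above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `hapax_query` and `frequent_query`, the regular-expression tokenizer, and the tiny stopword list are all assumptions made for illustration. Strategy (a), proper nouns, is omitted because it would need a part-of-speech tagger or named-entity recognizer.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption; the paper's list is not given here).
STOPWORDS = {"the", "a", "an", "of", "in", "and", "to", "is", "on"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens, dropping stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def hapax_query(text, max_terms=5):
    """Return up to max_terms words that occur exactly once, in order of appearance."""
    tokens = tokenize(text)
    counts = Counter(tokens)
    return [t for t in tokens if counts[t] == 1][:max_terms]


def frequent_query(text, max_terms=5):
    """Return up to max_terms most frequent words (ties keep first-seen order)."""
    counts = Counter(tokenize(text))
    return [word for word, _ in counts.most_common(max_terms)]


passage = "The cat chased the mouse and the cat caught the mouse in the barn"
print(hapax_query(passage))                  # ['chased', 'caught', 'barn']
print(frequent_query(passage, max_terms=2))  # ['cat', 'mouse']
```

In a retrieval pipeline, the returned terms would be joined into a query string and sent to the search engine; how many terms to keep per query is a tuning choice that this sketch leaves open.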

Text Partitioned Indexing Method for Educational Documents (교육용 문서의 텍스트분할 색인)

  • Kang, Mu-Yeong;Lee, Sang-Gu
    • Journal of The Korean Association of Information Education / v.3 no.2 / pp.72-84 / 2000
  • Information retrieval systems play a key role in the information society by storing digital documents efficiently and delivering information to users quickly through retrieval. Indexing in particular is a prerequisite function for effectively retrieving the information in documents stored in a database. In this paper, we propose an indexing method based on text partitioning. The method can retrieve educational documents in a short processing time. We applied the proposed indexing method to a real information retrieval system and demonstrated that it performs well.

AN EFFICIENT DENSITY BASED ANT COLONY APPROACH ON WEB DOCUMENT CLUSTERING

  • M. REKA
    • Journal of applied mathematics & informatics / v.41 no.6 / pp.1327-1339 / 2023
  • World Wide Web (WWW) use has been increasing recently as users need more information. Lately, the amount of document information available to end users through the internet has been growing. The web's document search process is essential for finding documents relevant to user queries. As the number of general web pages increases, it becomes increasingly challenging for users to find records that match their interests. However, existing Document Information Retrieval (DIR) approaches are time-consuming for large document collections. To alleviate the problem, this paper presents Spatial Clustering Ranking Pattern (SCRP) based Density Ant Colony Information Retrieval (DACIR) for user-query-based DIR. The first stage applies the Term Frequency Weight (TFW) technique to identify the query weightage-based frequency. Based on the weight score, the documents are grouped and ranked using the proposed SCRP technique. Finally, based on the ranking, the DACIR algorithm selects and retrieves the most relevant documents. The proposed method outperforms traditional information retrieval methods in the quality of returned objects while performing significantly better in run time.

An Expert System for Content-based Image Retrieval with Object Database (객체 데이터베이스를 이용한 내용기반 이미지 검색 전문가 시스템)

  • Kim, Young-Min;Kim, Seong-In
    • Journal of Institute of Control, Robotics and Systems / v.14 no.5 / pp.473-482 / 2008
  • In this paper, we propose an expert system for content-based image retrieval with an object database. The proposed system finds keywords by using a knowledge base and the features of extracted objects, and retrieves images by using a keyword-based image retrieval method. The system can reduce image retrieval errors and save running time. It also checks whether similar objects exist; if not, the user can store the object's information in the object database. The proposed system is flexible and extensible, enabling experts to incrementally add more knowledge and information. Experimental results show that the proposed system is more effective than an existing content-based image retrieval method in running time and precision.

Design And Implementation of Video Retrieval System for Using Semantic-based Annotation (의미 기반 주석을 이용한 비디오 검색 시스템의 설계 및 구현)

  • 홍수열
    • Journal of the Korea Society of Computer and Information / v.5 no.3 / pp.99-105 / 2000
  • Video has become an important element of multimedia computing and communication environments, with applications as varied as broadcasting, education, publishing, and military intelligence. The need for efficient multimedia data retrieval methods keeps growing because of various large-scale multimedia applications. Accordingly, the retrieval and representation of video data has become one of the main research issues in video databases. There have been two main approaches to representing video data: (1) content-based video retrieval and (2) annotation-based video retrieval. This paper designs and implements a video retrieval system that uses semantic-based annotation.

Implementing and Evaluating an Empirical Variable Retrieval System : The Entity-Relationship and Relational Approach (실험변수를 이용한 정보검색 시스템의 구축 및 평가 : 개체-관계 모델과 관계형 데이터베이스를 이용한 접근)

  • Oh Sam-Gyun
    • Journal of the Korean Society for Library and Information Science / v.32 no.4 / pp.53-67 / 1998
  • This article investigates the potential of using empirical variables and their associated statistical relationships in document representation and retrieval. To this end, a newly devised empirical fact retrieval system (EFRS) was evaluated against a simulated traditional retrieval system (TRS) using a set of predetermined empirical queries. Results indicate that the EFRS generally outperformed the TRS in terms of precision, search effort, and measures of user satisfaction.

Survey and Suggestion for Standardization of Online Catalog Retrieval Systems: Focused on the University Library Catalogs in Busan, Ulsan, Gyeongnam District (자동화목록 검색시스템의 현황과 표준화 방안 - 부산.울산.경남지역 대학도서관 목록의 분석을 중심으로 -)

  • Doh, Tae-Hyeon
    • Journal of Korean Library and Information Science Society / v.38 no.4 / pp.357-376 / 2007
  • This study surveyed the online catalog retrieval systems of the university libraries in the Busan, Ulsan, and Gyeongnam districts. The types of library materials, the kinds of access points, and the retrieval conditions (Boolean logic, methods of index term identification, and particularities of retrieval) vary widely from one system to another. Based on the results of this survey, a suggestion for the standardization of online catalog retrieval systems is offered.

Design of Indexing Agent for Semantic-based Video Retrieval (의미기반 비디오 검색을 위한 인덱싱 에이전트의 설계)

  • Lee, Jong-Hee;Oh, Hae-Seok
    • The KIPS Transactions: Part B / v.10B no.6 / pp.687-694 / 2003
  • With the recent rapid increase in the quantity of multimedia data, various means of searching video data are in demand. To process video data effectively, the content information of the video data must be loaded into a database, and a semantic-based retrieval method must be available for users' various queries. Existing content-based video retrieval systems search by a single method, such as annotation-based or feature-based retrieval; they show low search efficiency and, because automatic processing is less than perfect, require considerable effort from the system administrator or annotator. In this paper, we propose a semantic-based video retrieval system that supports semantic retrieval for various users by combining feature-based and annotation-based retrieval over massive video data. From the user's basic query and the user's selection of an image among the key frames extracted for that query, the agent gives the annotation of the extracted key frame a more detailed form. In addition, the key frame selected by the user becomes a query image, and the most similar key frames are searched using the proposed feature-based retrieval method. We thus design a system that can improve the retrieval efficiency of video data through semantic-based retrieval.

A Study on Icon Detection in Korean Traditional Paintings (한국 전통회화 내 도상 검출에 관한 연구)

  • Jiwon Lee;JungSoo Lee;Sungwon Moon;Do-Won Nam;Wonyoung Yoo
    • Proceedings of the Korea Information Processing Society Conference / 2023.11a / pp.446-448 / 2023
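  • Various efforts have recently been made to introduce AI into the field of cultural heritage interpretation, but current offerings merely deliver the same manually pre-entered interpretation content repeatedly to many visitors, without considering visitors' characteristics or interests. In this paper, as a basic study toward interpreting the cultural heritage a visitor is viewing in ways tailored to that visitor's diverse interests, we investigate the detection of icons in Korean traditional paintings supplied as images. Because this work was conducted as a feasibility study, the experimental results presented so far do not show strong icon detection performance; however, if performance is improved through various augmentation techniques and few-shot learning, the approach is expected to be well suited to visitor-tailored cultural heritage interpretation.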