• Title/Summary/Keyword: Boolean retrieval

Search Result 58, Processing Time 0.024 seconds

A Study of Designing the Knowledge Base System for the Query Extension by Index File (색인파일 기반의 질의어 확장용 지식베이스 구축에 관한 연구)

  • Seo, Whee
    • Journal of Korean Library and Information Science Society
    • /
    • v.40 no.2
    • /
    • pp.139-159
    • /
    • 2009
  • This study is to develop knowledge base system for query extension to the user oriented information retrieval. This study has survey the theories of the concept-based information retrieval method and statistic based information retrieval method. In the construction method of knowledge base, the common hypothesis is that the emergence of related term is the frequency of simultaneous emergence of a set of documents. Using the subject index file algorithms and the 'and' operator of boolean logic based on this hypothesis, this study builds the knowledge base. In this research experiment, a subject of knowledge base is education. Using the book of the Introduction to Education, two experimental knowledge base systems is constructed by the different indexing method. One system has constructed by controlled language indexing method, and another system has constructed by natural language indexing method. The performance of two knowledge base system is evaluated.

  • PDF

A Development of the Test Set for Estimating the Retrieval Performance of an Automatic Indexer (자동색인기 성능시험을 위한 Test Set 개발)

  • 김성혁;서은경;이원규;김명철;김영환;김재군
    • Journal of the Korean Society for information Management
    • /
    • v.11 no.1
    • /
    • pp.81-102
    • /
    • 1994
  • Accordmg to the development of various information retneval system suitable for Korean database, many researchers have realized the need of R Test ColleAon which can be r d y used for evaluatmg a retneval system. Therefore, This study developed the TEST SET whch helps ob&vely evaluatmg the retrieval performance of an Hangul Automatic Indexer or Korean Information Retrieval System. The developed Test Set has four files such as: 1) Korean Document Set( * . all): 2) Natural Language Query Set(KTsetnq1): 3) Boolean Query Set(Ktset.bq1): 4) Query-Relevance Judgment Set ( KTsetrel) .

  • PDF

College Students' Preferences of Web-based OPAC Retrieval Techniques and their Blood Types: An Empirical Study (대학생들의 웹 기반 OPAC 검색기법 선호도와 혈액형에 대한 실험적 연구)

  • Kim, Hee-Sop
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.44 no.3
    • /
    • pp.81-102
    • /
    • 2010
  • The purpose of this study was to investigate college students' preferences of Web-based OPAC retrieval techniques and their ABO blood types as an empirical survey. Data was collected through a self-designed questionnaire with a total of 101 undergraduate students from the College of Social Sciences responding. The collected data was analyzed using descriptive statistics, and One-way ANOVA. The results show that 'title' was most preferred among the access points, 'AND' was the most preferred Boolean operator, 'publication year' and 'subject' were the most favored techniques in limiting the scope of retrieval, and 'record number limit per page' was the most frequently used for displaying retrieval results. The results also show that there were little(3 out of 22, i.e. 13.6%) statistically significant differences between the college students' preferences of Web-based OPAC techniques and their blood type.

A Study of Document Ranking Algorithms in a P-norm Retrieval System (P-norm 검색의 문헌 순위화 기법에 관한 실험적 연구)

  • 고미영;정영미
    • Journal of the Korean Society for information Management
    • /
    • v.16 no.1
    • /
    • pp.7-30
    • /
    • 1999
  • This study is to develop effective document ranking algorithms in the P-norm retrieval system which can be implemented to the Boolean retrieval system without major difficulties by using non-statistical term weights based on document structure. Also, it is to enhance the performance by introducing the rank adjustment process which rearranges the ranks of retrieved documents according to the similarity between the top ranked documents and the rest of them. Of the non-statistical term weight algorithms, this study uses field weight and term pair distance weight. In the rank adjustment process, five retrieval experiments were performed, ranging between the case of using one record for the similarity measurement and the case of using first five records. It is proved that non-statistical term weights are highly effective and the rank adjustment process enhance the performance further.

  • PDF

Facet Query Expansion with an Object-Based Thesaurus in Reusable Component Retrieval Systems (재사용 부품 검색 시스템에서 객체기반 시소러스를 이용한 패싯 질의의 확장)

  • Choi, Jae-Hun;Kim, Ki-Heon;Yang, Jae-Dong;Lee, Dong-Gil
    • Journal of KIISE:Software and Applications
    • /
    • v.27 no.2
    • /
    • pp.168-179
    • /
    • 2000
  • In reusable component retrieval systems with facet-based schemes, facet queries are generally used for representing the characteristics of components relevant to users. This paper proposes an expanded facet query equipped with an object-based thesaurus to precisely formulate user's intents. To evaluate the query, a component retrieval system is also designed and implemented. For exactly retrieving the components, user's query should include relevant facet values capable of fully specifying their characteristics. However, simply listing a series of facet values directly inputted by users, conventional queries fails to precisely represent user's intents. Our query, called expanded facet query, employs fuzzy boolean operators and object-based thesaurus; the former logically expresses the fuzzy connectives between facet queries and required components, whereas the latter helps users appropriately select the specific facet values into the query. A thesaurus query is provided to recommend the relevant facet values with their fuzzy degrees from the thesaurus as well. Furthermore, our retrieval system can automatically formulate queries with the recommended facet values, if necessary.

  • PDF

A Comparative Analysis of Content-based Music Retrieval Systems (내용기반 음악검색 시스템의 비교 분석)

  • Ro, Jung-Soon
    • Journal of the Korean Society for information Management
    • /
    • v.30 no.3
    • /
    • pp.23-48
    • /
    • 2013
  • This study compared and analyzed 15 CBMR (Content-based Music Retrieval) systems accessible on the web in terms of DB size and type, query type, access point, input and output type, and search functions, with reviewing features of music information and techniques used for transforming or transcribing of music sources, extracting and segmenting melodies, extracting and indexing features of music, and matching algorithms for CBMR systems. Application of text information retrieval techniques such as inverted indexing, N-gram indexing, Boolean search, truncation, keyword and phrase search, normalization, filtering, browsing, exact matching, similarity measure using edit distance, sorting, etc. to enhancing the CBMR; effort for increasing DB size and usability; and problems in extracting melodies, deleting stop notes in queries, and using solfege as pitch information were found as the results of analysis.

A Study on Improving the Effectiveness of Information Retrieval Through P-norm, RF, LCAF

  • Kim, Young-cheon;Lee, Sung-joo
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.2 no.1
    • /
    • pp.9-14
    • /
    • 2002
  • Boolean retrieval is simple and elegant. However, since there is no provision for term weighting, no ranking of the answer set is generated. As a result, the size of the output might be too large or too small. Relevance feedback is the most popular query reformulation strategy. in a relevance feedback cycle, the user is presented with a list of the retrieved documents and, after examining them, marks those which are relevant. In practice, only the top 10(or 20) ranked documents need to be examined. The main idea consists of selecting important terms, or expressions, attached to the documents that have been identified as relevant by the user, and of enhancing the importance of these terms in a new query formulation. The expected effect is that the new query will be moved towards the relevant documents and away from the non-relevant ones. Local analysis techniques are interesting because they take advantage of the local context provided with the query. In this regard, they seem more appropriate than global analysis techniques. In a local strategy, the documents retrieved for a given query q are examined at query time to determine terms for query expansion. This is similar to a relevance feedback cycle but might be done without assistance from the user.

Relational indexing: theory and practice (연관색인법(聯關索引法)의 이론(理論)과 실제(實際))

  • Kim, Tae Soo
    • Journal of the Korean Society for information Management
    • /
    • v.1 no.1
    • /
    • pp.25-42
    • /
    • 1984
  • The intellectual organization of information for storage and retrieval is one of the problem areas of information science. But the most of the methods have been developed are based on the coordination of keywords by Boolean logic or traditional classification systems. It is necessary to include in indexing process not just concepts or keywords, but also to express explicitly the relationship between them. Farradane believed that such a system should be founded on the psychology of thinking and developed an outline structure for concept organization and a series of explicit relations. The system of relational operators provides nine symbols which enable the semantic relationship between any two concepts to be explicitly specified in a analet. With the addition of the permutations of such analets, formed according to simple rules, alphabetical arrangement of the first term provides a complete logical subject index. Relational indexing as the basis of an indexing language has the potential to offer improved retrieval performance.

  • PDF

A System for converting natural language queries Into boolean queries for Information Retrieval (정보검색을 위한 자연언어 질의어의 불리언 질의로의 변환)

  • 서광준;최기선;나동열
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06c
    • /
    • pp.258-261
    • /
    • 1994
  • 자연언어 인터페이스는 초보자나 비숙련가의 입장에서는 새로운 시스템의 적응에 있어서 어떤 학습도 필요하지 않다는 장점이 있다. 이 연구에서는 불리언 질의를 처리하는 정보검색 시스템의 자연언어 인터페이스를 구혐하였다. 즉, 한국어 자연언어 질의를 불리언 질의로 변환해주는 시스템이다. 접근 방법은 먼저 자연언어 질의를 구문 해석한 후에, 그 결과인 문자의 의존 구조와 불용어 정보를 사용하여 기본적인 불리언 질의를 만든다음, 시소러스를 이용하여 불리언 질의를 확장한다. 여기에서 사용한 구문 해석 방법은 기존 문법에 기반한 방법이다. 변환 시스템은 SPARC-II 호환기종에서 구현되었으며, 약 5만 단어의 사전을 사용한다. 가공된 120 개의 질의를 대상으로 실험한 결과, 전체 소요시간은 13.5초가 걸렸다. 그리고, 변환된 불리언 연산식중에 110개가 적절하게 변환된 것으로 조사되었다.

  • PDF

A Study on Boolean Query Optimization in Information Retrieval (불리언 질의 최적화에 관한 연구)

  • Joo, Won-Kyun;Lee, Min-Ho;Kang, Moo-Young
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2002.11c
    • /
    • pp.1879-1882
    • /
    • 2002
  • 본 논문에서는 불리언 모델을 지원하는 정보검색 시스템에서 사용자로부터 입력받은 불리언 질의를 효율적으로 연산하기 위한 3 가지 방법을 제안한다. 첫째, 불리언 대수를 사용하여 형태적으로 불필요한 노드를 제거한다. 둘째 색인어 출현 빈도 정보를 사용함으로써 빈도 0 을 가지는 노드와 이를 포함하는 노드의 연산 제외 여부를 결정하고, 연산 수행 시 시간이 적게 걸리는 순으로 피 연산자와 연산자의 순서를 재배열한다. 셋째, 불리언 질의 내에 복합 명사가 포함되어 있을 경우 구성 명사와 연산자의 조합을 이용한 질의 확장을 실시한다. 처음 두 가지 방법은 검색 속도의 향상을, 세 번째 방법은 정확도의 향상을 목표로 한다.

  • PDF