Search | Korea Science

Collection Selection using Relevance Distribution Information between Queries and Collections in Meta Search (메타 검색에서 질의와 컬렉션 사이의 관련성 분포정보를 이용한 컬렉션 선택)

배종민;김현주
- Journal of Korea Multimedia Society
- /
- v.4 no.4
- /
- pp.287-296
- /
- 2001
This paper proposes an efficient algorithm to select the proper retrieval results from various information sources in Meta search. The algorithm collects and evaluates the related documents to the given query Then, it determines the appropriate retrieval results based on the relevance between the query and the collected documents. This algorithm depends on the Meta information such as the size N of population, top-ranked information of related documents and the precision in order to choose the most appropriate retrieval result.
PDF

Efficient Retrieval of Short Opinion Documents Using Learning to Rank (기계학습을 이용한 단문 오피니언 문서의 효율적 검색 기법)

Chang, Jae-Young
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.13 no.4
- /
- pp.117-126
- /
- 2013
Recently, as Social Network Services(SNS), such as Twitter, Facebook, are becoming more popular, much research has been doing on opinion mining. However, current related researches are mostly focused on sentiment classification or feature selection, but there were few studies about opinion document retrieval. In this paper, we propose a new retrieval method of short opinion documents. Proposed method utilizes previous sentiment classification methodology, and applies several features of documents for evaluating the quality of the opinion documents. For generating the retrieval model, we adopt Learning-to-rank technique and integrate sentiment classification model to Learning-to-rank. Experimental results show that proposed method can be applied successfully in opinion search.
https://doi.org/10.7236/JIIBC.2013.13.4.117 인용 PDF KSCI

Improving the Performance of the User Creative Contents Retrieval Using Content Reputation and User Reputation (콘텐츠 명성 및 사용자 명성 평가를 이용한 UCC 검색 품질 개선)

Bae, Won-Sik;Cha, Jeong-Won
- Journal of the Korea Society for Simulation
- /
- v.19 no.1
- /
- pp.83-90
- /
- 2010
We describe a novel method for improving the performance of the UCC retrieval using content reputation and user reputation. The UCC retrieval is a part of the information retrieval. The goal of the information retrieval system finds documents what users want, so the goal of the UCC retrieval system tries to find UCCs themselves instead of documents. Unlike the document, the UCC has not enough textual information. Therefore, we try to use the content reputation and the user reputation based on non-textual information to gain improved retrieval performance. We evaluate content reputation using the information of the UCC itself and social activities between users related with UCCs. We evaluate user reputation using individual social activities between users or users and UCCs. We build a network with users and UCCs from social activities, and then we can get the user reputation from the network by graph algorithms. We collect the information of users and UCCs from YouTube and implement two systems using content reputation and user reputation. And then we compare two systems. From the experiment results, we can see that the system using content reputation outperforms than the system using user reputation. This result is expected to use the UCC retrieval in the feature.
https://doi.org/10.9709/JKSS.2010.19.1.083 인용 PDF KSCI

Automatic In-Text Keyword Tagging based on Information Retrieval

Kim, Jin-Suk;Jin, Du-Seok;Kim, Kwang-Young;Choe, Ho-Seop
- Journal of Information Processing Systems
- /
- v.5 no.3
- /
- pp.159-166
- /
- 2009
As shown in Wikipedia, tagging or cross-linking through major keywords in a document collection improves not only the readability of documents but also responsive and adaptive navigation among related documents. In recent years, the Semantic Web has increased the importance of social tagging as a key feature of the Web 2.0 and, as its crucial phenotype, Tag Cloud has emerged to the public. In this paper we provide an efficient method of automated in-text keyword tagging based on large-scale controlled term collection or keyword dictionary, where the computational complexity of O(mN) - if a pattern matching algorithm is used - can be reduced to O(mlogN) - if an Information Retrieval technique is adopted - while m is the length of target document and N is the total number of candidate terms to be tagged. The result shows that automatic in-text tagging with keywords filtered by Information Retrieval speeds up to about 6 $\sim$ 40 times compared with the fastest pattern matching algorithm.
https://doi.org/10.3745/JIPS.2009.5.3.159 인용 PDF KSCI

An Improved Approach to Ranking Web Documents

Gupta, Pooja;Singh, Sandeep K.;Yadav, Divakar;Sharma, A.K.
- Journal of Information Processing Systems
- /
- v.9 no.2
- /
- pp.217-236
- /
- 2013
Ranking thousands of web documents so that they are matched in response to a user query is really a challenging task. For this purpose, search engines use different ranking mechanisms on apparently related resultant web documents to decide the order in which documents should be displayed. Existing ranking mechanisms decide on the order of a web page based on the amount and popularity of the links pointed to and emerging from it. Sometime search engines result in placing less relevant documents in the top positions in response to a user query. There is a strong need to improve the ranking strategy. In this paper, a novel ranking mechanism is being proposed to rank the web documents that consider both the HTML structure of a page and the contextual senses of keywords that are present within it and its back-links. The approach has been tested on data sets of URLs and on their back-links in relation to different topics. The experimental result shows that the overall search results, in response to user queries, are improved. The ordering of the links that have been obtained is compared with the ordering that has been done by using the page rank score. The results obtained thereafter shows that the proposed mechanism contextually puts more related web pages in the top order, as compared to the page rank score.
https://doi.org/10.3745/JIPS.2013.9.2.217 인용 PDF KSCI

Semi Automatic Ontology Generation about XML Documents

Gu Mi Sug;Hwang Jeong Hee;Ryu Keun Ho;Jung Doo Yeong;Lee Keum Woo
- Proceedings of the KSRS Conference
- /
- 2004.10a
- /
- pp.730-733
- /
- 2004
Recently XML (eXtensible Markup Language) is becoming the standard for exchanging the documents on the web. And as the amount of information is increasing because of the development of the technique in the Internet, semantic web is becoming to appear for more exact result of information retrieval than the existing one on the web. Ontology which is the basis of the semantic web provides the basic knowledge system to express a particular knowledge. So it can show the exact result of the information retrieval. Ontology defines the particular concepts and the relationships between the concepts about specific domain and it has the hierarchy similar to the taxonomy. In this paper, we propose the generation of semi-automatic ontology based on XML documents that are interesting to many researchers as the means of knowledge expression. To construct the ontology in a particular domain, we suggest the algorithm to determine the domain. So we determined that the domain of ontology is to extract the information of movie on the web. And we used the generalized association rules, one of data mining methods, to generate the ontology, using the tag and contents of XML documents. And XTM (XML Topic Maps), ISO Standard, is used to construct the ontology as an ontology language. The advantage of this method is that because we construct the ontology based on the terms frequently used documents related in the domain, it is useful to query and retrieve the related domain.
PDF

A Study on the Depth-Oriented Decomposition Indexing Method for Creating and Searching Structured Documents Based-on XML (XML을 이용한 구조적 문서 생성 및 탐색을 위한 깊이중심분할 색인기법에 관한 연구)

Yang, Ok-Yul;Lee, Yong-Ju
- The KIPS Transactions:PartD
- /
- v.9D no.6
- /
- pp.1025-1042
- /
- 2002
The goal of this study is to generate a structured document which improves the performance of an information retrieval system by using thesaurus, information on relations between words (terms), and to study on the technique for searching this structured document. In order to accomplish this goal, we propose a DODI (Depth -Oriented Decomposition Index) technique for the structured document and an algorithm to search for related information efficient]y through this index technique that uses a thesaurus. We establish a storage system by which the structured document generated by this index technique is saved in a database through OpenXML and XML documents are generated through ForXML methods.
https://doi.org/10.3745/KIPSTD.2002.9D.6.1025 인용 PDF KSCI

Fast, Flexible Text Search Using Genomic Short-Read Mapping Model

Kim, Sung-Hwan;Cho, Hwan-Gue
- ETRI Journal
- /
- v.38 no.3
- /
- pp.518-528
- /
- 2016
The searching of an extensive document database for documents that are locally similar to a given query document, and the subsequent detection of similar regions between such documents, is considered as an essential task in the fields of information retrieval and data management. In this paper, we present a framework for such a task. The proposed framework employs the method of short-read mapping, which is used in bioinformatics to reveal similarities between genomic sequences. In this paper, documents are considered biological objects; consequently, edit operations between locally similar documents are viewed as an evolutionary process. Accordingly, we are able to apply the method of evolution tracing in the detection of similar regions between documents. In addition, we propose heuristic methods to address issues associated with the different stages of the proposed framework, for example, a frequency-based fragment ordering method and a locality-aware interval aggregation method. Extensive experiments covering various scenarios related to the search of an extensive document database for documents that are locally similar to a given query document are considered, and the results indicate that the proposed framework outperforms existing methods.
https://doi.org/10.4218/etrij.16.0115.0594 인용 PDF KSCI

Document Retrieval using Concept Network (개념 네트워크를 이용한 정보 검색 방법)

Hur, Won-Chang;Lee, Sang-Jin
- Asia pacific journal of information systems
- /
- v.16 no.4
- /
- pp.203-215
- /
- 2006
The advent of KM(knowledge management) concept have led many organizations to seek an effective way to make use of their knowledge. But the absence of right tools for systematic handling of unstructured information makes it difficult to automatically retrieve and share relevant information that exactly meet user's needs. we propose a systematic method to enable content-based information retrieval from corpus of unstructured documents. In our method, a document is represented by using several key terms which are automatically selected based on their quantitative relevancy to the document. Basically, the relevancy is calculated by using a traditional TFIDF measure that are widely accepted in the related research, but to improve effectiveness of the measure, we exploited 'concept network' that represents term-term relationships. In particular, in constructing the concept network, we have also considered relative position of terms occurring in a document. A prototype system for experiment has been implemented. The experiment result shows that our approach can have higher performance over the conventional TFIDF method.
PDF KSCI

Relevant Image Retrieval of Korean Documents based on Sentence and Word Importance (문장 및 단어 중요도를 통한 한국어 문서 연관 이미지 검색)

Kim, Nam-Gyu;Kang, Shin-Jae
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.20 no.3
- /
- pp.43-48
- /
- 2019
While reading text-only documents and finding unknown words, readers will become the focus disturbed and not be able to understand the content of the documents. Because children have little experience, it is difficult to understand correctly if the description in context is unfamiliar or ambiguous. In this paper, in order to help understand the text and increase the interest of the readers, we analyze the texts of documents and select the contents that are considered important, and implement a system that displays the most relevant images automatically from the web and links the texts and the images together. The implementation of the system divides the article into paragraphs, analyzes the text, selects important sentences for each paragraph and the important words that best represent the meaning of the important sentences, searches for images related to the words on the web, and then links the images to each of the previous paragraphs. Experiments have shown how to select important sentences and how to select important words in the sentences. As a result of the experiment, we could get 60% performance by evaluating the accuracy of the relation between three selected images and corresponding important sentences.
https://doi.org/10.5762/KAIS.2019.20.3.43 인용 PDF KSCI HTML

Search Result 56, Processing Time 0.03 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)