• Title/Summary/Keyword: Web search engines

Search Result 210, Processing Time 0.032 seconds

Design and Implementation of Information Retrieval System Based on Ontology Using Semantic Web (시맨틱 웹을 이용한 온톨로지 기반의 정보검색 시스템 설계 및 구현)

  • Seo, Woo-Jin;Rhyu, Kyeong-Taek
    • Journal of Digital Convergence
    • /
    • v.17 no.1
    • /
    • pp.209-217
    • /
    • 2019
  • In this paper, the purpose of this paper is to lay the foundation for the search system by using and building an online search engine suitable for the search domain and enabling search, conversion, integration and sharing of information. It is to use the ontology to infer hierarchical relationships, deduce objects based on that layer, and extract attributes to search areas that are relevant to the data that the user wants. In order to search for information in this way, the information search system was implemented by entering key words related to 'qualifications'. The implemented system arranged the meaning and relationship of each attribute online so that the general public can search information quickly, easily, and accurately. In addition, the implementation results were compared with two different search engines. Comparable search engines are Naver and Daum, the two major search engines. The search engine of this study, which was built using an ontology suitable for the search domain to perform searches using the semantic web, was evaluated to have excellent results. However, it is thought that a more formalized online location is necessary to increase the accuracy and reliability of search engines and to include more comprehensive categories of search terms.

Development of A Plagiarism Detection System Using Web Search and Morpheme Analysis (인터넷 검색과 형태소분석을 이용한 표절검사시스템의 개발에 관한 연구)

  • Hwang, In-Soo
    • Journal of Information Technology Applications and Management
    • /
    • v.16 no.1
    • /
    • pp.21-36
    • /
    • 2009
  • As the World Wide Web (WWW) has become a major channel for information delivery, the data accumulated in the Internet increases at an incredible speed, and it derives the advances of information search technologies. It is the search engine that solves the problem of information overloading and helps people to identify relevant information. However, as search engines become a powerful tool for finding information, the opportunities of plagiarizing have increased significantly in e-Learning. In this paper, we developed an online plagiarism detection system for detecting plagiarized documents that incorporates the functions of search engines and acts in exactly the same way of plagiarizing. The plagiarism detection system uses morpheme analysis to improve the performance and sentence-based comparison to investigate document comes from multiple sources. As a result of applying this system in e-Learning, the performance of plagiarism detection was improved.

  • PDF

Folksonomy-based Personalized Web Search System (폭소노미 기반 개인화 웹 검색 시스템)

  • Kim, Dong-Wook;Kang, Soo-Yong;Kim, Han-Joon;Lee, Byung-Jeong
    • Journal of Digital Contents Society
    • /
    • v.11 no.1
    • /
    • pp.105-115
    • /
    • 2010
  • Search engines provide web documents that are related to user's query. However, using only the query terms that user provided, it is hard for search engines to know user's exact intention and provide the very matching web documents. To remedy this problem, search systems are needed to exploit personalized search technologies. In this paper, we propose not only a novel personalized query recommendation scheme based on folksonomy but also a new personalized search service architecture which reduces the risk of privacy violation while enabling search service providers to provide other various personalized services such as personalized advertisement.

An Improved Combined Content-similarity Approach for Optimizing Web Query Disambiguation

  • Kamal, Shahid;Ibrahim, Roliana;Ghani, Imran
    • Journal of Internet Computing and Services
    • /
    • v.16 no.6
    • /
    • pp.79-88
    • /
    • 2015
  • The web search engines are exposed to the issue of uncertainty because of ambiguous queries, being input for retrieving the accurate results. Ambiguous queries constitute a significant fraction of such instances and pose real challenges to web search engines. Moreover, web search has created an interest for the researchers to deal with search by considering context in terms of location perspective. Our proposed disambiguation approach is designed to improve user experience by using context in terms of location relevance with the document relevance. The aim is that providing the user a comprehensive location perspective of a topic is informative than retrieving a result that only contains temporal or context information. The capacity to use this information in a location manner can be, from a user perspective, potentially useful for several tasks, including user query understanding or clustering based on location. In order to carry out the approach, we developed a Java based prototype to derive the contextual information from the web results based on the queries from the well-known datasets. Among those results, queries are further classified in order to perform search in a broad way. After the result provision to users and the selection made by them, feedback is recorded implicitly to improve the web search based on contextual information. The experiment results demonstrate the outstanding performance of our approach in terms of precision 75%, accuracy 73%; recall 81% and f-measure 78% when compared with generic temporal evaluation approach and furthermore achieved precision 86%, accuracy 71%; recall 67% and f-measure 75% when compared with web document clustering approach.

A Study on Changes of the Intellectual Structure in Web Information Using the Co-links Analysis (동시링크분석을 이용한 웹정보원의 지적구조 변화에 관한 연구)

  • Lee, Sung-Sook
    • Journal of the Korean Society for information Management
    • /
    • v.22 no.2 s.56
    • /
    • pp.205-228
    • /
    • 2005
  • This research analyzed changes of the intellectual structure of web information by examining time changes and search engines using the co-links analysis. According to the results, the co-links web information clusters on the two maps appeared to contain changes in the intellectual structure over the two time periods. The intellectual structure that appeared in the information map for AltaVista and MSN Search engines was relatively similar. However. there were also cases where the clusters of some web information was different. The results of the research revealed that the cocitation analysis could be applied simultaneously to diachronous analysis in the web information.

Evaluation of Classified Information on Web Agent Using Fuzzy Theory

  • Kim Doo-Ywan;Kim Tae-Ywan
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.5 no.3
    • /
    • pp.216-221
    • /
    • 2005
  • The rapid growth and spread of the World Wide Web has made it possible to easily access a variety of useful information. It is, however, very difficult to retrieve, manage, and use the desired information in web. Various kinds of systems such as Search engines, MetaSearch engines, Spiders, Softbots, Intelligent Agents or Web Agents have been developed by a large number of researchers and companies. Those systems as intelligent agent are employed to avoid the overload of information. To efficiently improve the Software Agents, it is necessary to represent and classify the retrieved data. And to improve performance of the Intelligent Agents to create the classification, it is offered how to evaluate the propriety with other information retrieved from the Web and to recommend to the user the most suitable information.

Improving Performance of Search Engine Using Category based Evaluation (범주 기반 평가를 이용한 검색시스템의 성능 향상)

  • Kim, Hyung-Il;Yoon, Hyun-Nim
    • The Journal of the Korea Contents Association
    • /
    • v.13 no.1
    • /
    • pp.19-29
    • /
    • 2013
  • In the current Internet environment where there is high space complexity of information, search engines aim to provide accurate information that users want. But content-based method adopted by most of search engines cannot be used as an effective tool in the current Internet environment. As content-based method gives different weights to each web page using morphological characteristics of vocabulary, the method has its drawbacks of not being effective in distinguishing each web page. To resolve this problem and provide useful information to the users, this paper proposes an evaluation method based on categories. Category-based evaluation method is to extend query to semantic relations and measure the similarity to web pages. In applying weighting to web pages, category-based evaluation method utilizes user response to web page retrieval and categories of query and thus better distinguish web pages. The method proposed in this paper has the advantage of being able to effectively provide the information users want through search engines and the utility of category-based evaluation technique has been confirmed through various experiments.

An Improved Approach to Ranking Web Documents

  • Gupta, Pooja;Singh, Sandeep K.;Yadav, Divakar;Sharma, A.K.
    • Journal of Information Processing Systems
    • /
    • v.9 no.2
    • /
    • pp.217-236
    • /
    • 2013
  • Ranking thousands of web documents so that they are matched in response to a user query is really a challenging task. For this purpose, search engines use different ranking mechanisms on apparently related resultant web documents to decide the order in which documents should be displayed. Existing ranking mechanisms decide on the order of a web page based on the amount and popularity of the links pointed to and emerging from it. Sometime search engines result in placing less relevant documents in the top positions in response to a user query. There is a strong need to improve the ranking strategy. In this paper, a novel ranking mechanism is being proposed to rank the web documents that consider both the HTML structure of a page and the contextual senses of keywords that are present within it and its back-links. The approach has been tested on data sets of URLs and on their back-links in relation to different topics. The experimental result shows that the overall search results, in response to user queries, are improved. The ordering of the links that have been obtained is compared with the ordering that has been done by using the page rank score. The results obtained thereafter shows that the proposed mechanism contextually puts more related web pages in the top order, as compared to the page rank score.

An Exploratory Study of Performances between a Subject Directory and Keyword Search Engine in the Network Databases (네트웍 데이터베이스에서의 주제별 디렉토리와 키워드 검색엔진의 검색효율에 관한 탐색적 연구)

  • Lee Myeong-Hee
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.31 no.2
    • /
    • pp.177-197
    • /
    • 1997
  • The study measured whether two search engines retrieve different Web documents for 6 queries. Two different search engines, Alta Vista in terms of keyword search engines and Yahoo in terms of subject directory engines were measured using as criteria, total number of documents retrieved, total number of relevant documents retrieved, recall and precision ratios. In addition, Alta Vista was suitable for specific and technical terms, while Yahoo was effective for general and plain terms. However, more elaborate research needs to be tested in terms of query characteristics.

  • PDF

Design for RDF-based Semantic Web System (RDF 기반 시맨틱 웹 시스템 설계)

  • Lee, Jong-Won;Jang, Ki-Man;Kim, Kyng-Hwan;Yang, Xitong;Jung, Hoe-Kyung
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2014.10a
    • /
    • pp.684-686
    • /
    • 2014
  • It is difficult to effectively search and data management due to the increasing number of web is now. While Semantic Web technologies and the development of next-generation wepin this as a way to overcome them, and monopolize the domestic utilization is not overwhelming introduction to the Semantic Web technology is being used in existing search engines. This causes the development of the Semantic Web is becoming slower, and reluctant to use the Semantic Web users who use search engines as well. In this paper, compared to the currently used web and the next generation of the web, and why utilization is low compared to the search engine you are using an existing Web technology that uses the Semantic Web technology is a search engine, what research was that the inefficient because, as a RDF-based Semantic suggest how to improve the efficiency solved by designing the web.

  • PDF