• Title/Summary/Keyword: Information search engine


A Study on Optimized Information Search Algorithm Using Java (Java를 이용한 정보 검색 최적화 알고리즘에 관한 연구)

  • 김용호;정종근;이윤배
    • Journal of the Korea Institute of Information and Communication Engineering, v.6 no.6, pp.797-804, 2002
  • As Internet use has become widespread, centered on multimedia-based WWW (World Wide Web) services, we can acquire a great deal of information that exists across the world's computer networks. Before the Internet became widespread, simply collecting information was the important problem; in modern society, where Internet use is widespread, acquiring correct information rapidly has become the important problem. This paper designs an Internet search engine, examines the structure of Internet search engines by extracting optimized URLs, and secures implementation technology using Java, an object-oriented language. The proposed search engine maintains user convenience by offering keyword search and a simplified user interface. Although the number of sites it searches is small, the bad-link rate of its search results is improved compared with existing domestically produced search engines.

Ontology-based User Customized Search Service Considering User Intention (온톨로지 기반의 사용자 의도를 고려한 맞춤형 검색 서비스)

  • Kim, Sukyoung;Kim, Gunwoo
    • Journal of Intelligence and Information Systems, v.18 no.4, pp.129-143, 2012
  • Recently, the rapid progress of standardized web technologies and the worldwide proliferation of web users have brought an explosive increase in the production and consumption of information documents on the web. In addition, most companies produce, share, and manage a huge number of information documents needed to run their businesses. They also collect, store, and manage, at their own discretion, many web documents published on the web for business purposes. As the volume of information documents that companies must manage has grown, so has the need for a solution that locates documents more accurately among a huge number of information sources. To satisfy this need for accurate search, the market for search engine solutions is steadily expanding. The most important of the many functions a search engine provides is locating accurate information documents within huge information sources. The major metric for evaluating the accuracy of a search engine is relevance, which consists of two measures, precision and recall. Precision is a measure of exactness, that is, what percentage of the information returned as true answers actually are true answers, whereas recall is a measure of completeness, that is, what percentage of the true answers are retrieved. The two measures can be weighted differently according to the application domain. If we need to search exhaustively, as with patent documents and research papers, it is better to increase recall. On the other hand, when the amount of information is small, it is better to increase precision. Most existing web search engines use a keyword search method that returns web documents containing keywords that correspond to the search words entered by a user. This method has the virtue of locating all such web documents quickly, even when many search words are entered. However, it has the fundamental limitation of not considering the user's search intention, so it retrieves irrelevant results along with relevant ones. It therefore takes additional time and effort to sort the relevant results out from everything the search engine returns. That is, the keyword search method can increase recall, but it makes it difficult to locate the web documents a user actually wants, because it provides no means of understanding the user's intention and reflecting it in the search process. This research therefore suggests a new method that combines an ontology-based search solution with the core search functionality provided by existing search engine solutions. The method enables a search engine to provide optimal search results by inferring the user's search intention. To that end, we build an ontology that contains the concepts of a specific domain and the relationships among them. The ontology is used to infer synonyms of the search keywords entered by a user, so that the user's search intention is reflected in the search process more actively than in existing search engines. Based on the proposed method, we implement a prototype search system and test it in the patent domain, where we experiment with searching for relevant documents associated with a patent. The experiment shows that our system increases both recall and precision and improves search productivity through an improved user interface that lets a user interact with the search system effectively. In future research, we will study ways of validating the performance of our prototype system by comparing it with other search engine solutions, and we will extend the application domain to other information search domains such as portals.
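The precision and recall measures defined in the abstract above can be illustrated with a minimal sketch. The function name `precision_recall` and the document identifiers are hypothetical, chosen only for illustration; this is not the evaluation code used in the paper.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query.

    retrieved: documents the search engine returned (hypothetical IDs).
    relevant:  documents that are true answers for the query.
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = retrieved & relevant  # returned documents that are true answers

    # Precision: share of the returned documents that are true answers.
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    # Recall: share of the true answers that were returned.
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Toy example: 4 documents returned, 3 true answers, 2 of them found.
p, r = precision_recall(["d1", "d2", "d3", "d4"], ["d1", "d2", "d5"])
print(f"precision={p:.2f} recall={r:.2f}")
```

Returning more documents can only keep recall the same or raise it, but it tends to lower precision, which is the trade-off the abstract describes for keyword search.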

Spamming page filtering algorithm using Web structure management (Web Structure Management기법을 이용한 Spamming page filtering algorithm)

  • 신광섭;이우기;강석호
    • Proceedings of the Korean Information Science Society Conference, 2004.04b, pp.238-240, 2004
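  • With the development of information and communication technology, an enormous amount of information is stored and shared through the World Wide Web. In particular, when users want to obtain the information they need through the WWW, the tool they use most is the Web search engine. However, because of inaccuracies in the Web search engine algorithms themselves and maliciously crafted Web pages, search engine results sometimes fail to match what the user is looking for. Focusing on the Web structure management technique among the various Web search algorithms, this paper analyzes its problems and presents a modified algorithm that can resolve them. Finally, it demonstrates the argument by illustrating how the proposed algorithm filters spamming pages.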


Analysis of the Optimal Degree of Search Result Modification (검색결과의 최적 조정 비율 분석)

  • Woo, Soohan;Lee, Eun Hee;Kim, Kihoon
    • Journal of the Korean Operations Research and Management Science Society, v.39 no.3, pp.133-144, 2014
  • Naver, a leading search engine in South Korea, may show modified and reorganized search results for some trendy and popular keywords; when popular words such as the titles of soap operas and films are searched for, all the detailed and well-organized information regarding them can be presented. Recognizing that search engines may modify and reorganize search results for some popular keywords, we mathematically model the impact of the degree of search result modification on the search engine's profit in order to derive the optimal degree of modification. We show how the optimal degree of search result modification may change according to the shape of the search engine's advertising revenue function.

Classification of Web Search Engines and Necessity of a Hybrid Search Engine (웹 검색엔진 분류 및 하이브리드 검색엔진의 필요성)

  • Paik, Juryon
    • Journal of Digital Contents Society, v.19 no.4, pp.719-729, 2018
  • In 2017, Google was reported to hold more than 90% of the search engine market share on both desktops and mobile devices. Most people may assume that Google searches the entire Web. However, according to many studies of web data, Google surprisingly searches less than 10% of it. Most of the rest is called the Deep Web, and it is indexable by special search engines, which differ from Google in that they focus on a specific segment of interest. Those engines build their own deep-web databases and run particular algorithms to provide accurate, professional search results. Currently, no search engine indexes the entire Web. The best approach is to use several search engines together so that searches are as broad and efficient as possible. This paper defines that kind of search engine as a Hybrid Search Engine and describes its characteristics and its differences from conventional search engines, along with a framework for a hybrid search engine.
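The idea of using several search engines together can be sketched as follows. This is only an illustration under stated assumptions, not the framework proposed in the paper: each engine is modeled as a hypothetical callable that takes a query and returns a ranked list of URLs, and the merge strategy (rank-by-rank interleaving with duplicate removal) is an assumption chosen for simplicity.

```python
from itertools import zip_longest


def merge_results(result_lists):
    """Interleave ranked result lists rank by rank, dropping duplicate URLs."""
    merged, seen = [], set()
    # zip_longest pads shorter lists with None so every list is fully consumed.
    for tier in zip_longest(*result_lists):
        for url in tier:
            if url is not None and url not in seen:
                seen.add(url)
                merged.append(url)
    return merged


def hybrid_search(query, engines):
    """Send the query to every engine (hypothetical callables) and merge."""
    return merge_results([engine(query) for engine in engines])


# Stand-in engines: a general-purpose one and a specialized deep-web one.
def general_engine(query):
    return ["g1", "shared", "g2"]


def deep_web_engine(query):
    return ["shared", "d1"]


print(hybrid_search("patent search", [general_engine, deep_web_engine]))
```

Interleaving by rank keeps the top result of every engine near the top of the merged list; a real system would likely also weight engines by how well they cover the query's subject area.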

A Study on the Crawling and Classification Strategy for Local Website (로컬 웹사이트의 탐색전략과 웹사이트 유형분석에 관한 연구)

  • Hwang In-Soo
    • Journal of Information Technology Applications and Management, v.13 no.2, pp.55-65, 2006
  • Since the World-Wide Web (WWW) has become a major channel for information delivery, information overload has also become a serious problem for Internet users. Effective information searching is therefore critical to the success of Internet services. We present an integrated search engine for finding relevant web pages on the WWW within a certain Internet domain. It supports local search on web sites. The spider obtains all of the web pages from the web sites by following web links, and it operates autonomously without any human supervision. We developed a state transition diagram to control navigation and analyze the link structure of each web site. We have implemented the integrated local search engine, and a user evaluation shows that it achieves higher satisfaction and higher precision.
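A spider that collects the pages of one site by following links can be sketched as a breadth-first traversal restricted to the starting host. This is a minimal sketch, not the paper's implementation: it does not reproduce the state transition diagram mentioned in the abstract, and `fetch_links` is a hypothetical callable that returns the link targets found on a page (here backed by an in-memory dictionary instead of real HTTP requests).

```python
from collections import deque
from urllib.parse import urljoin, urlparse


def crawl_site(start_url, fetch_links, max_pages=100):
    """Breadth-first crawl that stays on the host of start_url.

    fetch_links(url) is a hypothetical callable returning the href
    values found on that page; it stands in for download and parsing.
    """
    host = urlparse(start_url).netloc
    seen = {start_url}
    queue = deque([start_url])
    visited = []
    while queue and len(visited) < max_pages:
        url = queue.popleft()
        visited.append(url)
        for href in fetch_links(url):
            link = urljoin(url, href)  # resolve relative links
            # Follow only unseen links that stay on the same host.
            if urlparse(link).netloc == host and link not in seen:
                seen.add(link)
                queue.append(link)
    return visited


# Toy in-memory site used instead of real network access.
SITE = {
    "http://example.com/": ["/a", "/b", "http://other.org/x"],
    "http://example.com/a": ["/b", "/c"],
    "http://example.com/b": ["/"],
    "http://example.com/c": [],
}

pages = crawl_site("http://example.com/", lambda url: SITE.get(url, []))
print(pages)
```

The `seen` set prevents the spider from looping on cyclic links, and the host check keeps it inside the local site, so the link to `other.org` is never followed.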


Selection of Search Engine and the number of documents in Meta Search Engine to reduce network traffic (메타서치엔진에서 네트워크의 트래픽을 줄이기 위한 검색엔진의 선택 및 검색문서의 수 결정)

  • 이진호;박선진;박상호;남인길
    • Journal of Korea Society of Industrial Information Systems, v.4 no.4, pp.100-110, 1999
  • This paper proposes a decision method for selecting the search engines and the number of returned documents in a meta search engine, which could reduce network traffic while maintaining the precision ratio. Experiments are performed to evaluate the proposed scheme using currently popular search engines and the most frequently used queries.


Improved Piracy Site Detection Technique using Search Engine

  • Kim, Eui-Jin;Kim, Deuk-Hun;Kwak, Jin
    • KSII Transactions on Internet and Information Systems (TIIS), v.16 no.7, pp.2459-2472, 2022
  • With the increase in exports of copyrighted content to overseas markets due to the recent globalization of Korean culture, the added value of the Korean digital content market is increasing at a significant rate. As the copyright market grows, various piracy sites have emerged that generate profits by illegally distributing works without the permission of the copyright holders, causing direct and indirect damage to those holders. The existing copyright detection methods used by public institutions to address this problem are limited, while the piracy sites are ever-changing, so methods are continuously being developed to achieve better detection results. Because infringement sites constantly change their domains, a search engine can be used to detect the latest infringement site domain. This paper proposes an improved piracy site detection method using a search engine to prevent the damage caused by piracy sites.

Analysis of Search Engine Use, Search Behaviors and Aptitude by Web Users (웹 이용자의 검색엔진 활용 및 탐색 행위와 성향 분석)

  • Rieh, Hae-Young
    • Journal of the Korean Society for Library and Information Science, v.36 no.3, pp.69-91, 2002
  • This study examines the overall user experience associated with Web search engine use, including selection, usage of search features, and evaluation. The data were collected through individual interviews with 28 faculty members and graduate students. It was found that users tend to select a search engine based on experience with and knowledge of certain features and on familiarity with the engine itself, more than on previous experience with search results. The results showed that users had mixed opinions regarding cross-language retrieval, and that they did not believe the use of operators affects the search results. It appears that users are interested in interface design as well as the accuracy of search results.

FAST Search Engine Customizing for S&T Information Service (고객중심의 과학기술정보 서비스를 위한 FAST 검색엔진 커스터마이징)

  • Han, Hee-Jun;Yi, Tae-Seok;Kim, Sun-Tae;Yae, Yong-Hee;Lee, Sang-Gi;Yeo, Il-Yoen
    • Proceedings of the Korea Contents Association Conference, 2008.05a, pp.480-483, 2008
  • As web technology develops, data providers are trying to offer efficient services to their customers. In particular, it is necessary to improve the efficiency of the search function to help users easily access the useful information they want. KISTI has introduced and customized the FAST search engine to improve the search performance of the national science and technology information portal service system. Above all, however, the design work for the hardware and software implementation of the search engine is important. In this paper, we discuss the design and customization of the FAST engine for the KISTI S&T information search service.
