• Title/Summary/Keyword: 웹검색엔진

Search Result 10, Processing Time 0.026 seconds

Web Search Engine based on Database Management System (데이터베이스 관리 시스템에 기반한 웹검색엔진의 구현)

  • Kang, Byung-Ju;Lee, Ji-Dong;Choi, Key-Sun
    • Annual Conference on Human and Language Technology
    • /
    • 1997.10a
    • /
    • pp.211-218
    • /
    • 1997
  • 웹검색엔진은 색인되는 웹문서가 많아질수록 시스템 확장성(scalability)이라든지, 데이터베이스 유지 관리의 용이성, 데이터의 안전성 문제, 등의 많은 문제가 웹검색엔진에 부담으로 주어지게 된다. 반면에 인트라넷(intranet)용 검색엔진의 경우는 확장성보다는 검색엔진 자체의 개발의 용이성이 더욱 중요하다. Oracle $ConText^{TM}$는 오라클 사(社의) RDBMS인 $Oracle7^{TM}$의 정보검색 확장 옵션으로 텍스트를 Oracle7의 기본 데이터 타입으로 사용될 수 있게 한다. Oracle7+ConText는 대용량의 문서 베이스와 개발의 용이성을 동시에 보장할 수 있는 매우 훌륭한 웹검색엔진 개발 도구이다. 우리는 이를 검증하기 위하여 Oracle7+ConText에 기반한 WEBSECT(Web Search Engine With ConText)라는 웹검색엔진을 개발하였다. 본 논문은 WEBSECT의 개발과 시험 운영을 통해 데이터베이스에 기반한 웹검색엔진의 우수한 확장성과 텍스트 애플리케이션 개발의 용이성 등을 소개한다.

  • PDF

Analysis and Design for the System of Korean Web Document Classification (웹문서분류체계의 분석 및 새로운 설계)

  • Nam Young-Joon
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.32 no.3
    • /
    • pp.207-230
    • /
    • 1998
  • Because of a rapid increase of information available through web site, a user often falls into confusion of which web sites should be visited for his information needs. If a web site search engine can classify web sites according to their subject or topics, it can help the user to determine which web sites are worth accessing and thus to easily acquire relevant information. In this study, I propose new classifying system with a two level hierarchy and 57 items.

  • PDF

Appraising the Interface Features of Web Search Engines Based on User-defined Relevance Criteria (이용자정의형 적합성 기준을 토대로 한 웹검색엔진 인터페이스 평가)

  • Kim, Yang-Woo
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.22 no.1
    • /
    • pp.247-262
    • /
    • 2011
  • Although research has shown a significant amount of work identifying various dimensions of relevance along with exhaustive lists of relevance criteria, there seem to have been less effort to apply the findings to improve actual systems design. Based on this assumption, this paper investigates to what extent those relevance criteria have been incorporated into the interface features of major commercial Web search engines, suggesting what can/should be done more. Before stepping into the actual system features, this paper compares recent relevance research in Information Science with other human factor studies both in Information Science and its neighboring discipline (HCI), as an attempt to identify studies that are conceptually similar to the relevance research, but not named as such way. Similarities and differences between these studies are presented. Recommendations suggested to support applicable interface features include: 1) further personalization of interface designs; 2) author-supplied meta tags for the Web contents; and 3) extensions of beyond-topical representations based on link structure.

Development of Yóukè Mining System with Yóukè's Travel Demand and Insight Based on Web Search Traffic Information (웹검색 트래픽 정보를 활용한 유커 인바운드 여행 수요 예측 모형 및 유커마이닝 시스템 개발)

  • Choi, Youji;Park, Do-Hyung
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.3
    • /
    • pp.155-175
    • /
    • 2017
  • As social data become into the spotlight, mainstream web search engines provide data indicate how many people searched specific keyword: Web Search Traffic data. Web search traffic information is collection of each crowd that search for specific keyword. In a various area, web search traffic can be used as one of useful variables that represent the attention of common users on specific interests. A lot of studies uses web search traffic data to nowcast or forecast social phenomenon such as epidemic prediction, consumer pattern analysis, product life cycle, financial invest modeling and so on. Also web search traffic data have begun to be applied to predict tourist inbound. Proper demand prediction is needed because tourism is high value-added industry as increasing employment and foreign exchange. Among those tourists, especially Chinese tourists: Youke is continuously growing nowadays, Youke has been largest tourist inbound of Korea tourism for many years and tourism profits per one Youke as well. It is important that research into proper demand prediction approaches of Youke in both public and private sector. Accurate tourism demands prediction is important to efficient decision making in a limited resource. This study suggests improved model that reflects latest issue of society by presented the attention from group of individual. Trip abroad is generally high-involvement activity so that potential tourists likely deep into searching for information about their own trip. Web search traffic data presents tourists' attention in the process of preparation their journey instantaneous and dynamic way. So that this study attempted select key words that potential Chinese tourists likely searched out internet. Baidu-Chinese biggest web search engine that share over 80%- provides users with accessing to web search traffic data. Qualitative interview with potential tourists helps us to understand the information search behavior before a trip and identify the keywords for this study. Selected key words of web search traffic are categorized by how much directly related to "Korean Tourism" in a three levels. Classifying categories helps to find out which keyword can explain Youke inbound demands from close one to far one as distance of category. Web search traffic data of each key words gathered by web crawler developed to crawling web search data onto Baidu Index. Using automatically gathered variable data, linear model is designed by multiple regression analysis for suitable for operational application of decision and policy making because of easiness to explanation about variables' effective relationship. After regression linear models have composed, comparing with model composed traditional variables and model additional input web search traffic data variables to traditional model has conducted by significance and R squared. after comparing performance of models, final model is composed. Final regression model has improved explanation and advantage of real-time immediacy and convenience than traditional model. Furthermore, this study demonstrates system intuitively visualized to general use -Youke Mining solution has several functions of tourist decision making including embed final regression model. Youke Mining solution has algorithm based on data science and well-designed simple interface. In the end this research suggests three significant meanings on theoretical, practical and political aspects. Theoretically, Youke Mining system and the model in this research are the first step on the Youke inbound prediction using interactive and instant variable: web search traffic information represents tourists' attention while prepare their trip. Baidu web search traffic data has more than 80% of web search engine market. Practically, Baidu data could represent attention of the potential tourists who prepare their own tour as real-time. Finally, in political way, designed Chinese tourist demands prediction model based on web search traffic can be used to tourism decision making for efficient managing of resource and optimizing opportunity for successful policy.

Internet Search Engine: Technological Mode that Draws User's Attention to Make Its Expertise Reinforce (인터넷 검색엔진: 사용자의 관심을 흡수하여 전문성을 강화하는 기술)

  • Kim, Ji Yeon
    • Journal of Science and Technology Studies
    • /
    • v.13 no.1
    • /
    • pp.181-216
    • /
    • 2013
  • This paper tries to analyze technologies of search engine generally, and reveal the additional modes of Korean search engine at the same time. Recently it said that search engine becomes a self-moving and is getting more strong power than the former one existed. There are many difference interpretative views from technological determination to instrumentalism surrounding this system. Search engine invents the technological mode that draws user's attention to make its own expertise reinforce. It is stemmed from the rationality of its own. Especially Korean search engine exposed unique mutation as self-proliferation of it during past a decade, as for example "related keyword" or "real-time popular keyword" service. Its automatic decision aroused democracy matter, now it is not only web guide. How we do make it to serve in democracy, accepting the independent expertise of it simultaneously? We might find new prospect when focusing on interactional modality between engine and human actor, instead counting both as a separate one.

  • PDF

User Satisfaction related Perception of the Web Portal for Scholarly Information: Focused on the Academic Version of NAVER Search Engine (학술정보포털에 대한 이용자만족 관련 인식에 관한 연구 - NAVER 전문정보의 학술자료 검색 기능을 중심으로 -)

  • Kim, Yang-Woo
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.51 no.2
    • /
    • pp.255-279
    • /
    • 2017
  • In a qualitative approach, this study investigated users' perceptions associated with their satisfactions in the process of using the scholarly resource search functions of the academic version of the NAVER search engine. For this study, the data was collected from a group of undergraduate students, who conducted academic information searches in the field of own major disciplinary areas, using the Web portal. Based on the data, students' satisfactions and dissatisfactions along with the reasons of their perceptions were analyzed. The results presented users' perceptions in various evaluation criteria based on the three major domains: system interfaces, retrieval mechanisms and search results. Based on the results, the study proposed the following suggestions: 1) the enhancements of the system interfaces and HELP guidances based the limited user knowledge on basic system terminologies 2) the improvements of the retrieval mechanisms associated with understanding the contexts of the search terms presented by users 3) the necessity of the user education due to the insufficient user knowledge of the retrieval mechanisms and the search functions.

Examining Categorical Transition and Query Reformulation Patterns in Image Search Process (이미지 검색 과정에 나타난 질의 전환 및 재구성 패턴에 관한 연구)

  • Chung, Eun-Kyung;Yoon, Jung-Won
    • Journal of the Korean Society for information Management
    • /
    • v.27 no.2
    • /
    • pp.37-60
    • /
    • 2010
  • The purpose of this study is to investigate image search query reformulation patterns in relation to image attribute categories. A total of 592 sessions and 2,445 queries from the Excite Web search engine log data were analyzed by utilizing Batley's visual information types and two facets and seven sub-facets of query reformulation patterns. The results of this study are organized with two folds: query reformulation and categorical transition. As the most dominant categories of queries are specific and general/nameable, this tendency stays over various search stages. From the perspective of reformulation patterns, while the Parallel movement is the most dominant, there are slight differences depending on initial or preceding query categories. In examining categorical transitions, it was found that 60-80% of search queries were reformulated within the same categories of image attributes. These findings may be applied to practice and implementation of image retrieval systems in terms of assisting users' query term selection and effective thesauri development.

An Exploratory Study of Information Search Behaviors of International Students in Korea (국내 거주 외국인 유학생의 정보검색행위에 관한 탐색적 연구)

  • Yoon, JungWon
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.33 no.1
    • /
    • pp.259-277
    • /
    • 2022
  • This study aims to understand international students' web search behaviors. During the experiment, fifteen international students were asked to conduct three search tasks which includes six search questions. Depending on the characteristics of search task, there were differences in search performance and search behavior. It was commonly found that participants with higher Korean fluency showed higher search performance; however, prior knowledge about the search topic did not always affect the search performance. In the search tasks that required navigation through menus and links within one web domain, participants often overlooked the correct answers, even if they were at the webpages containing the correct answer. Also, some participants did not realized that they found wrong answers. For enhancing information seeking behaviors among foreigners in Korea, the followings were suggested: 1) to design websites which are easy for non-native speakers to navigate, and 2) to use social media as a means of interactive communication.

Representation of ambiguous word in Latent Semantic Analysis (LSA모형에서 다의어 의미의 표상)

  • 이태헌;김청택
    • Korean Journal of Cognitive Science
    • /
    • v.15 no.2
    • /
    • pp.23-31
    • /
    • 2004
  • Latent Semantic Analysis (LSA Landauer & Dumais, 1997) is a technique to represent the meanings of words using co-occurrence information of words appearing in he same context, which is usually a sentence or a document. In LSA, a word is represented as a point in multidimensional space where each axis represents a context, and a word's meaning is determined by its frequency in each context. The space is reduced by singular value decomposition (SVD). The present study elaborates upon LSA for use of representation of ambiguous words. The proposed LSA applies rotation of axes in the document space which makes possible to interpret the meaning of un. A simulation study was conducted to illustrate the performance of LSA in representation of ambiguous words. In the simulation, first, the texts which contain an ambiguous word were extracted and LSA with rotation was performed. By comparing loading matrix, we categorized the texts according to meanings. The first meaning of an ambiguous wold was represented by LSA with the matrix excluding the vectors for the other meaning. The other meanings were also represented in the same way. The simulation showed that this way of representation of an ambiguous word can identify the meanings of the word. This result suggest that LSA with axis rotation can be applied to representation of ambiguous words. We discussed that the use of rotation makes it possible to represent multiple meanings of ambiguous words, and this technique can be applied in the area of web searching.

  • PDF

Exploring the Effects of Task Language and Complexity in College Students' Web Searching (질의 언어 및 복잡성이 대학생의 웹 정보탐색에 미치는 영향에 관한 연구)

  • Shim, Wonsik;Ahn, Hye-yeon;Byun, Jeayeon
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.49 no.2
    • /
    • pp.51-73
    • /
    • 2015
  • The Web now provides instant access to an unprecedented amount of information that was unthinkable even 20-30 years ago. However, the full potential of the contents available through the Internet can only be realized when one can speak and understand foreign languages, especially English which accounts for more than half of web contents. In this study, we try to investigate the effect of search task languages and task complexity on searching performance. A total of thirty students enrolled at a top private university in Korea were recruited as study subjects. We set up a quasi-experimental design in which thirty subjects are randomly assigned to a set of eight different search tasks containing an equal number of simple and complex tasks and an equal number of tasks in Korean and in English. The results show that there is a significant difference between simple and complex tasks in terms of SERP time, number of queries used, correctness of results and total search time. However, task language does not seem to have affected search performance for this study group. In addition, students with high English proficiency test scores show comparable search performance in English tasks compared with lower test scores. But we note differences in behavioral patterns (different search engines used and search tactics) among the study participants.