• Title/Summary/Keyword: 검색모형

Search Result 388, Processing Time 0.039 seconds

A study on improving the effectiveness of a boolean retrieval system with feedback information (피드백 정보를 이용한 불논리 검색 시스템의 성능 증진에 관한 실험적 연구)

  • 신은자;정영미
    • Journal of the Korean Society for information Management
    • /
    • v.15 no.1
    • /
    • pp.129-148
    • /
    • 1998
  • The objective of this study is to develop a useful relevance feedback retrieval technique that can be applied to the current Boolean retrieval system. A feedback retrieval technique based on user model is recommended here to achieve this objective. To prove the usefulness of this feedback retrieval technique, two enhanced Boolean retrieval models including DNF model and P-norm model were evaluated first through retrieval effectiveness experiments. After selecting DNF model as the retrieval model, two feedback retrieval experiments were performed using initial and extended user models. It is proved that the feedback retrieval based on user model can greatly enhance the effectiveness of a Boolean retrieval system with a small modification.

  • PDF

Interactive Information Retrieval (IR) Models: Tradition and Development (인터액티브 정보검색 모형)

  • Kim, Yang-Woo
    • Journal of the Korean Society for information Management
    • /
    • v.24 no.2
    • /
    • pp.45-69
    • /
    • 2007
  • This paper is divided into two parts. The first part elaborates on four Information Retrieval (IR) models: a traditional IR model and three more recent, user-oriented models of It interaction presented by Belkin, Ingwersen, and Saracevic. The strengths and limitations of each model are discussed. The second part, based on an analysis of the previous models, presents the author's interactive model, namely, the Iceberg Model. The rationales that are given to explain the design of this model are associated with the following: a greater specificity of system attributes; more concrete interplays among different components of IR interaction; and, the increased role of the Human Information Intermediary (HII). In sum, the new model presents a framework that can evolve in varying information-seeking contexts.

Application of the 2-Poisson Model to Full-Text Information Retrieval System (2-포아송 모형의 전문검색시스템 응용에 관한 연구)

  • 문성빈
    • Journal of the Korean Society for information Management
    • /
    • v.16 no.3
    • /
    • pp.49-63
    • /
    • 1999
  • The purpose of this study is to investigate whether the terms in queries are distributed according to the 2-Poisson model in the documents represented by abstract/title or full-text. In this study, retrieval experiments using Binary independence and 2-Poisson independence model, which are based on the probabilistic theory, were conducted to see if the 2-Poisson distribution of the query terms has an influence on the retrieval effectiveness, particularly of full-text information retrieval system.

  • PDF

Comparative Usefulness of Naver and Google Search Information in Predictive Models for Youth Unemployment Rate in Korea (한국 청년실업률 예측 모형에서 네이버와 구글 검색 정보의 유용성 분석)

  • Jung, Jae Un
    • Journal of Digital Convergence
    • /
    • v.16 no.8
    • /
    • pp.169-179
    • /
    • 2018
  • Recently, web search query information has been applied in advanced predictive model research. Google dominates the global web search market in the Korean market; however, Naver possesses a dominant market share. Based on this characteristic, this study intends to compare the utility of the Korean web search query information of Google and Naver using predictive models. Therefore, this study develops three time-series predictive models to estimate the youth unemployment rate in Korea using the ARIMA model. Model 1 only used the youth unemployment rate in Korea, whereas Models 2 and 3 added the Korean web search query information of Naver and Google, respectively, to Model 1. Compared to the predictability of the models during the training period, Models 2 and 3 showed better fit compared with Model 1. Models 2 and 3 correlated different query information. During predictive periods 1 (continuous with the training period) and 2 (discontinuous with the training period), Model 3 showed the best performance. During predictive period 2, only Model 3 exhibited a significant prediction result. This comparative study contributes to a general understanding of the usefulness of Korean web query information using the Naver and Google search engines.

Analysis of Highway Traffic Indices Using Internet Search Data (검색 트래픽 정보를 활용한 고속도로 교통지표 분석 연구)

  • Ryu, Ingon;Lee, Jaeyoung;Park, Gyeong Chul;Choi, Keechoo;Hwang, Jun-Mun
    • Journal of Korean Society of Transportation
    • /
    • v.33 no.1
    • /
    • pp.14-28
    • /
    • 2015
  • Numerous research has been conducted using internet search data since the mid-2000s. For example, Google Inc. developed a service predicting influenza patterns using the internet search data. The main objective of this study is to prove the hypothesis that highway traffic indices are similar to the internet search patterns. In order to achieve this objective, a model to predict the number of vehicles entering the expressway and space-mean speed was developed and the goodness-of-fit of the model was assessed. The results revealed several findings. First, it was shown that the Google search traffic was a good predictor for the TCS entering traffic volume model at sites with frequent commute trips, and it had a negative correlation with the TCS entering traffic volume. Second, the Naver search traffic was utilized for the TCS entering traffic volume model at sites with numerous recreational trips, and it was positively correlated with the TCS entering traffic volume. Third, it was uncovered that the VDS speed had a negative relationship with the search traffic on the time series diagram. Lastly, it was concluded that the transfer function noise time series model showed the better goodness-of-fit compared to the other time series model. It is expected that "Big Data" from the internet search data can be extensively applied in the transportation field if the sources of search traffic, time difference and aggregation units are explored in the follow-up studies.

Typology of Retrieval Systems based on the Degree of Connections between Systems and Information Resources: Specific Domain Focus Model (SDFM) for Information Retrieval Interaction (시스템-정보자료 군(群) 연계정도 기반 검색시스템 유형화 - 특정영역 초점 정보검색 상호작용 모형 -)

  • Kim, Yang-woo
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.30 no.2
    • /
    • pp.145-166
    • /
    • 2019
  • While a significant number of user-related models have been presented in Human Information Behavior (HIB) research community, the basic assumption of the present study is most of those models including information interaction models are multi-domain models associated with comprehensive research components. Based on such an assumption, this study discusses the shortcomings of multi-domain models and proposes the need to present a new type of model. Accordingly, the study elaborates four essential models of HIB reach community and presents a new type of model based on Specific Domain Focus Modeling (SDFM). As an example of such modeling, this study presents the present author's information retrieval interaction model based on the degree of connections between systems and information resources.

Design and Evaluation of a Personalized Search Service Model Based on Web Portal User Activities (웹 포털 이용자 로그 데이터에 기반한 개인화 검색 서비스 모형의 설계 및 평가)

  • Lee, So-Young;Chung, Young-Mee
    • Journal of the Korean Society for information Management
    • /
    • v.23 no.4 s.62
    • /
    • pp.179-196
    • /
    • 2006
  • This study proposes an expanded model of personalized search service based on community activities on a Korean Web portal. The model is composed of defining subject categories of users, providing personalized search results, and recommending additional subject categories and queries. Several experiments were performed to verify the feasibility and effectiveness of the proposed model. It was found that users' activities on community services provide valuable data for identifying their Interests, and the personalized search service increases users' satisfaction.

Short-term Predictive Models for Influenza-like Illness in Korea: Using Weekly ILI Surveillance Data and Web Search Queries (한국 인플루엔자 의사환자 단기 예측 모형 개발: 주간 ILI 감시 자료와 웹 검색 정보의 활용)

  • Jung, Jae Un
    • Journal of Digital Convergence
    • /
    • v.16 no.9
    • /
    • pp.147-157
    • /
    • 2018
  • Since Google launched a prediction service for influenza-like illness(ILI), studies on ILI prediction based on web search data have proliferated worldwide. In this regard, this study aims to build short-term predictive models for ILI in Korea using ILI and web search data and measure the performance of the said models. In these proposed ILI predictive models specific to Korea, ILI surveillance data of Korea CDC and Korean web search data of Google and Naver were used along with the ARIMA model. Model 1 used only ILI data. Models 2 and 3 added Google and Naver search data to the data of Model 1, respectively. Model 4 included a common query used in Models 2 and 3 in addition to the data used in Model 1. In the training period, the goodness of fit of all predictive models was higher than 95% ($R^2$). In predictive periods 1 and 2, Model 1 yielded the best predictions (99.98% and 96.94%, respectively). Models 3(a), 4(b), and 4(c) achieved stable predictability higher than 90% in all predictive periods, but their performances were not better than that of Model 1. The proposed models that yielded accurate and stable predictions can be applied to early warning systems for the influenza pandemic in Korea, with supplementary studies on improving their performance.

Estimation of performance for random binary search trees (확률적 이진 검색 트리 성능 추정)

  • 김숙영
    • Journal of the Korea Computer Industry Society
    • /
    • v.2 no.2
    • /
    • pp.203-210
    • /
    • 2001
  • To estimate relational models and test the theoretical hypotheses of binary tree search algorithms, we built binary search trees with random permutations of n (number of nodes) distinct numbers, which ranged from three to seven. Probabilities for building binary search trees corresponding to each possible height and balance factor were estimated. Regression models with variables of number of nodes, height, and average number of comparisons were estimated and the theorem of O(1g(n)) was accepted experimentally by a Lack of Test procedure. Analysis of Variance model was applied to compare the average number of comparisons with three groups by height and balance factor of the trees to test theoretical hypotheses of a binary search tree performance statistically.

  • PDF

Liuux Cluster based Biological Sequence Parallel Processing Model Development and Efficiency Verification (리눅스 클러스터기반 유전자서열분석 병렬처리 모형 개발 및 성능 검증)

  • 박미화;김재우;박춘규;유승식
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2003.04a
    • /
    • pp.106-108
    • /
    • 2003
  • Human Genome Project와 같은 대형 Sequencing 프로젝트와 High-throughput Sequencing 기술의 발전으로 현재 Expressed Sequence Tag (EST)와 같은 대량의 DNA 서열들이 생산되고 있다. 이를 효과적이고 효율적으로 분석해야 할 필요성이 증대되고 있다. 대부분의 실험자들이 서열 분석을 위해 우선적으로 BLAST 검색을 이용하고 있다. 하지만 대량의 서열, 검색 DB의 크기, BLAST 검색 결과의 복잡성에 의해 어려움을 겪고 있다. 이에 빠르고 정리된 결과를 보여줄 수 있는 BLAST 검색 시스템의 필요성이 커지고 있다. 이에 본 논문은 미국 생명공학연구소(NCBI)에서 제공하는 유전자 서열 검색 툴인 BLAST(Basic Logical Alignment Tool)를 클러스터 수퍼 컴퓨터 구축 기술을 기반으로 한 병렬처리와 Gene Ontology를 이용하여 방대한 양의 서열 검색 결과를 요약하는 모형을 제시한다. 이것은 신약개발 및 유전자 발굴 등의 연구기간을 획기적으로 단축시켜 신약 개 발, 농업, 화학, 의료, 환경 등 생명공학 연구에 핵심적인 역할을 할 수 있다. 또한 성능 실험을 통하여 분석결과 대기시간을 최소화하는 병렬처리모형의 효율성을 검증하였다.

  • PDF