• Title/Summary/Keyword: Web search engines

Search Result 210, Processing Time 0.033 seconds

The Relationship between Internet Search Volumes and Stock Price Changes: An Empirical Study on KOSDAQ Market (개별 기업에 대한 인터넷 검색량과 주가변동성의 관계: 국내 코스닥시장에서의 산업별 실증분석)

  • Jeon, Saemi;Chung, Yeojin;Lee, Dongyoup
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.2
    • /
    • pp.81-96
    • /
    • 2016
  • As the internet has become widespread and easy to access everywhere, it is common for people to search information via online search engines such as Google and Naver in everyday life. Recent studies have used online search volume of specific keyword as a measure of the internet users' attention in order to predict disease outbreaks such as flu and cancer, an unemployment rate, and an index of a nation's economic condition, and etc. For stock traders, web search is also one of major information resources to obtain data about individual stock items. Therefore, search volume of a stock item can reflect the amount of investors' attention on it. The investor attention has been regarded as a crucial factor influencing on stock price but it has been measured by indirect proxies such as market capitalization, trading volume, advertising expense, and etc. It has been theoretically and empirically proved that an increase of investors' attention on a stock item brings temporary increase of the stock price and the price recovers in the long run. Recent development of internet environment enables to measure the investor attention directly by the internet search volume of individual stock item, which has been used to show the attention-induced price pressure. Previous studies focus mainly on Dow Jones and NASDAQ market in the United States. In this paper, we investigate the relationship between the individual investors' attention measured by the internet search volumes and stock price changes of individual stock items in the KOSDAQ market in Korea, where the proportion of the trades by individual investors are about 90% of the total. In addition, we examine the difference between industries in the influence of investors' attention on stock return. The internet search volume of stocks were gathered from "Naver Trend" service weekly between January 2007 and June 2015. The regression model with the error term with AR(1) covariance structure is used to analyze the data since the weekly prices in a stock item are systematically correlated. The market capitalization, trading volume, the increment of trading volume, and the month in which each trade occurs are included in the model as control variables. The fitted model shows that an abnormal increase of search volume of a stock item has a positive influence on the stock return and the amount of the influence varies among the industry. The stock items in IT software, construction, and distribution industries have shown to be more influenced by the abnormally large internet search volume than the average across the industries. On the other hand, the stock items in IT hardware, manufacturing, entertainment, finance, and communication industries are less influenced by the abnormal search volume than the average. In order to verify price pressure caused by investors' attention in KOSDAQ, the stock return of the current week is modelled using the abnormal search volume observed one to four weeks ahead. On average, the abnormally large increment of the search volume increased the stock return of the current week and one week later, and it decreased the stock return in two and three weeks later. There is no significant relationship with the stock return after 4 weeks. This relationship differs among the industries. An abnormal search volume brings particularly severe price reversal on the stocks in the IT software industry, which are often to be targets of irrational investments by individual investors. An abnormal search volume caused less severe price reversal on the stocks in the manufacturing and IT hardware industries than on average across the industries. The price reversal was not observed in the communication, finance, entertainment, and transportation industries, which are known to be influenced largely by macro-economic factors such as oil price and currency exchange rate. The result of this study can be utilized to construct an intelligent trading system based on the big data gathered from web search engines, social network services, and internet communities. Particularly, the difference of price reversal effect between industries may provide useful information to make a portfolio and build an investment strategy.

Contents Analysis and Synthesis Scheme for Music Album Cover Art

  • Moon, Dae-Jin;Rho, Seung-Min;Hwang, Een-Jun
    • Journal of IKEEE
    • /
    • v.14 no.4
    • /
    • pp.305-311
    • /
    • 2010
  • Most recent web search engines perform effective keyword-based multimedia contents retrieval by investigating keywords associated with multimedia contents on the Web and comparing them with query keywords. On the other hand, most music and compilation albums provide professional artwork as cover art that will be displayed when the music is played. If the cover art is not available, then the music player just displays some dummy or random images, but this has been a source of dissatisfaction. In this paper, in order to automatically create cover art that is matched with music contents, we propose a music album cover art creation scheme based on music contents analysis and result synthesis. We first (i) analyze music contents and their lyrics and extract representative keywords, (ii) expand the keywords using WordNet and generate various queries, (iii) retrieve related images from the Web using those queries, and finally (iv) synthesize them according to the user preference for album cover art. To show the effectiveness of our scheme, we developed a prototype system and reported some results.

Design and Implementation of a High Performance Web Crawler (고성능 웹크롤러의 설계 및 구현)

  • Kim Hie-Cheol;Chae Soo-Hoan
    • Journal of Digital Contents Society
    • /
    • v.4 no.2
    • /
    • pp.127-137
    • /
    • 2003
  • A Web crawler is an important Internet software technology used in a variety of Internet application software which includes search engines. As Internet continues to grow, implementations of high performance web crawlers are urgently demanded. In this paper, we study how to support dynamic scheduling for a multiprocess-based web crawler. For high performance, web crawlers are usually based on multiprocess in their implementations. In these systems, crawl scheduling which manages the allocation of web pages to each process for loading is one of the important issues. In this paper, we identify issues which are important and challenging in the crawl scheduling. To address the issue, we propose a dynamic crawl scheduling framework and subsequently a system architecture for a web crawler with dynamic crawl scheduling support. This paper presents the design of the Web crawler with dynamic scheduling support.

  • PDF

Concept Classification System of Jeju Oreum based on Web Search (웹 검색 기반으로 한 제주 오름의 콘셉트 분류 시스템)

  • Ahn, Jinhyun;Byun, So-Young;Woo, Seo-Jung;An, Ye-Ji;Kang, Jungwoon;Kim, Mincheol
    • Journal of Digital Convergence
    • /
    • v.19 no.8
    • /
    • pp.235-240
    • /
    • 2021
  • Currently, the number of visitors to Oreum is increasing and the trend of tourism is changing rapidly. The motivation for visiting Oreum is also changing from relaxation and pleasure to experiences. In line with this change, people visit the mountain by selecting motivation such as marriage and family photos, not just exercise. However, it is difficult to search for an Oreum that matches the tourists' motivation. In order to solve these problems, we proposed a system that provides the association between Oreum and concept based on the number of search results from web search engines in real time. User can select the desired date to check the associations for past or selected periods and concepts. Through this research, visitors to Oreum, Jeju's natural heritage, can contribute to the development of tourism in Jeju. In the future, the concept of visiting beaches or seas, not just Jeju Oreum, can be provided. In this work, search results from websites are collected, stored in a database, and search results of Oreum and concept are provided on the homepage to classify Oreum trends.

Blog Search Method using User Relevance Feedback and Guru Estimation (사용자 적합성 피드백과 구루 평가 점수를 고려한 블로그 검색 방법)

  • Jeong, Kyung-Seok;Park, Hyuk-Ro
    • The KIPS Transactions:PartB
    • /
    • v.15B no.5
    • /
    • pp.487-492
    • /
    • 2008
  • Most Web search engines use ranking methods that take both the relevancy and the importance of documents into consideration. The importance of a document denotes the degree of usefulness of the document to general users. One of the most successful methods for estimating the importance of a document has been Page-Rank algorithm which uses the hyperlink structure of the Web for the estimation. In this paper, we propose a new importance estimation algorithm for the blog environment. The proposed method, first, calculates the importance of each document using user's bookmark and click count. Then, the Guru point of a blogger is computed as the sum of all importance points of documents which he/she wrote. Finally, the guru points are reflected in document ranking again. Our experiments show that the proposed method has higher correlation coefficient than the traditional methods with respect to correct answers.

Implementation Techniques to Apply the PageRank Algorithm (페이지랭크 알고리즘 적용을 위한 구현 기술)

  • Kim, Sung-Jin;Lee, Sang-Ho;Bang, Ji-Hwan
    • The KIPS Transactions:PartD
    • /
    • v.9D no.5
    • /
    • pp.745-754
    • /
    • 2002
  • The Google search site (http://www.google.com), which was introduced in 1998, implemented the PageRank algorithm for the first time. PageRank is a ranking method based on the link structure of the Web pages. Even though PageRank has been implemented and being used in various commercial search engines, implementation details did not get documented well, primarily due to business reasons. Implementation techniques introduced in [4,8] are not sufficient to produce PageRank values of Web pages. This paper explains the techniques[4,8], and suggests major data structure and four implementation techniques in order to apply the PageRank algorithm. The paper helps understand the methods of applying PageRank algorithm by means of showing a real system that produces PageRank values of Web pages.

The Use of Reinforcement Learning and The Reference Page Selection Method to improve Web Spidering Performance (웹 탐색 성능 향상을 위한 강화학습 이용과 기준 페이지 선택 기법)

  • 이기철;이선애
    • Journal of the Korea Computer Industry Society
    • /
    • v.3 no.3
    • /
    • pp.331-340
    • /
    • 2002
  • The web world is getting so huge and untractable that without an intelligent information extractor we would get more and more helpless. Conventional web spidering techniques for general purpose search engine may be too slow for the specific search engines, which concentrate only on specific areas or keywords. In this paper a new model for improving web spidering capabilities is suggested and experimented. How to select adequate reference web pages from the initial web Page set relevant to a given specific area (or keywords) can be very important to reduce the spidering speed. Our reference web page selection method DOPS dynamically and orthogonally selects web pages, and it can also decide the appropriate number of reference pages, using a newly defined measure. Even for a very specific area, this method worked comparably well almost at the level of experts. If we consider that experts cannot work on a huge initial page set, and they still have difficulty in deciding the optimal number of the reference web pages, this method seems to be very promising. We also applied reinforcement learning to web environment, and DOPS-based reinforcement learning experiments shows that our method works quite favorably in terms of both the number of hyper links and time.

  • PDF

Web Information Retrieval Exploiting Markup Pattern (마크업 패턴을 이용한 웹 검색)

  • Kim, Min-Soo;Kim, Min-Koo
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.13 no.6
    • /
    • pp.407-411
    • /
    • 2007
  • Over the years, great attention has been paid to the question of exploiting inherent semantic of HTML in the area of web document retrieval. Although HTML is mainly presentation oriented, HTML tags implicitly contain useful semantics that can be catch meaning of text. Focusing on this idea. in this paper we define 'markup pattern' and try to improve performance of web document retrieval using markup patterns. Markup pattern is a mirror of intends of web document publisher and an internal semantic of text on web document. To discover the markup pattern and exploit it, we suggest a new scheme for extracting concepts and weighting documents. For evaluation task, we select two domains-BBC and CNN web sites, and use their search engines to gather domain documents. We re-weight and re-score documents using proposed scheme, and show the performance improvement in the two domains.

Evaluation of Internet Web-sites Providing Infant Health Related Information for Daycare Staffs (국내 인터넷 상의 보육시설 영유아 건강관련 웹사이트의 평가)

  • Kim, Ji-Soo
    • The Journal of the Korea Contents Association
    • /
    • v.10 no.1
    • /
    • pp.334-342
    • /
    • 2010
  • The purpose of this study was to evaluate web-sites related to infant health at day-care centers. A total of 87 web-sites were ultimately selected through Internet search engines, and educare sites. The basis of assessment in which standards and items were amended for the assessment of health information in previously used, and purpose, content, pertinence, accuracy, confidence, utility, authority, interactivity and continuance was assessed. Though the total score were 32 point, mean score was relatively low, 20.39 point. The score of accuracy and authority was relatively low while that of confidence and interactivity was high. As a result, it was revealed that there were no specific infant health related web-site with reliability guaranteed. Therefore, there is a need to develop a web-site providing accurate health information for infant at day-care centers. It is expected that these results can be utilized as a data for developing reliable web-site related infant health.

Assessment Using Practices and Quality of Food Safety Information on the Internet (인터넷 위생 정보 이용 실태 조사 및 질 평가)

  • Lee, Hye-Yeon;Lim, Tae-Youn;Lee, Jin-Hyang;Park, Min-Kyung;Chae, Mi-Jin;Bae, Hyun-Joo
    • Korean journal of food and cookery science
    • /
    • v.26 no.3
    • /
    • pp.229-237
    • /
    • 2010
  • This study was conducted to suggest potential future directions in the effective operations of food safety information on the internet. This study consisted of two sections. The first section was a survey that was performed by 186 dieticians in Daegu and Gyeongbuk area. The second section was the assessment of the quality of food safety information on the internet. 163 web sites that provided food safety information on three major searching engines using the search words 'food safety' and 'HACCP' were investigated. The trained panels evaluated the contents of the web sites using a 3-point scale assessment tool. The data collected data from the questionnaires were statistically analyzed using the SPSS package program (version 14.0). The results of this analysis were as follow. The primary purpose of internet usage by detitians was to search for information (87.2%). In addition, the detitians' satisfaction degree of food safety information on the internet was 3.17 out of 5 points. The operators of web sites that were evaluated by the researchers were business and industries (76.1%), public organizations (15.3%), and individuals (8.8%). The following four items, update of content, offer scientific basis or explanation of content, offer useful information and new informations, should be improved immediately. Among evaluation items, six items were shown to be significantly different according to the operators of web sites. The evaluation scores of web sites that were operated by public organizations were significantly higher than those of web sites operated by individuals or business and industries. In conclusion, regular and consistent qualitative evaluation of web sites is needed to improve the quality of information that is provided via the internet. In addition, web sites that provided information on food safety should be actively operated by public organizations.