• Title/Summary/Keyword: Internet Search Engine

Search Result 185, Processing Time 0.024 seconds

An Automated Technique for Illegal Site Detection using the Sequence of HTML Tags (HTML 태그 순서를 이용한 불법 사이트 탐지 자동화 기술)

  • Lee, Kiryong;Lee, Heejo
    • Journal of KIISE
    • /
    • v.43 no.10
    • /
    • pp.1173-1178
    • /
    • 2016
  • Since the introduction of BitTorrent protocol in 2001, everything can be downloaded through file sharing, including music, movies and software. As a result, the copyright holder suffers from illegal sharing of copyright content. In order to solve this problem, countries have enacted illegal share related law; and internet service providers block pirate sites. However, illegal sites such as pirate bay easily reopen the site by changing the domain name. Thus, we propose a technique to easily detect pirate sites that are reopened. This automated technique collects the domain names using the google search engine, and measures similarity using Longest Common Subsequence (LCS) algorithm by comparing the tag structure of the source web page and reopened web page. For evaluation, we colledted 2,383 domains from google search. Experimental results indicated detection of a total of 44 pirate sites for collected domains when applying LCS algorithm. In addition, this technique detected 23 pirate sites for 805 domains when applied to foreign pirate sites. This experiment facilitated easy detection of the reopened pirate sites using an automated detection system.

A Case Study on the Personalized Online Recruitment Services : Focusing on Worldjob+'s Use of Splunk (개인화된 구직정보서비스 제공에 관한 사례연구 : 월드잡플러스의 스플렁크 활용을 중심으로)

  • Rhee, MoonKi Kyle;Lee, Jae Deug;Park, Seong Taek
    • Journal of the Korea Convergence Society
    • /
    • v.9 no.2
    • /
    • pp.241-250
    • /
    • 2018
  • Online recruitment services have emerged as one of the most popular Internet services, providing job seekers with a comprehensive list of jobs and a search engine. But many recruitment services suffer from shortcomings due to their reliance on traditional client-pull information access model, in manay cases resulting in unfocused search results. Worldjob+, being operated by The Human Resources Development Service of Korea, addresses these problems and uses Splunk, a platform for analyzing machine data, to provide a more proactive and personalised services. It focuses on enhancing the existing system in two different ways: (a) using personalised automated matching techniques to proactively recommend most preferrable profile or specification information for each job opening announcement or recruiting company, (b) and to recommend most preferrable or desirable job opening announcement for each job-seeker. This approach is a feature-free recommendation technique that recommends information items to a given user based on what similar users have previously liked. A brief discussion about the potential benefit is also provided as a conclusion.

Library Information Service on the Web 2.0 (웹 2.0 기반의 도서관 정보서비스)

  • Yang, Byeong-Hoon
    • Journal of Information Management
    • /
    • v.39 no.1
    • /
    • pp.199-220
    • /
    • 2008
  • Most people choose Internet search engine first more than the library for their information search in these days. Many users do not know library homepage's content. How to improve the users in library homepage? This study aims to suggest the direction of library homepage service in Web 2.0. For this study the author analyzed library homepage that is introducing some representative Web 2.0 and other Web 2.0 sites. AJAX, RSS, Open API, MashUp, Wikis, Blog are the main technologies in Web 2.0. Those technologies become a tool that can do user centered library homepage. But, more important thing is information production that introduce to users. Web 2.0 suggests good information transfer for users. It needs to produce the information that stimulates the library user. It means that Web 2.0 give a good opportunity for libraries as an information production.

Construction of web-based nutrition education contents and searching engine for usage of healthy menu of children

  • Hong, Soon-Myung;Lee, Tae-Kyong;Chung, Hea-Jung;Park, Hye-Kyung;Lee, Eun-Ju;Nam, Hye-Seon;Jung, Soon-Im;Cho, Jee-Ye;Lee, Jin-Hee;Kim, Gon;Kim, Min-Chan
    • Nutrition Research and Practice
    • /
    • v.2 no.2
    • /
    • pp.114-120
    • /
    • 2008
  • A diet habit, which is developed in childhood, lasts for a life time. In this sense, nutrition education and early exposure to healthy menus in childhood is important. Children these days have easy access to the internet. Thus, a web-based nutrition education program for children is an effective tool for nutrition education of children. This site provides the material of the nutrition education for children with characters which are personified nutrients. The 151 menus are stored in the site together with video script of the cooking process. The menus are classified by the criteria based on age, menu type and the ethnic origin of the menu. The site provides a search function. There are three kinds of search conditions which are key words, menu type and "between" expression of nutrients such as calorie and other nutrients. The site is developed with the operating system Windows 2003 Server, the web server ZEUS 5, development language JSP, and database management system Oracle 10 g.

Intelligent Retrieval System for finding important travel information (중요 여행 정보를 찾기 위한 지능 검색 시스템)

  • Yun, Un-Il;Shin, Hyeon-Il;Ryu, Keun-Ho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.14 no.11
    • /
    • pp.113-121
    • /
    • 2009
  • The increasing interest in leisure activities of a five-day work per week has been recently prevailed. Additionally, as internet and mobile infrastructures have been becoming widespread, the user can get specific information using a search engine. However, it is difficult for the user to get accurate information they really want as shared information has been rapidly increased and the information has been searched. For example, users can retrieve required travel information, but they also must see a huge number of travel advertisements. In this paper, we design and implement a retrieval system using travel information collecting agent. The information gathering agent regularly visits travel-related category pages of the portal sites and major media travel-article pages to collect information related to travel, and the agent stores the gathered information to a database. Then, users can search the travel information conveniently without the need to view advertisements.

Hidden Markov Model-based Extraction of Internet Information (은닉 마코브 모델을 이용한 인터넷 정보 추출)

  • Park, Dong-Chul
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.46 no.3
    • /
    • pp.8-14
    • /
    • 2009
  • A Hidden Markov Model(HMM)-based information extraction method is proposed in this paper. The proposed extraction method is applied to extraction of products' prices. The input of the proposed IESHMM is the URLs of a search engine's interface, which contains the names of the product types. The output of the system is the list of extracted slots of each product: name, price, image, and URL. With the observation data set Maximum Likelihood algorithm and Baum-Welch algorithm are used for the training of HMM and The Viterbi algorithm is then applied to find the state sequence of the maximal probability that matches the observation block sequence. When applied to practical problems, the proposed HMM-based system shows improved results over a conventional method, PEWEB, in terms of recall ration and accuracy.

Weight-based Wellbeing Food Retrieval System (가중치 기반 웰빙식품 정보 검색 시스템)

  • Pyun, Gwang-Bum;Yun, Un-Il;Ryu, Keun-Ho
    • Journal of Internet Computing and Services
    • /
    • v.11 no.3
    • /
    • pp.75-86
    • /
    • 2010
  • As the interests in health grow higher, necessity of Well-being relation informations get more importance. We get the information of well-being, tinternet retrieval system or blog, homepage and media. Although, it is not easy to find informations of well-being food. So, retrieval system has been requiring information about well-being food. In this paper, Weight-based Wellbeing Food Retrieval System is designed and implemention. Finding numerous pages and if well-being keywords includes page, it was identified and add weight. User searching for keywords, it implement, well-being food pages comes at the first. Keywords for discrimination makes type of dictionary, so it can insert, delete, modify. Inverted files saves hasing(direct-based file). Retrieval System in this paper is experimental result, at keywords of well-being food show 5~15% imprement than another Retrieval System. In this paper, Weight-based Wellbeing Food Retrieval System's designed and proposed way to raking for well-being food.

A Markup Language for Describing the Linkage between Sensor Data and Service in the Ubiquitous Environment (유비쿼터스 환경에서 센서 데이터와 서비스의 연계를 표현하는 마크업 언어)

  • Lee, Hun-Soon;Jin, Seung-Il
    • The KIPS Transactions:PartD
    • /
    • v.15D no.2
    • /
    • pp.247-256
    • /
    • 2008
  • In the ubiquitous environment, it is scattered all over our neighboring in many smart objects. These smart objects constantly produce the information and the amount of the generated information is massive. As the internet search engine came out to help us to find the useful data from the sea of the information connected to the internet, the sensor data stream processing middleware is appearing to make us to develop the ubiquitous service easily by extracting the meaningful information from the massive sensor data and delivering the extracted information to the application which makes our life convenient. We have to inform the information relating to the provided service to a middleware so that the ubiquitous service can be provided by using sensor data stream processing middleware. In this paper, we classify the information which is needed to express the ubiquitous service which uses sensor data for the service providing. And we propose a distinct markup language called Context-driven Service Markup Language (CSML) to effectively describe this information. We can easily express the various ubiquitous services which have to be provided in the various situations using proposed CSML.

Evaluating the Quality of Basic Life Support Information for Primary Korean-Speaking Individuals on the Internet (국내 인터넷 웹 페이지에 나타난 기본심폐소생술 정보의 질 평가)

  • Kang, Hee Do;Moon, Hyung Jun;Lee, Jung Won;Choi, Jae Hyung;Lee, Dong Wook;Kim, Hyun Su;Kang, In Gu;Kim, Doh Eui;Lee, Hyung Jung;Lee, Han You
    • Health Communication
    • /
    • v.13 no.2
    • /
    • pp.125-132
    • /
    • 2018
  • Purpose: The aim of this study is to investigate the quality of basic life support (BLS) information for primary Korean-speaking individuals on the internet. Methods: Using the $Google^{(C)}$ search engine, we searched for the terms 'CPR', 'cardiopulmonary resuscitation (in Korean)' and 'cardiac arrest (in Korean)'. The accuracy, reliability and accessibility of web pages was evaluated based on the 2015 American heart association(AHA) guidelines for CPR & emergency cardiovascular care, the health on the net foundation code of conduct and Korean web content accessibility guidelines 2.1, respectively. Results: Of the 178 web pages screened, 50 met criteria for inclusion. The overall quality of BLS information was not enough (median 5/7, IQR 4.75-6). 23(36%) pages were created in accordance with 2010 AHA guidelines. Only 24(48%) web pages educated on how to use the automated electrical defibrillator. The attribution and transparency of the reliability of pages was relatively low, 20(40%) and 16(32%). The web accessibility score was relatively high. Conclusion: A small of proportion of internet web pages searched by Google have high quality BLS information for a Korean-speaking population. Web pages based on past guideline were still being searched. The notation of the source of CPR information and the transparency of the author should be improved. The verification and evaluation of the quality of BLS information exposed to the Internet are continuously needed.

Analysis of Posting Preferences and Prediction of Update Probability on Blogs (블로그에서 포스팅 성향 분석과 갱신 가능성 예측)

  • Lee, Bum-Suk;Hwang, Byung-Yeon
    • Journal of KIISE:Databases
    • /
    • v.37 no.5
    • /
    • pp.258-266
    • /
    • 2010
  • In this paper, we introduce a novel method to predict next update of blogs. The number of RSS feeds registered on meta-blogs is on the order of several million. Checking for updates is very time consuming and imposes a heavy burden on network resources. Since blog search engine has limited resources, there is a fix number of blogs that it can visit on a day. Nevertheless we need to maximize chances of getting new data, and the proposed method which predicts update probability on blogs could bring better chances for it. Also this work is important to avoid distributed denial-of-service attack for the owners of blogs. Furthermore, for the internet as whole this work is important, too, because our approach could minimize traffic. In this study, we assumed that there is a specific pattern to when a blogger is actively posting, in terms of days of the week and, more specifically, hours of the day. We analyzed 15,119 blogs to determine a blogger's posting preference. This paper proposes a method to predict the update probability based on a blogger's posting history and preferred days of the week. We applied proposed method to 12,115 blogs to check the precision of our predictions. The evaluation shows that the model has a precision of 0.5 for over 93.06% of the blogs examined.