• Title/Summary/Keyword: Web data

Search Result 5,605, Processing Time 0.036 seconds

A Study of Main Contents Extraction from Web News Pages based on XPath Analysis

  • Sun, Bok-Keun
    • Journal of the Korea Society of Computer and Information
    • /
    • v.20 no.7
    • /
    • pp.1-7
    • /
    • 2015
  • Although data on the internet can be used in various fields such as source of data of IR(Information Retrieval), Data mining and knowledge information servece, and contains a lot of unnecessary information. The removal of the unnecessary data is a problem to be solved prior to the study of the knowledge-based information service that is based on the data of the web page, in this paper, we solve the problem through the implementation of XTractor(XPath Extractor). Since XPath is used to navigate the attribute data and the data elements in the XML document, the XPath analysis to be carried out through the XTractor. XTractor Extracts main text by html parsing, XPath grouping and detecting the XPath contains the main data. The result, the recognition and precision rate are showed in 97.9%, 93.9%, except for a few cases in a large amount of experimental data and it was confirmed that it is possible to properly extract the main text of the news.

Characterization Studies on Data Access Bias in Mobile Platforms

  • Bahn, Hyokyung
    • International journal of advanced smart convergence
    • /
    • v.10 no.4
    • /
    • pp.52-58
    • /
    • 2021
  • Data access bias can be observed in various types of computing systems. In this paper, we characterize the data access bias in modern mobile computing platforms. In particular, we focus on the access bias of data observed at three different subsystems based on our experiences. First, we show the access bias of file data in mobile platforms. Second, we show the access bias of memory data in mobile platforms. Third, we show the access bias of web data and web servers. We expect that the characterization study in this paper will be helpful in the efficient management of mobile computing systems.

A Study of Web Site Hacking Through Vulnerability Analysis (취약점 분석을 통한 Web Site 해킹 연구)

  • Song, Jin-Young;Park, Dea-Woo
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2010.05a
    • /
    • pp.303-306
    • /
    • 2010
  • Personal information being leaked, and personal assets that through a malicious web site for hackers to exploit. Other confidential information via the web site of the country, and your personal information by illegally accessing the data has been obtained who Hacker forces are operating in some countries. Due to the problem of web site management has many vulnerabilities that web sites, as well as programs. In this paper, in the trend world, as well as domestic XSS, SQL Injection, Web Shell analysis of the vulnerability to attacks and XSS, SQL Injection, Web Shell is a direct attack to attack. Security measures are presented what after the attack demonstrated the hack to data collection, analysis. In this study, web site management, web site security and safety can be improved and research will contribute.

  • PDF

Development of GIS Application using Web-based CAD (Web기반 CAD를 이용한 지리정보시스템 구현)

  • Kim, Han-Su;Im, Jun-Hong;Kim, Jae-Deuk;Shin, So-Eun
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.3 no.3
    • /
    • pp.69-76
    • /
    • 2000
  • This study deals with development GIS application using web-based CAD, this application serves to user, designer, manager that more convenient and various functions. Development to this application, collect attribute data from fieldwork and geographic data from cadastral map and aerial survey map and then development to user interface using HTML, JavaScript, ASP, Whip ActiveX control. This application's characters are as follows ; First, system designer designed that anyone who have basic knowledge about web and CAD can develop this application. A system structure simplification by 2-Tier. Geographic information use DWF(drawing web format) file and attribute information use DBMS in consideration of extension. Second, system manager can service independently GIS in Web need not high priced GIS engine, so more economical. Third, internet user get service GIS information and function that search of information, zoom in/out, pan, print etc., if you need more functions, add function without difficultly. Developed application as above, not only save volume but fast of speed as use vector data exclude character and image data. Also, this application can used by means of commercial and travel information service but also various GIS service of public institution and private in web.

  • PDF

An Analysis Method of User Preference by using Web Usage Data in User Device (사용자 기기에서 이용한 웹 데이터 분석을 통한 사용자 취향 분석 방법)

  • Lee, Seung-Hwa;Choi, Hyoung-Kee;Lee, Eun-Seok
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.15 no.3
    • /
    • pp.189-199
    • /
    • 2009
  • The amount of information on the Web is explosively growing as the Internet gains in popularity. However, only a small portion of the information on the Web is truly relevant or useful to the user. Thus, offering suitable information according to user demand is an important subject in information retrieval. In e-commerce, the recommender system is essential to revitalize commercial transactions, raise user satisfaction and loyalty towards the information provider. The existing recommender systems are mostly based on user data collected at servers, so user data are dispersed over several servers. Therefore, web servers that lack sufficient user behavior data cannot easily infer user preferences. Also, if the user visits the server infrequently, it may be hard to reflect the dynamically changing user's interest. This paper proposes a novel personalization system analyzing the user preference based on web documents that are accessed by the user on a user device. The system also identifies non-content blocks appearing repeatedly in the dynamically generated web documents, and adds weight to the keywords extracted from the hyperlink sentence selected by the user. Therefore, the system establishes at an early stage recommendation strategies for the web server that has little user data. Also, user profiles are generated rapidly and more accurately by identifying the information blocks. In order to evaluate the proposed system, this study collected web data and purchase history from users who have current purchase activity. Then, we computed the similarity between purchase data and the user profile. We confirm the accuracy of the generated user profile since the web page containing the purchased item has higher correlation than other item pages.

A Study on the Development of Traffic Accident Information System Based on WebGIS (WebGIS 기반 교통사고정보관리 시스템 개발에 관한 연구)

  • Jeong, Su-Jin;Lim, Seung-Hyeon;Cho, Gi-Sung
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.26 no.6D
    • /
    • pp.1003-1010
    • /
    • 2006
  • This study developed a traffic accident information management system based on WebGIS that can process a lot of data for giving effectively diagnosis of traffic accidents in serious damage circumstances by traffic accident. Also, this study presents a way to compose and to convey traffic accident information. In addition, non-spatial attributes as well as spatial attributes about traffic accidents information be integrated and managed by the system. To provide Web service, we developed modules that can supply visually spatial information and traffic accidents data through ASP, Javascript, ArcIMS based on Web and constructed a server. And constructed system include a function that offer the now situation of traffic accident in real time, which supply the statistical data of traffic accident through Web as soon as user entry data in comparison with previous way that preparatory period until traffic accidents data is supplied to peoples had been long. Traffic accidents are analyzed with only nonspatial attribute by simply collecting in the past. However, system constructed by this study offer new function that can grasp visually accident spot circumstance and use detailed content and accurate location data as well as statistical data of traffic accidents. Also, it offer interface that can connect directly with accident charge policeman.

A Study On Web Contents Accessibility of Hospital Web Sites in Korea (국내 의료기관의 규모별 웹 콘텐츠 접근성 현황에 관한 연구)

  • Kim, Jong-Min;Ryu, Hwang-Gun
    • The Korean Journal of Health Service Management
    • /
    • v.4 no.2
    • /
    • pp.33-46
    • /
    • 2010
  • In this study, we investigated web contents accessibility of 60 hospital web sites in Korea. The eight evaluation criteria were used for estimating the web contents accessibility of the web sites. These criteria were as follows: providing an alternative text, providing caption for moving picture, providing a skip navigation, usage of pop-up windows, usage of a summary or a caption tag for data table, providing a page title, providing a label for online form, and usage of java scripts. K-WAH 3.0 was used for estimating five evaluation criteria. According to Internet web contents accessibility guideline 1.0, we estimated the rest three evaluation criteria manually and described good or bad examples for the evaluation results technically. The results show that the web accessibility of hospital web sites is generally insufficient and the constant interests in improvement for accessibility are urgently needed.

A Study on Factors Influencing the Web Site Development (웹사이트 개발 수준에 영향을 미치는 요인에 관한 연구)

  • Moon Yong-Eun;Jung Eau-Jin
    • The Journal of Information Systems
    • /
    • v.12 no.2
    • /
    • pp.73-90
    • /
    • 2003
  • IT and e-business is a hot issue of the world business and economic communities. The web site is the portal through which most of electronic transactions are conducted today. The main purpose of this paper is to investigate on factors influencing the web site development at the firms in Korea. Survey questionnaires were distributed via email & post to 281 firms that have implemented web site. Out of 44 responded questionnaires, 39 data sets were available for statistical analysis with SPPSSWin 10.1. Factor analysis identified five dimensions of web site development : (1) support of top management, (2) information technologies maturity, (3) strategic utilization of web site, (4) strength of rivalry among competing firm, (5) support policy of the government. Results of this study show that information technologies maturity and strategic utilization of web site positively influence on web site development. But support of top management, strength of rivalry among competing firm and support policy of the government don't influence on web site development. This study may provide a guideline to improve the quality of web site development.

  • PDF

Web crawler Improvement and Dynamic process Design and Implementation for Effective Data Collection (효과적인 데이터 수집을 위한 웹 크롤러 개선 및 동적 프로세스 설계 및 구현)

  • Wang, Tae-su;Song, JaeBaek;Son, Dayeon;Kim, Minyoung;Choi, Donggyu;Jang, Jongwook
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.11
    • /
    • pp.1729-1740
    • /
    • 2022
  • Recently, a lot of data has been generated according to the diversity and utilization of information, and the importance of big data analysis to collect, store, process and predict data has increased, and the ability to collect only necessary information is required. More than half of the web space consists of text, and a lot of data is generated through the organic interaction of users. There is a crawling technique as a representative method for collecting text data, but many crawlers are being developed that do not consider web servers or administrators because they focus on methods that can obtain data. In this paper, we design and implement an improved dynamic web crawler that can efficiently fetch data by examining problems that may occur during the crawling process and precautions to be considered. The crawler, which improved the problems of the existing crawler, was designed as a multi-process, and the work time was reduced by 4 times on average.

Construction of an International Standard-Based Plant Data Repository Utilizing Web Services Technology (웹 서비스 기술을 활용한 국제 표준 기반의 플랜트 데이터 저장소의 구현)

  • Mun, Du-Hwan;Kim, Byung-Chul
    • IE interfaces
    • /
    • v.23 no.3
    • /
    • pp.213-220
    • /
    • 2010
  • As the market becomes increasingly globalized and competition among companies increases in severity, various specialized organizations are participating across the process plant lifecycle, including the stages of design, construction, operation and maintenance, and dismantlement, in order to ensure efficiency and elevate competitiveness. In this regard, it is an important technical issue to develop services or information systems for sharing process plant data among participating organizations. ISO 15926 is an international standard for integration of lifecycle data for process plants including oil and gas facilities. ISO 15926 Part 7, a part of the ISO 15926 standard, specifies an implementation method called a facade that uses Web Services and ontology technologies for constructing plant data repositories and related services, with the aim of sharing lifecycle data of process plants. This paper discusses the ISO 15926-based prototype facade implemented for storing equipment data of nuclear power plants and servicing the data to interested organizations.