• Title/Summary/Keyword: Web data

Search Result 5,605, Processing Time 0.038 seconds

Crawlers and Morphological Analyzers Utilize to Identify Personal Information Leaks on the Web System (크롤러와 형태소 분석기를 활용한 웹상 개인정보 유출 판별 시스템)

  • Lee, Hyeongseon;Park, Jaehee;Na, Cheolhun;Jung, Hoekyung
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2017.10a
    • /
    • pp.559-560
    • /
    • 2017
  • Recently, as the problem of personal information leakage has emerged, studies on data collection and web document classification have been made. The existing system judges only the existence of personal information, and there is a problem in that unnecessary data is not filtered because classification of documents published by the same name or user is not performed. In this paper, we propose a system that can identify the types of data or homonyms using the crawler and morphological analyzer for solve the problem. The user collects personal information on the web through the crawler. The collected data can be classified through the morpheme analyzer, and then the leaked data can be confirmed. Also, if the system is reused, more accurate results can be obtained. It is expected that users will be provided with customized data.

  • PDF

Adaptive Web Search based on User Web Log (사용자 웹 로그를 이용한 적응형 웹 검색)

  • Yoon, Taebok;Lee, Jee-Hyong
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.15 no.11
    • /
    • pp.6856-6862
    • /
    • 2014
  • Web usage mining is a method to extract meaningful patterns based on the web users' log data. Most existing patterns of web usage mining, however, do not consider the users' diverse inclination but create general models. Web users' keywords can have a variety of meanings regarding their tendency and background knowledge. This study evaluated the extraction web-user's pattern after collecting and analyzing the web usage information on the users' keywords of interest. Web-user's pattern can supply a web page network with various inclination information based on the users' keywords of interest. In addition, the Web-user's pattern can be used to recommend the most appropriate web pages and the suggested method of this experiment was confirmed to be useful.

A Semantic Web Service for Tourism Information over the Mobile Web (시맨틱 웹에 기초한 모바일 관광정보 서비스)

  • Lee, Yang-Won
    • Journal of the Korean Geographical Society
    • /
    • v.42 no.5
    • /
    • pp.788-807
    • /
    • 2007
  • To better publish geographical information on the Web, it is important to capture how Web technologies are changing. For a recent decade, Semantic Web has been developed by incorporating ontologies into the current Web, with an aim to make computers understand rather than simply display. Ontology, an explicit specification of a conceptualization, and the Semantic Web grounded on the ontology, have the potential for effective sharing and appropriate retrieval of geographical information. This paper describes a Semantic Web Service over the mobile Web that can offer pertinent tourism information according to user contexts. To do this, a tourism ontology was formalized in the PARA(Place-Attraction-Resource-Activity) ontology model by organizing tourist places, tourist attractions, tourism resources, and activities. Locational relationships between tourist places were also included in the PARA ontology model to take into account the movements of tourists on a railway network. The XML(Extensible Markup Language) Web Service in the middle tier manages the client-side request for information retrieval and the corresponding server-side response from the data provider. The PARA ontology was integrated into the XML Web Service for the concept-based discovery of tourism information. The applicability of the proposed system was tested through a simulation experiment for Tokyo tourism.

Detecting Intentionally Biased Web Pages In terms of Hypertext Information (하이퍼텍스트 정보 관점에서 의도적으로 왜곡된 웹 페이지의 검출에 관한 연구)

  • Lee Woo Key
    • Journal of the Korea Society of Computer and Information
    • /
    • v.10 no.1 s.33
    • /
    • pp.59-66
    • /
    • 2005
  • The organization of the web is progressively more being used to improve search and analysis of information on the web as a large collection of heterogeneous documents. Most people begin at a Web search engine to find information. but the user's pertinent search results are often greatly diluted by irrelevant data or sometimes appear on target but still mislead the user in an unwanted direction. One of the intentional, sometimes vicious manipulations of Web databases is a intentionally biased web page like Google bombing that is based on the PageRank algorithm. one of many Web structuring techniques. In this thesis, we regard the World Wide Web as a directed labeled graph that Web pages represent nodes and link edges. In the Present work, we define the label of an edge as having a link context and a similarity measure between link context and target page. With this similarity, we can modify the transition matrix of the PageRank algorithm. By suggesting a motivating example, it is explained how our proposed algorithm can filter the Web intentionally biased web Pages effective about $60\%% rather than the conventional PageRank.

  • PDF

Implementation of the Secure Web Server-Client Module Based on Protocol Architecture (프로토콜 기반 웹 클라이언트-서버 보안 모듈 구현)

  • Jang, Seung-Ju;Han, Soo-Whan
    • The KIPS Transactions:PartD
    • /
    • v.9D no.5
    • /
    • pp.931-938
    • /
    • 2002
  • We implement the PBSM (Protocol-Based Security Module) system which guarantees the secure data transmission under web circumstances. There are two modules to implement for the PBSM architecture. One is Web Server Security Module (WSSM) which is working on a web server, the other is the Winsock Client Security Module (WSCSM) which is working on a client. The WSCSM security module decrypts the encrypted HTML document that is received from the security web server The decrypted HTML document is displayed on the screen of a client. The WSSM module contains the encryption part for HTML file and the decryption part for CGI (Common Gateway Interface). We also implement the proposed idea at the web system.

Biological Data Analysis Using CCBB Web Services (CCBB Web Services를 이용한 생명정보 데이터 분석)

  • Cho Hee-Hyung;Ahn Sung-Soo;Ahn Bu-Young;Kim Kyoung-Su;Park Hyung-Seong
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2005.11a
    • /
    • pp.43-47
    • /
    • 2005
  • With the high interest and the results of active research in life science and Bioinformatics recent years, various information and resources have been produced and new algorithms have been developed to analyze those. Web Services is a computer technology using XML, SOAP, WSDL and UDDI. This paper introduces the One Stop Web Services program developed in CCBB which has a graphic user interface for users.

  • PDF

Efficient Content-based Load Distribution for Web Server Clusters (웹 서버 클러스터를 위한 효율적인 내용 기반의 부하 분배)

  • Chung Ji Yung;Kim Sungsoo
    • Journal of KIISE:Information Networking
    • /
    • v.32 no.1
    • /
    • pp.60-67
    • /
    • 2005
  • A cluster consists of a collection of interconnected stand-alone computers working together and provides a high-availability solution in application area such as web services or information systems. Content-based load distribution for web server clusters uses the detailed data found in the application layer to intelligently route user requests among web servers. In this paper, we propose a content-based load distribution algorithm that considers cache hit and load information of the web servers under the web server clusters. In addition, we expand this algorithm in order to manage user requests for dynamic file. Specially, our algorithm does not keep track of any frequency of access information or try to model the contents of the caches of the web servers.

Mobile Web Service Architecture Using Context-store

  • Oh, Sang-Yoon;Aktas, Mehmet;Fox, Geoffrey C.
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.4 no.5
    • /
    • pp.836-858
    • /
    • 2010
  • Web Services allow a user to integrate applications from different platforms and languages. Since mobile applications often run on heterogeneous platforms and conditions, Web Service becomes a popular solution for integrating with server applications. However, because of its verbosity, XML based SOAP messaging gives the possible overhead to the less powerful mobile devices. Based on the mobile client's behavior that it usually exchanges messages with Web Service continuously in a session, we design the Handheld Flexible Representation architecture. Our proposed architecture consists of three main components: optimizing message representation by using a data format language (Simple_DFDL), streaming communication channel to reduce latency and the Context-store to store context information of a session as well as redundant parts of the messages. In this paper, we focus on the Context-store and describe the architecture with the Context-store for improving the performance of mobile Web Service messaging. We verify our approach by conducting various evaluations and investigate the performance and scalability of the proposed architecture. The empirical results show that we save 40% of transit time between a client and a service by reducing the message size. In contrast to solutions for a single problem such as the compression or binarization, our architecture addresses the problem at a system level. Thus, by using the Context-store, we expect reliable recovery from the fault condition and enhancing interoperability as well as improving the messaging performance.

Relation between the Image Analysis of Internet Fashion Shopping Site and Consumption Emotion - Focused on T-shirts Web Pages - (인터넷 쇼핑 사이트의 이미지 분석과 소비감성과의 관계 - 티셔츠 웹 페이지를 중심으로 -)

  • Kim, Eun-Jeong;Lee, Kyoung-Hee
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.31 no.8
    • /
    • pp.1273-1285
    • /
    • 2007
  • The purpose of this study is to understand consumer emotion about T-shirts web pages and to provide the basis for effective design plan of them. 72 T-shirts web pages through 62 sites have been chosen as stimulus pictures, and the valuation tools are composed of 21 pairs of image adjective and 3 questions for valuation of consumption emotion. Data has been collected on subjects of 480 men and women at the age of $16{\sim}27$ who live in Busan. The image factors are Aestheticism, Activeness, Stability, Intimacy. The types of T-shirts web pages are classified into four groups. The image according to the type of T-shirts web pages has showed meaningful differences in all factors, and the differences of image factors according to design elements have been meaningfully presented. In the relation between consumption emotion and image of T-shirts web pages, Impulse needs, Buying needs, Recommendation needs are related to Aestheticism factor and Stability factor. The consumption emotion according to the type of T-shirts web pages is appeared high in the type 2(Refine image) and 3(Vivid image). The valuation of consumption emotion according design elements has presented meaningful differences all design elements except menu.

Effects of Web Browsing Motivation and Retail Strategy on Purchase Conversion Behavior for Apparel (의류제품 웹브라우징 동기와 소매전략요소가 구매전환행동에 미치는 효과)

  • Kim, Eun-Young
    • Korean Journal of Human Ecology
    • /
    • v.20 no.4
    • /
    • pp.849-860
    • /
    • 2011
  • This study explores a structural model to examine the relationship between web browsing motivation, retail strategy and purchase conversion for apparel on shopping websites. A self-administered questionnaire based on existing scales includes web browsing motivation, retail strategy, and purchase conversion intention of apparel on the shopping websites. A total of 499 usable questionnaires were obtained from consumers aging 20 to 49 who reside in metropolitan cities in Korea. For data analysis, descriptive statistics, exploratory factor analysis, confirmatory factor analysis, and structural equation models were used via SPSS 12.0 and LISREL 8.8. Findings concluded that web browsing motivations consisted of three factors: hedonic, informational, and recreational browsing for apparel. Hedonic browsing had a negative effect on purchase conversion intention, whereas informational browsing had a positive effect on the purchase conversion intention for apparel on shopping sites. Retail strategies on the website were classified into service, merchandise assortment, and price & promotion; the three elements of retail strategies mediated the relationship between web browsing motivations and purchase conversion intention for apparel. Specially, merchandise assortment had significantly direct effect on the purchase conversion intention of apparel on shopping websites. Managerial implications were discussed for fashion marketers to develop retail strategies and web content in order to convert web browsers or visitors into purchasers.