• Title/Summary/Keyword: Website Classification

Search Result 54, Processing Time 0.021 seconds

A Comparative Study of Phishing Websites Classification Based on Classifier Ensemble

  • Tama, Bayu Adhi;Rhee, Kyung-Hyune
    • Journal of Korea Multimedia Society
    • /
    • v.21 no.5
    • /
    • pp.617-625
    • /
    • 2018
  • Phishing website has become a crucial concern in cyber security applications. It is performed by fraudulently deceiving users with the aim of obtaining their sensitive information such as bank account information, credit card, username, and password. The threat has led to huge losses to online retailers, e-business platform, financial institutions, and to name but a few. One way to build anti-phishing detection mechanism is to construct classification algorithm based on machine learning techniques. The objective of this paper is to compare different classifier ensemble approaches, i.e. random forest, rotation forest, gradient boosted machine, and extreme gradient boosting against single classifiers, i.e. decision tree, classification and regression tree, and credal decision tree in the case of website phishing. Area under ROC curve (AUC) is employed as a performance metric, whilst statistical tests are used as baseline indicator of significance evaluation among classifiers. The paper contributes the existing literature on making a benchmark of classifier ensembles for web phishing detection.

A Comparative Study of Phishing Websites Classification Based on Classifier Ensembles

  • Tama, Bayu Adhi;Rhee, Kyung-Hyune
    • Journal of Multimedia Information System
    • /
    • v.5 no.2
    • /
    • pp.99-104
    • /
    • 2018
  • Phishing website has become a crucial concern in cyber security applications. It is performed by fraudulently deceiving users with the aim of obtaining their sensitive information such as bank account information, credit card, username, and password. The threat has led to huge losses to online retailers, e-business platform, financial institutions, and to name but a few. One way to build anti-phishing detection mechanism is to construct classification algorithm based on machine learning techniques. The objective of this paper is to compare different classifier ensemble approaches, i.e. random forest, rotation forest, gradient boosted machine, and extreme gradient boosting against single classifiers, i.e. decision tree, classification and regression tree, and credal decision tree in the case of website phishing. Area under ROC curve (AUC) is employed as a performance metric, whilst statistical tests are used as baseline indicator of significance evaluation among classifiers. The paper contributes the existing literature on making a benchmark of classifier ensembles for web phishing detection.

Website Classification based on Occurrence Frequency of Medical Terms and Hyperlinks in Webpage (웹페이지의 의학용어 출현 빈도와 하이퍼링크에 기반한 웹사이트 분류)

  • Lee, In Keun;Kim, Hwa Sun;Cho, Hune
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.23 no.2
    • /
    • pp.126-132
    • /
    • 2013
  • This study proposed a method to classify internet websites based on occurrence frequency of medical terms in the webpages and website structure composed with webpages and hyperlinks. The classification was done by using the suitability measure defined by three factors: (1)occurrence frequency of medical terms in the whole terms involved in a webpage, (2)occurrence frequency of medical terms in de-duplicated terms involved in the webpage, and (3)the number of hyperlinks to reach to a specific webpage from homepage. We conducted an experiment to verify the proposed method with the 80 websites registered in directories related to medical field and 127 websites in nonmedical field directories, and the experiment result showed 82.5 % of accuracy of the classification.

Automated Link Tracing for Classification of Malicious Websites in Malware Distribution Networks

  • Choi, Sang-Yong;Lim, Chang Gyoon;Kim, Yong-Min
    • Journal of Information Processing Systems
    • /
    • v.15 no.1
    • /
    • pp.100-115
    • /
    • 2019
  • Malicious code distribution on the Internet is one of the most critical Internet-based threats and distribution technology has evolved to bypass detection systems. As a new defense against the detection bypass technology of malicious attackers, this study proposes the automated tracing of malicious websites in a malware distribution network (MDN). The proposed technology extracts automated links and classifies websites into malicious and normal websites based on link structure. Even if attackers use a new distribution technology, website classification is possible as long as the connections are established through automated links. The use of a real web-browser and proxy server enables an adequate response to attackers' perception of analysis environments and evasion technology and prevents analysis environments from being infected by malicious code. The validity and accuracy of the proposed method for classification are verified using 20,000 links, 10,000 each from normal and malicious websites.

A Study on the Menu Structure and Term of Academic Library Web Site (국내 대학도서관 웹사이트 메뉴구조와 용어 분석)

  • 최흥식
    • Journal of the Korean Society for information Management
    • /
    • v.19 no.4
    • /
    • pp.137-161
    • /
    • 2002
  • The purpose of this study is to propose new menu structure and terms to be used by Website design for utilization of academic library Website. The menu structure was analyzed, based on seven menu patterns of Website which is widely used, and terms were analyzed by the frequency appearing at the Website. According to the analyzed result, the menu structure used to more than two menu patterns and the terms appear variety. The profitable menu pattern appears 〈table〉 and 〈frame + table〉 menu structures and the terms needs to systematic re-classification and controlled presentation. It is expected that this study can help a designer to the development and implementation of efficient Website. It helps not only to solve the problem of menu structure and term selection for librarian, but get rid of confusion of library services for users.

Classification of Service Types using Website Fingerprinting in Anonymous Encrypted Communication Networks (익명 암호통신 네트워크에서의 웹사이트 핑거프린팅을 활용한 서비스 유형 분류)

  • Koo, Dongyoung
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.11 no.4
    • /
    • pp.127-132
    • /
    • 2022
  • An anonymous encrypted communication networks that make it difficult to identify the trace of a user's access by passing through several virtual computers and/or networks, such as Tor, provides user and data privacy in the process of Internet communications. However, when it comes to abuse for inappropriate purposes, such as sharing of illegal contents, arms trade, etc. through such anonymous encrypted communication networks, it is difficult to detect and take appropriate countermeasures. In this paper, by extending the website fingerprinting technique that can identify access to a specific site even in anonymous encrypted communication, a method for specifying and classifying service types of websites for not only well-known sites but also unknown sites is proposed. This approach can be used to identify hidden sites that can be used for malicious purposes.

A Study on Website Analysis of apparel Brand through Marketing Mix -Focusing on Unisex Brand- (마케팅 믹스를 활용(活用)한 의류(衣類)브랜드 웹사이트 분석(分析) -유니섹스 브랜드를 중심(中心)으로-)

  • Lee, Min-Gyung;Rha, Soo-Im
    • Journal of Fashion Business
    • /
    • v.11 no.4
    • /
    • pp.69-81
    • /
    • 2007
  • This study, for the purpose of comparing and analyzing 23ea of national unisex apparel brands website consists of product, price, promotion and place divided by marketin gmix. Based of theoretical study and pre-research about the marketing mix, we made the classification standard for the marketing mix and analyzed the unisex apparel brand website according to 4P's individual item and the result was appeared like this. First of all, in the product section, this study provide information about product introduction/guidance, a product figure for item, introduction for new items, propose for coordination and brand introduction/information. Secondly, in the price part, almost apparel brands are provide their product's image, or present their goods photo with price, or displayed through the banner advertisement of discount or special price. Thirdly, For the marketing promotion part, compare to the other component in the most of apparel brand's website, marketing promotion has more section than the other marketing mix. And, especially, various events and customer service space has more weight than the others. Forth, in the place section, it's focused on the information of shopping mall location, contact number, address, and on-line shopping mall. In Conclusion, when the most of apparel brands are doing internet marketing, they're concern to product and promotion, but price and place needs more supplement in the unisex apparel brand's marketing mix.

Academic Conference Categorization According to Subjects Using Topical Information Extraction from Conference Websites (학회 웹사이트의 토픽 정보추출을 이용한 주제에 따른 학회 자동분류 기법)

  • Lee, Sue Kyoung;Kim, Kwanho
    • The Journal of Society for e-Business Studies
    • /
    • v.22 no.2
    • /
    • pp.61-77
    • /
    • 2017
  • Recently, the number of academic conference information on the Internet has rapidly increased, the automatic classification of academic conference information according to research subjects enables researchers to find the related academic conference efficiently. Information provided by most conference listing services is limited to title, date, location, and website URL. However, among these features, the only feature containing topical words is title, which causes information insufficiency problem. Therefore, we propose methods that aim to resolve information insufficiency problem by utilizing web contents. Specifically, the proposed methods the extract main contents from a HTML document collected by using a website URL. Based on the similarity between the title of a conference and its main contents, the topical keywords are selected to enforce the important keywords among the main contents. The experiment results conducted by using a real-world dataset showed that the use of additional information extracted from the conference websites is successful in improving the conference classification performances. We plan to further improve the accuracy of conference classification by considering the structure of websites.

Research on the Design of a Deep Learning-Based Automatic Web Page Generation System

  • Jung-Hwan Kim;Young-beom Ko;Jihoon Choi;Hanjin Lee
    • Journal of the Korea Society of Computer and Information
    • /
    • v.29 no.2
    • /
    • pp.21-30
    • /
    • 2024
  • This research aims to design a system capable of generating real web pages based on deep learning and big data, in three stages. First, a classification system was established based on the industry type and functionality of e-commerce websites. Second, the types of components of web pages were systematically categorized. Third, the entire web page auto-generation system, applicable for deep learning, was designed. By re-engineering the deep learning model, which was trained with actual industrial data, to analyze and automatically generate existing websites, a directly usable solution for the field was proposed. This research is expected to contribute technically and policy-wise to the field of generative AI-based complete website creation and industrial sectors.

A Study on the Internet Marketing Communication Strategy of Young Casual Fashion Brand through the Website Analysis (영 캐주얼 패션브랜드 웹사이트를 활용한 마케팅 커뮤니케이션 전략)

  • Lee, Min-Gyung;Rha, Soo-Im
    • Journal of Fashion Business
    • /
    • v.12 no.4
    • /
    • pp.46-55
    • /
    • 2008
  • The purpose of this study is to provide the effective internet marketing communication strategy as marketing tools by analyzing the web sites of young casual fashion brands. We've selected 19 young casual fashion brands in 3 department stores and made the classification standard - advertising, promotion, public relation(PR), customer management - and analysed the young casual fashion brands according to 4 classification standard on the web sites. As a result of study, it is found that 19 young casual brands' web sites put an emphasis on activity of customer management and promotion in general. However, they did not conduct the PR and advertising actively compared with other parts. Especially, the promotion strategy occupies more parts than any other parts through the variety of membership card's services. Also they are sending e-mails or providing 1:1(FAQ/Q&A) board to the members as a customer management to be able to help to communicate with customer through the web site.