• Title/Summary/Keyword: web statistics

Search Result 385, Processing Time 0.027 seconds

A Splog Detection System Using Support Vector Systems (지지벡터기계를 이용한 스팸 블로그(Splog) 판별 시스템)

  • Lee, Song-Wook
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.1
    • /
    • pp.163-168
    • /
    • 2011
  • Blogs are an easy way to publish information, engage in discussions, and form communities on the Internet. Recently, there are several varieties of spam blog whose purpose is to host ads or raise the PageRank of target sites. Our purpose is to develope the system which detects these spam blogs (splogs) automatically among blogs on Web environment. After removing HTML of blogs, they are tagged by part of speech(POS) tagger. Words and their POS tags information is used as a feature type. Among features, we select useful features with X2 statistics and train the SVM with the selected features. Our system acquired 90.5% of F1 measure with SPLOG data set.

Improving Satellite Derived Soil Moisture Data Using Data Assimilation Methods (자료동화 기법을 이용한 위성영상 추출 토양수분 자료 개선)

  • Hwang, Soonho;Ryu, Jeong Hoon;Kang, Moon Seong
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2018.05a
    • /
    • pp.152-152
    • /
    • 2018
  • Soil moisture is a important factor in hydrologic analysis. So, if we have spatially distributed soil moisture data, it can help to study much research in a various field. Recently, there are a lot of satellite derived soil moisture data, and it can be served through web freely. Especially, NASA (National Aeronautics and Space Administration) launched the Soil Moisture Aperture Passive (SMAP) satellite for mapping global soil moisture on 31 January 2015. SMAP data have many advantages for study, for example, SMAP data has higher spatial resolution than other satellited derived data. However, becuase many satellited derived soil moisture data have a limitation to data accuracy, if we have ancillary materials for improving data accuracy, it can be used. So, in this study, after applying the alogorithm, which is data assimilation methods, applicability of satellite derived soil moisture data was analyzed. Among the various data assimilation methods, in this study, Model Output Statistics (MOS) technique was used for improving satellite derived soil moisture data. Model Output Statistics (MOS) is a type of statistical post-processing, a class of techniques used to improve numerical weather models' ability to forecast by relating model outputs to observational or additional model data.

  • PDF

Companies Entering the Metabus Industry - Major Big Data Protection with Remote-based Hard Disk Memory Analysis Audit (AUDIT) System

  • Kang, Yoo seok;Kim, Soo dong;Seok, Hyeonseon;Lee, Jae cheol;Kwon, Tae young;Bae, Sang hyun;Yoon, Seong do;Jeong, Hyung won
    • Journal of Integrative Natural Science
    • /
    • v.14 no.4
    • /
    • pp.189-196
    • /
    • 2021
  • Recently, as a countermeasure for cyber breach attacks and confidential leak incidents on PC hard disk memory storage data of the metaverse industry, it is required when reviewing and developing a remote-based regular/real-time monitoring and analysis security system. The reason for this is that more than 90% of information security leaks occur on edge-end PCs, and tangible and intangible damage, such as an average of 1.20 billion won per metaverse industrial security secret leak (the most important facts and numerical statistics related to 2018 security, 10.2018. the same time as responding to the root of the occurrence of IT WORLD on the 16th, as it becomes the target of malicious code attacks that occur in areas such as the network system web due to interworking integration when building IT infrastructure, Deep-Access-based regular/real-time remote. The concept of memory analysis and audit system is key.

Notified Incidence of Tuberculosis in Foreign-born Individuals in Jeju Province, Republic of Korea

  • Kim, Dae Soon;Bae, Jong-Myon
    • Journal of Preventive Medicine and Public Health
    • /
    • v.52 no.1
    • /
    • pp.66-70
    • /
    • 2019
  • Objectives: In the Republic of Korea (ROK), the notified incidence of tuberculosis in foreign-born individuals (NITFBI) has increased recently, as has the rate of multidrug-resistant (MDR) and rifampicin-resistant (RR) tuberculosis in foreigners staying in the ROK. As Jeju Province in ROK has a no-visa entry policy, control programs for NITFBI should be consolidated. The aim was to evaluate the status of NITFBI, with a focus on the distribution of MDR/RR tuberculosis by nationality. Methods: Data on tuberculosis incidence in individuals born in Jeju Province and in foreign-born individuals were extracted from the Korean Statistical Information Service of Statistics Korea, and the Infectious Disease Surveillance Web Statistics of the Korea Centers for Disease Control and Prevention, respectively. Results: Among all notified incident cases of tuberculosis, the proportion of NITFBI increased from 1.46% in 2011 to 6.84% in 2017. China- and Vietnam-born individuals accounted for the greatest proportion of the 95 cases of NITFBI. Seven cases of MDR/RR tuberculosis were found, all involving patients born in China. Conclusions: In Jeju Province, ROK, NITFBI might become more common in the near future. Countermeasures for controlling active tuberculosis in immigrants born in high-risk nations for tuberculosis should be prepared in Jeju Province, since it is a popular tourist destination.

Utilization of Log Data Reflecting User Information-Seeking Behavior in the Digital Library

  • Lee, Seonhee;Lee, Jee Yeon
    • Journal of Information Science Theory and Practice
    • /
    • v.10 no.1
    • /
    • pp.73-88
    • /
    • 2022
  • This exploratory study aims to understand the potential of log data analysis and expand its utilization in user research methods. Transaction log data are records of electronic interactions that have occurred between users and web services, reflecting information-seeking behavior in the context of digital libraries where users interact with the service system during the search for information. Two ways were used to analyze South Korea's National Digital Science Library (NDSL) log data for three days, including 150,000 data: a log pattern analysis, and log context analysis using statistics. First, a pattern-based analysis examined the general paths of usage by logged and unlogged users. The correlation between paths was analyzed through a χ2 analysis. The subsequent log context analysis assessed 30 identified users' data using basic statistics and visualized the individual user information-seeking behavior while accessing NDSL. The visualization shows included 30 diverse paths for 30 cases. Log analysis provided insight into general and individual user information-seeking behavior. The results of log analysis can enhance the understanding of user actions. Therefore, it can be utilized as the basic data to improve the design of services and systems in the digital library to meet users' needs.

Individual and familial factors associated with youth sexual experience based on national sample survey (국가표본조사자료 기반 청소년 성경험의 개인 및 가족 요인 분석)

  • Hwang, Jinseub;Ryu, Jiin;Kim, Jiwon;Kim, Seokjoo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.1
    • /
    • pp.21-28
    • /
    • 2017
  • This study aims to identify individual and familial factors associated with youth sexual experience by using the nationally representative sample data in South Korea. Specifically, we select 68,043 students in middle and high schools participating in the 2015 Korea Youth Risk Behavior Web-based Survey. Considering the complex survey design, we conduct a descriptive analysis and multiple logistic regression for sexual experience. The main results identify factors on sexual experience such as age, type of school, stress level, drinking, smoking, economic status, and cohabiting parents. In particular, the drinking and smoking behaviors are positively associated with sexual experience and the youth living with neither parent is more likely to have a sexual experience than those who lived two parents. In conclusion, the plan of sex education should consider the risk factors and the quality of sex education should be enhanced in order to build more appropriate sexual culture and behaviors among the youth.

Statistical Approach to Sentiment Classification using MapReduce (맵리듀스를 이용한 통계적 접근의 감성 분류)

  • Kang, Mun-Su;Baek, Seung-Hee;Choi, Young-Sik
    • Science of Emotion and Sensibility
    • /
    • v.15 no.4
    • /
    • pp.425-440
    • /
    • 2012
  • As the scale of the internet grows, the amount of subjective data increases. Thus, A need to classify automatically subjective data arises. Sentiment classification is a classification of subjective data by various types of sentiments. The sentiment classification researches have been studied focused on NLP(Natural Language Processing) and sentiment word dictionary. The former sentiment classification researches have two critical problems. First, the performance of morpheme analysis in NLP have fallen short of expectations. Second, it is not easy to choose sentiment words and determine how much a word has a sentiment. To solve these problems, this paper suggests a combination of using web-scale data and a statistical approach to sentiment classification. The proposed method of this paper is using statistics of words from web-scale data, rather than finding a meaning of a word. This approach differs from the former researches depended on NLP algorithms, it focuses on data. Hadoop and MapReduce will be used to handle web-scale data.

  • PDF

Visualization analysis using R Shiny (R의 Shiny를 이용한 시각화 분석 활용 사례)

  • Na, Jonghwa;Hwang, Eunji
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.6
    • /
    • pp.1279-1290
    • /
    • 2017
  • R's {shiny} package provides an environment for creating web applications with only R scripts. Shiny does not require knowledge of a separate web programming language and its development is very easy and straightforward. In addition, Shiny has a variety of extensibility, and its functions are expanding day by day. Therefore, the presentation of high-quality results is an excellent tool for R-based analysts. In this paper, we present actual cases of large data analysis using Shiny. First, geological anomaly zone is extracted by analyzing topographical data expressed in the form of contour lines by analysis related to spatial data. Next, we will construct a model to predict major diseases by 16 cities and provinces nationwide using weather, environment, and social media information. In this process, we want to show that Shiny is very effective for data visualization and analysis.

Web-based Parking Lot Management System by Vehicle Movement (차량 영상을 이용한 웹기반 주차관리 시스템)

  • Lee, Hyo-Jong
    • Journal of the Institute of Electronics Engineers of Korea TC
    • /
    • v.46 no.12
    • /
    • pp.95-101
    • /
    • 2009
  • s economic development has been achieved and society gets complicated, problems of traffic system have been also exposed. Due to these problems, drivers have to endure economic loss and delayed time. A web-based parking lot management system has been proposed to solve this problem. Because a parking lot is an important resource of traffic system, efficient management of parking lots can be means to solve critical problems of traffic system. In this study a simple method is introduced to detect moving vehicles with geometric information of moving objects that has been computed from surveillance cameras installed in a parking lot. Statistical information processed from image data is also stored on a server side, such as total number of parking lots, a number of parked cars and a number of available parking spots. A client who wants to know the nearest parking place can share the information via a mobile device and shorten his or her driving time. Great benefit to both drivers and society is expected if many parking lots are equipped with this system.

The government official support status of the agricultural diseases, injuries and accidents among Korea and foreign countries and the implication of the agricultural policy of Korea (해외의 농업안전보건지원 실태 및 국내정책의 함의)

  • Lee, Kyung Sook;Choi, Jeong Wha;Baek, Yoon Jeong;Kim, Kyung Ran;Kim, Hyo Cher
    • Journal of Korean Society of Occupational and Environmental Hygiene
    • /
    • v.17 no.2
    • /
    • pp.89-100
    • /
    • 2007
  • Object: The purpose of this study was to survey the government official support status of the agricultural diseases, injuries and accidents among Korea and foreign countries and to suggest the agricultural policy of Korea. Methods: For this purpose, we analyzed the current national management support status among four foreign countries and Korea about agricultural diseases, injuries and accidents of farmers. For the foreign countries and the national support current status of agricultural diseases, injuries and accidents, related literature such as books, theses, articles, and web documents from the government organization of each countries were collected and analyzed. Key words for web-site and web documents were agricultural diseases, injuries, and accidents, government official system, safety and health, farmer's welfare, and farmer's official support system. UK, United States of America, France, and Japan were selected as the foreign countries' cases. Results and Conclusions: Implications for the agricultural diseases, injuries and accidents derived from the reviews among foreign countries and Korea were as follows: governmental supports should include (1) efforts on unifying administrative systems, (2) special support and management systems focusing on special subjects such as the agriculture that have been neglected, (3) aligned strategies including vision, goals, long-term plans about national safety and health projects, (4) development of supporting systems considering the features of agriculture, (5) systemized national surveys about occupational injuries and accidents for basic statistics and national studies, (6) active prevention efforts of agricultural diseases, injuries and accidents, and (8) specialized funds for safety and health of Korean farmers.