• Title/Summary/Keyword: Statistical Analytics

Search Result 44, Processing Time 0.018 seconds

Big-Data Traffic Analysis for the Campus Network Resource Efficiency (학내 망 자원 효율화를 위한 빅 데이터 트래픽 분석)

  • An, Hyun-Min;Lee, Su-Kang;Sim, Kyu-Seok;Kim, Ik-Han;Jin, Seo-Hoon;Kim, Myung-Sup
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.40 no.3
    • /
    • pp.541-550
    • /
    • 2015
  • The importance of efficient enterprise network management has been emphasized continuously because of the rapid utilization of Internet in a limited resource environment. For the efficient network management, the management policy that reflects the characteristics of a specific network extracted from long-term traffic analysis is essential. However, the long-term traffic data could not be handled in the past and there was only simple analysis with the shot-term traffic data. However, as the big data analytics platforms are developed, the long-term traffic data can be analyzed easily. Recently, enterprise network resource efficiency through the long-term traffic analysis is required. In this paper, we propose the methods of collecting, storing and managing the long-term enterprise traffic data. We define several classification categories, and propose a novel network resource efficiency through the multidirectional statistical analysis of classified long-term traffic. The proposed method adopted to the campus network for the evaluation. The analysis results shows that, for the efficient enterprise network management, the QoS policy must be adopted in different rules that is tuned by time, space, and the purpose.

Latent topics-based product reputation mining (잠재 토픽 기반의 제품 평판 마이닝)

  • Park, Sang-Min;On, Byung-Won
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.39-70
    • /
    • 2017
  • Data-drive analytics techniques have been recently applied to public surveys. Instead of simply gathering survey results or expert opinions to research the preference for a recently launched product, enterprises need a way to collect and analyze various types of online data and then accurately figure out customer preferences. In the main concept of existing data-based survey methods, the sentiment lexicon for a particular domain is first constructed by domain experts who usually judge the positive, neutral, or negative meanings of the frequently used words from the collected text documents. In order to research the preference for a particular product, the existing approach collects (1) review posts, which are related to the product, from several product review web sites; (2) extracts sentences (or phrases) in the collection after the pre-processing step such as stemming and removal of stop words is performed; (3) classifies the polarity (either positive or negative sense) of each sentence (or phrase) based on the sentiment lexicon; and (4) estimates the positive and negative ratios of the product by dividing the total numbers of the positive and negative sentences (or phrases) by the total number of the sentences (or phrases) in the collection. Furthermore, the existing approach automatically finds important sentences (or phrases) including the positive and negative meaning to/against the product. As a motivated example, given a product like Sonata made by Hyundai Motors, customers often want to see the summary note including what positive points are in the 'car design' aspect as well as what negative points are in thesame aspect. They also want to gain more useful information regarding other aspects such as 'car quality', 'car performance', and 'car service.' Such an information will enable customers to make good choice when they attempt to purchase brand-new vehicles. In addition, automobile makers will be able to figure out the preference and positive/negative points for new models on market. In the near future, the weak points of the models will be improved by the sentiment analysis. For this, the existing approach computes the sentiment score of each sentence (or phrase) and then selects top-k sentences (or phrases) with the highest positive and negative scores. However, the existing approach has several shortcomings and is limited to apply to real applications. The main disadvantages of the existing approach is as follows: (1) The main aspects (e.g., car design, quality, performance, and service) to a product (e.g., Hyundai Sonata) are not considered. Through the sentiment analysis without considering aspects, as a result, the summary note including the positive and negative ratios of the product and top-k sentences (or phrases) with the highest sentiment scores in the entire corpus is just reported to customers and car makers. This approach is not enough and main aspects of the target product need to be considered in the sentiment analysis. (2) In general, since the same word has different meanings across different domains, the sentiment lexicon which is proper to each domain needs to be constructed. The efficient way to construct the sentiment lexicon per domain is required because the sentiment lexicon construction is labor intensive and time consuming. To address the above problems, in this article, we propose a novel product reputation mining algorithm that (1) extracts topics hidden in review documents written by customers; (2) mines main aspects based on the extracted topics; (3) measures the positive and negative ratios of the product using the aspects; and (4) presents the digest in which a few important sentences with the positive and negative meanings are listed in each aspect. Unlike the existing approach, using hidden topics makes experts construct the sentimental lexicon easily and quickly. Furthermore, reinforcing topic semantics, we can improve the accuracy of the product reputation mining algorithms more largely than that of the existing approach. In the experiments, we collected large review documents to the domestic vehicles such as K5, SM5, and Avante; measured the positive and negative ratios of the three cars; showed top-k positive and negative summaries per aspect; and conducted statistical analysis. Our experimental results clearly show the effectiveness of the proposed method, compared with the existing method.

Factors Affecting Falls of Demented Inpatients (치매 입원환자의 낙상 영향 요인)

  • Kim, Sang-Mi;Lee, Seong-A
    • 한국노년학
    • /
    • v.39 no.2
    • /
    • pp.231-240
    • /
    • 2019
  • The study aimed to identify risk factors for falls as well as hospitalization status according to disease and demographic characteristics of demented inpatients by investigating the in-depth Injury Patient Surveillance System data collected by Korea Centers for Disease Control and Prevention(KCDC). Older adults over 60 years old who were diagnosed with dementia were included(n=1,732). Their data were analyzed after being assigned to either a fall group or a non-fall group. STATA was used for statistical analyses, such as frequency analysis, chi-square (χ2) test, and logistics regression. It was found that 8.0% of the demented inpatients experienced falls. According to the analysis on category of fall and non-fall group were statistically significant difference in age and Charlson Comorbidity Index(CCI) and bone density deficiency. Based on the logistic regression analysis of factors affecting falls, older adults over 80 are 2.386 times more likely to fall and based on a target with a CCI of 0, the risk of falls is 0.421 times lower, finally based on those without bone density disorder, the fall risk for those with bone density disorder was 3.581 times higher. Therefore, we expect that the important about the factors relating to falls identified in this can not only be found valuable for educating inpatients with dementia and care-givers, but also be used as reference that supports clinical professionals to make decisions on falls management for patients with dementia.

A Foundational Study on Developing a Structural Model for AI-based Sentencing Prediciton Based on Violent Crime Judgment (인공지능기술 적용을 위한 강력범죄 판결문 기반 양형 예측 구조모델 개발 기초 연구)

  • Woongil Park;Eunbi Cho;Jeong-Hyeon Chang;Joo-chang Kim
    • Journal of Internet Computing and Services
    • /
    • v.25 no.1
    • /
    • pp.91-98
    • /
    • 2024
  • With the advancement of ICT (Information and Communication Technology), searching for judgments through the internet has become increasingly convenient. However, predicting sentencing based on judgments remains a challenging task for individuals. This is because sentencing involves a complex process of applying aggravating and mitigating factors within the framework of legal provisions, and it often depends on the subjective judgment of the judge. Therefore, this research aimed to develop a model for predicting sentencing using artificial intelligence by focusing on structuring the data from judgments, making it suitable for AI applications. Through theoretical and statistical analysis of previous studies, we identified variables with high explanatory power for predicting sentencing. Additionally, by analyzing 50 legal judgments related to serious crimes that are publicly available, we presented a framework for extracting essential information from judgments. This framework encompasses basic case information, sentencing details, reasons for sentencing, the reasons for the determination of the sentence, as well as information about offenders, victims, and accomplices evident within the specific content of the judgments. This research is expected to contribute to the development of artificial intelligence technologies in the field of law in the future.