• Title/Summary/Keyword: Statistical Analytics

Search Result 44, Processing Time 0.027 seconds

STATISTICAL MODELLING USING DATA MINING TOOLS IN MERGERS AND ACQUISITION WITH REGARDS TO MANUFACTURE & SERVICE SECTOR

  • KALAIVANI, S.;SIVAKUMAR, K.;VIJAYARANGAM, J.
    • Journal of applied mathematics & informatics
    • /
    • v.40 no.3_4
    • /
    • pp.563-575
    • /
    • 2022
  • Many organizations seek statistical modelling facilitated by data analytics technologies for determining the prediction models associated with M&A (Merger and Acquisition). By combining these data analytics tool alongside with data collection approaches aids organizations towards M&A decision making, followed by achieving profitable insights as well. It promotes for better visibility, overall improvements and effective negotiation strategies for post-M&A integration. This paper explores on the impact of pre and post integration of M&A in a standard organizational setting via devising a suitable statistical model via employing techniques such as Naïve Bayes, K-nearest neighbour (KNN), and Decision Tree & Support Vector Machine (SVM).

R programming: Language and Environment for Statistical Computing and Data Visualization (R 프로그래밍: 통계 계산과 데이터 시각화를 위한 환경)

  • Lee, D.H.;Ren, Ye
    • Electronics and Telecommunications Trends
    • /
    • v.28 no.1
    • /
    • pp.42-51
    • /
    • 2013
  • The R language is an open source programming language and a software environment for statistical computing and data visualization. The R language is widely used among a lot of statisticians and data scientists to develop statistical software and data analysis. The R language provides a variety of statistical and graphical techniques, including basic descriptive statistics, linear or nonlinear modeling, conventional or advanced statistical tests, time series analysis, clustering, simulation, and others. In this paper, we first introduce the R language and investigate its features as a data analytics tool. As results, we may explore the application possibility of the R language in the field of data analytics.

  • PDF

Facilitating Conditions in Adopting Big Data Analytics at Medical Aid Organizations in South Africa

  • VELA, Junior Vela;SUBRAMANIAM, Prabhakar Rontala;OFUSORI, Lizzy Oluwatoyin
    • The Journal of Industrial Distribution & Business
    • /
    • v.13 no.11
    • /
    • pp.1-10
    • /
    • 2022
  • Purpose: This study measures the influence of facilitating conditions on employees' attitudes towards the adoption of big data analytics by selected medical aid organizations in Durban. In the health care sector, there are various sources of big data such as patients' medical records, medical examination results, and pharmacy prescriptions. Several organizations take the benefits of big data to improve their performance and productivity. Research design, data, and methodology: A survey research strategy was conducted on some selected medical aid organizations. A non-probability sampling and the purposive sampling technique were adopted in this study. The collected data was analysed using version 23 of Statistical Package for Social Science (SPSS) Results: the results show that the "facilitating conditions" have a positive influence on employees' attitudes in the adoption of big data analytics Conclusions: The findings of this study provide empirical and scientific contributions of the facilitating conditions issues regarding employee attitudes toward big data analytics adoption. The findings of this study will add to the body of knowledge in this field and raise awareness, which will spur further research, particularly in developing countries.

Statistical Korean Spoken Language Understanding System for Dialog Processing (대화처리를 위한 통계기반 한국어 음성언어이해 시스템)

  • Roh, Yoon-Hyung;Yang, Seong-II;Kim, Young-Gil
    • Annual Conference on Human and Language Technology
    • /
    • 2012.10a
    • /
    • pp.215-218
    • /
    • 2012
  • 본 논문에서는 한국어 대화 처리를 위한 통계기반 음성언어이해 시스템에 대해 기술한다. 음성언어이해시스템은 대화처리에서 음성 인식된 문장으로부터 사용자의 의도를 인식하여 의미표현으로 표현하는 기능을 담당한다. 한국어의 특성을 반영한 실용적인 음성언어이해 시스템을 위해서 강건성과 적용성, 확장성 등이 요구된다. 이를 위해 본 시스템은 음성언어의 특성상 구조분석을 하지 않고, 마이닝 기법을 이용하여 사용자 의도 표현을 생성하는 방식을 취하고 있다. 또한 한국어에서 나타나는 특징들에 대한 처리를 위해 자질 추가 및 점규화 처리 등을 수행하였다. 정보서비스용 대화처리 시스템을 대상으로 개발되고 있고, 차량 정보서비스용 학습 코퍼스를 대상으로 실험을 하여 문장단위 정확률로 약 89%의 성능을 보이고 있다.

  • PDF

Predicting Selling Price of First Time Product for Online Seller using Big Data Analytics

  • Deora, Sukhvinder Singh;Kaur, Mandeep
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.2
    • /
    • pp.193-197
    • /
    • 2021
  • Customers are increasingly attracted towards different e-commerce websites and applications for the purchase of products significantly. This is the reason the sellers are moving to different internet based services to sell their products online. The growth of customers in this sector has resulted in the use of big data analytics to understand customers' behavior in predicting the demand of items. It uses a complex process of examining large amount of data to uncover hidden patterns in the information. It is established on the basis of finding correlation between various parameters that are recorded, understanding purchase patterns and applying statistical measures on collected data. This paper is a document of the bottom-up strategy used to manage the selling price of a first-time product for maximizing profit while selling it online. It summarizes how existing customers' expectations can be used to increase the sale of product and attract the attention of the new customer for buying the new product.

Analysis of Market Trajectory Data using k-NN

  • Park, So-Hyun;Ihm, Sun-Young;Park, Young-Ho
    • Journal of Multimedia Information System
    • /
    • v.5 no.3
    • /
    • pp.195-200
    • /
    • 2018
  • Recently, as the sensor and big data analysis technology have been developed, there have been a lot of researches that analyze the purchase-related data such as the trajectory information and the stay time. Such purchase-related data is usefully used for the purchase pattern prediction and the purchase time prediction. Because it is difficult to find periodic patterns in large-scale human data, it is necessary to look at actual data sets, find various feature patterns, and then apply a machine learning algorithm appropriate to the pattern and purpose. Although existing papers have been used to analyze data using various machine learning methods, there is a lack of statistical analysis such as finding feature patterns before applying the machine learning algorithm. Therefore, we analyze the purchasing data of Songjeong Maeil Market, which is a data gathering place, and finds some characteristic patterns through statistical data analysis. Based on the results of 1, we derive meaningful conclusions by applying the machine learning algorithm and present future research directions. Through the data analysis, it was confirmed that the number of visits was different according to the regional characteristics around Songjeong Maeil Market, and the distribution of time spent by consumers could be grasped.

Bayesian Prediction of Exponentiated Weibull Distribution based on Progressive Type II Censoring

  • Jung, Jinhyouk;Chung, Younshik
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.6
    • /
    • pp.427-438
    • /
    • 2013
  • Based on progressive Type II censored sampling which is an important method to obtain failure data in a lifetime study, we suggest a very general form of Bayesian prediction bounds from two parameters exponentiated Weibull distribution using the proper general prior density. For this, Markov chain Monte Carlo approach is considered and we also provide a simulation study.

Enhanced Regular Expression as a DGL for Generation of Synthetic Big Data

  • Kai, Cheng;Keisuke, Abe
    • Journal of Information Processing Systems
    • /
    • v.19 no.1
    • /
    • pp.1-16
    • /
    • 2023
  • Synthetic data generation is generally used in performance evaluation and function tests in data-intensive applications, as well as in various areas of data analytics, such as privacy-preserving data publishing (PPDP) and statistical disclosure limit/control. A significant amount of research has been conducted on tools and languages for data generation. However, existing tools and languages have been developed for specific purposes and are unsuitable for other domains. In this article, we propose a regular expression-based data generation language (DGL) for flexible big data generation. To achieve a general-purpose and powerful DGL, we enhanced the standard regular expressions to support the data domain, type/format inference, sequence and random generation, probability distributions, and resource reference. To efficiently implement the proposed language, we propose caching techniques for both the intermediate and database queries. We evaluated the proposed improvement experimentally.

Doing social big data analytics: A reflection on research question, data format, and statistical test-Convergent aspects (소셜네트워크서비스 빅데이터 분석을 위한 연구문제 설정과 통계적 제 문제-융합적 관점)

  • Park, Han-Woo;Choi, Kyoung-ho
    • Journal of Digital Convergence
    • /
    • v.14 no.12
    • /
    • pp.591-597
    • /
    • 2016
  • Research question and method play important roles in conducting a research in a scientifically valid way. In today's digitalized research environment, social network service (SNS) has rapidly become a new source of big data. While this shift provides new challenges for researchers in Korea, there is little scholarly discussion of how research questions can be framed and what statistical methods can be applied. This article suggests some basic but primary types of example questions for researchers employing social big data analytics. Further, we illustrate the interface of the intended data set specifically for SNS-mediated communication and information exchange behaviors. Lastly, a statistical test known as proper method for social big data is introduced.

Bankruptcy Prediction Modeling Using Qualitative Information Based on Big Data Analytics (빅데이터 기반의 정성 정보를 활용한 부도 예측 모형 구축)

  • Jo, Nam-ok;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.2
    • /
    • pp.33-56
    • /
    • 2016
  • Many researchers have focused on developing bankruptcy prediction models using modeling techniques, such as statistical methods including multiple discriminant analysis (MDA) and logit analysis or artificial intelligence techniques containing artificial neural networks (ANN), decision trees, and support vector machines (SVM), to secure enhanced performance. Most of the bankruptcy prediction models in academic studies have used financial ratios as main input variables. The bankruptcy of firms is associated with firm's financial states and the external economic situation. However, the inclusion of qualitative information, such as the economic atmosphere, has not been actively discussed despite the fact that exploiting only financial ratios has some drawbacks. Accounting information, such as financial ratios, is based on past data, and it is usually determined one year before bankruptcy. Thus, a time lag exists between the point of closing financial statements and the point of credit evaluation. In addition, financial ratios do not contain environmental factors, such as external economic situations. Therefore, using only financial ratios may be insufficient in constructing a bankruptcy prediction model, because they essentially reflect past corporate internal accounting information while neglecting recent information. Thus, qualitative information must be added to the conventional bankruptcy prediction model to supplement accounting information. Due to the lack of an analytic mechanism for obtaining and processing qualitative information from various information sources, previous studies have only used qualitative information. However, recently, big data analytics, such as text mining techniques, have been drawing much attention in academia and industry, with an increasing amount of unstructured text data available on the web. A few previous studies have sought to adopt big data analytics in business prediction modeling. Nevertheless, the use of qualitative information on the web for business prediction modeling is still deemed to be in the primary stage, restricted to limited applications, such as stock prediction and movie revenue prediction applications. Thus, it is necessary to apply big data analytics techniques, such as text mining, to various business prediction problems, including credit risk evaluation. Analytic methods are required for processing qualitative information represented in unstructured text form due to the complexity of managing and processing unstructured text data. This study proposes a bankruptcy prediction model for Korean small- and medium-sized construction firms using both quantitative information, such as financial ratios, and qualitative information acquired from economic news articles. The performance of the proposed method depends on how well information types are transformed from qualitative into quantitative information that is suitable for incorporating into the bankruptcy prediction model. We employ big data analytics techniques, especially text mining, as a mechanism for processing qualitative information. The sentiment index is provided at the industry level by extracting from a large amount of text data to quantify the external economic atmosphere represented in the media. The proposed method involves keyword-based sentiment analysis using a domain-specific sentiment lexicon to extract sentiment from economic news articles. The generated sentiment lexicon is designed to represent sentiment for the construction business by considering the relationship between the occurring term and the actual situation with respect to the economic condition of the industry rather than the inherent semantics of the term. The experimental results proved that incorporating qualitative information based on big data analytics into the traditional bankruptcy prediction model based on accounting information is effective for enhancing the predictive performance. The sentiment variable extracted from economic news articles had an impact on corporate bankruptcy. In particular, a negative sentiment variable improved the accuracy of corporate bankruptcy prediction because the corporate bankruptcy of construction firms is sensitive to poor economic conditions. The bankruptcy prediction model using qualitative information based on big data analytics contributes to the field, in that it reflects not only relatively recent information but also environmental factors, such as external economic conditions.