• Title/Abstract/Keywords: statistical learning

Search results: 1,298 items (processing time 0.024 s)

Fault Prediction Using Statistical and Machine Learning Methods for Improving Software Quality

  • Malhotra, Ruchika;Jain, Ankita
    • Journal of Information Processing Systems / Vol. 8, No. 2 / pp.241-262 / 2012
  • An understanding of quality attributes helps a software organization deliver highly reliable software. An empirical assessment of metrics that predict quality attributes is essential for gaining insight into software quality in the early phases of development and for ensuring corrective actions. In this paper, we build models to estimate fault proneness using Object Oriented CK metrics and QMOOD metrics. We apply one statistical method and six machine learning methods to build the models. The proposed models are validated using a dataset collected from Open Source software. The results are analyzed using the Area Under the Curve (AUC) obtained from Receiver Operating Characteristic (ROC) analysis. The results show that the models built using the random forest and bagging methods outperformed all the other models. Hence, based on these results, it is reasonable to claim that quality models are significantly related to Object Oriented metrics and that machine learning methods perform comparably to statistical methods.

Recent deep learning methods for tabular data

  • Yejin Hwang;Jongwoo Song
    • Communications for Statistical Applications and Methods / Vol. 30, No. 2 / pp.215-226 / 2023
  • Deep learning has made great strides in the field of unstructured data such as text, images, and audio. However, for tabular data analysis, machine learning algorithms such as ensemble methods still outperform deep learning. To match the performance of machine learning algorithms with good predictive power, several deep learning methods for tabular data have been proposed recently. In this paper, we review the latest deep learning models for tabular data and compare their performance on several datasets. We also compare the latest boosting methods with these deep learning methods and suggest guidelines for users who analyze tabular datasets. In regression, machine learning methods are better than deep learning methods. For classification problems, however, deep learning methods perform better than machine learning methods in some cases.

Collaborative CRM using Statistical Learning Theory and Bayesian Fuzzy Clustering

  • Jun, Sung-Hae
    • Communications for Statistical Applications and Methods / Vol. 11, No. 1 / pp.197-211 / 2004
  • With the growth of internet applications, marketing, research and surveys, education, and government administration have come to depend heavily on the web. The volume of goods and sales traded in internet shopping malls has increased sharply. This creates a need for an automated, intelligent information system that manages the users connected to a web site for effective marketing. To recommend suitable information to users from numerous web contents, we propose an automatic recommendation system that provides the necessary information to connected web users using statistical learning theory and Bayesian fuzzy clustering. We call this system collaborative CRM. The performance of the proposed system is compared with other methods using real data from an existing shopping mall site. The results show that the proposed system achieves better predictive accuracy than the other methods.

Seven Facets of Learning Agility in Higher Education for Future Society

  • SUNG, Eunmo
    • Educational Technology International / Vol. 22, No. 2 / pp.169-197 / 2021
  • Learning agility, as a marker of high potential, is drawing attention as a competency for leading an uncertain future society. The present study aims to determine the factors of learning agility in the higher education context for future society. To this end, major factors related to learning agility were derived through a literature review and statistically verified. For the statistical analysis, nationwide data were collected from 1,000 undergraduate students in South Korea by the National Youth Policy Institute. The participants were asked to answer the 29 items of the learning agility questionnaire (LAQ). The collected data were analyzed by descriptive statistical analysis, exploratory factor analysis, and confirmatory factor analysis. As a result, the normality and reliability of the learning agility items were verified. Seven factors of learning agility were identified: challenging mind, learning responsibility, reflecting experience, intellectual curiosity, systemic thinking, change adaptability, and logical thinking. The structural model fit of the seven factors was also confirmed to be good. Based on these findings, empirical, theoretical, and practical contributions are presented, and suggestions for further research are proposed in detail.

Is it possible to forecast KOSPI direction using deep learning methods?

  • Choi, Songa;Song, Jongwoo
    • Communications for Statistical Applications and Methods / Vol. 28, No. 4 / pp.329-338 / 2021
  • Deep learning methods have been developed and used in various fields, and they have shown outstanding performance in many cases. Many studies have predicted daily stock returns, a classic example of time-series data, using deep learning methods. We also applied deep learning methods to Korea's stock market data. We used Korea's stock market index (KOSPI) and several individual stocks to forecast daily returns and directions. We compared several deep learning models with other machine learning methods, including random forest and XGBoost. In regression, long short-term memory (LSTM) and gated recurrent unit (GRU) models are better than the other prediction models. For the classification applications, there is no clear winner. However, even the best deep learning models cannot predict significantly better than the simple base model. We believe that it is challenging to predict daily stock return data even with the latest deep learning methods.

A RESEARCH ANALYSIS ON EFFECTIVE LEARNING IN INTERNATIONAL CONSTRUCTION JOINT VENTURES

  • L.T. Zhang;W.F. Wong;Charles Y.J. Cheah
    • International Conference Proceedings / The 2nd International Conference on Construction Engineering and Project Management / pp.450-458 / 2007
  • This paper presents the results of a statistical analysis and the associated research findings, focusing on the learning aspect of international joint ventures (IJVs). The contents of this paper are derived from a sample of 96 field cases based on a proposed conceptual model of effective learning for international construction joint ventures (ICJVs). The paper briefly reviews the conceptual model and its hypotheses and summarizes the key results of the statistical analysis, including factor analysis and multiple regression analysis, used to test the validity of the proposed conceptual model and its associated research hypotheses. Among other findings, the research confirms that ICJVs provide an excellent platform for in-action learning for construction organizations and suggests that good learning outcomes can be reaped by a company that has a clear learning intent from the beginning and subsequently takes corresponding learning actions throughout the joint venture.

Effects of Flight Instructor's Transformative Leaderships on Student Pilot's Psychological Stabilities and Learning Satisfactions

  • 박원태
    • Journal of the Korean Society for Aviation and Aeronautics (한국항공운항학회지) / Vol. 28, No. 3 / pp.41-51 / 2020
  • This research was conducted to verify objectively how a flight instructor's transformative leadership affects student pilots' psychological stability and learning satisfaction. Through exploratory factor analysis, the flight instructor's transformative leadership factor was divided into individual consideration, intellectual stimulus, and charisma. The psychological stability factor was subdivided into happiness, concentration, and satisfaction. The learning satisfaction factor was subdivided into participation, recommendation, persistence, accomplishment, and relationship. The analysis of the effect of the flight instructor's transformative leadership on psychological stability showed statistical significance for happiness, concentration, and satisfaction, with a positive influence on happiness and concentration. The regression analysis showed that individual consideration and charisma, in that order, affected happiness and concentration. However, individual consideration, intellectual stimulus, and charisma did not show a statistically significant effect on student pilots' satisfaction. The analysis of the flight instructor's transformative leadership and student pilots' learning satisfaction showed a statistically significant relationship between them. Intellectual stimulus and charisma had a positive influence on student pilots' learning satisfaction. The regression analysis showed that charisma and intellectual stimulus, in that order, affected student pilots' learning satisfaction.

Relationship on Learning Environment's Distribution and Thinking Skills in Accounting Instruction

  • Nor Sa'adah JAMALUDDIN;Siti Zubaidah MOHD ARIFFIN
    • Journal of Distribution Science (유통과학연구) / Vol. 21, No. 7 / pp.33-40 / 2023
  • Purpose: Higher Order Thinking Skills (HOTS) are an important aspect of education that students must master in order to compete at the international level. Students' success in mastering HOTS is often linked to the preparation of a good and conducive learning environment. However, does this connection affect students' HOTS achievement? This research therefore evaluates the relationship between HOTS and the learning environment, with the main focus on the Accounting Principle Elective Subject (MPEI PP). Research design, data and methodology: This study uses a correlational design and involves 59 Form 5 students who have learned the entire syllabus of Form 4's MPEI PP. Results: The evaluation of HOTS level is based on Bloom's Taxonomy and covers the skills of applying, analysing, evaluating, and creating. The data analysis found a very weak correlation (r = 0.02) between the two variables, with the regression equation average grade point = 75.023 + (-.273) Learning Environment. Conclusion: The correlation and regression analyses thus indicate a non-significant relationship between HOTS and the learning environment.

A Designing for Successful Learning on the Web

  • Ahn, Jeong-Yong;Han, Kyung-Soo;Han, Beom-Soo
    • Journal of the Korean Data and Information Science Society / Vol. 14, No. 4 / pp.1083-1090 / 2003
  • Web-based learning is currently an active area of research, and a considerable number of studies have been conducted on its application in the learning environment. However, in spite of many advances in the research and development of educational content, questions about how the environment affects learning remain largely unanswered. In this article, we propose a Web-based learning environment to improve the educational effect. The goal of this article is not to provide a complete system to support Web-based learning but rather to describe some meaningful strategies and fundamental design concepts that use information technologies to support teaching and learning.

The Analysis of Academic Achievement based on Spatio-Temporal Data Relate to e-Learning Patterns of University e-Learning Learners

  • 이해듬;남민우
    • Journal of Convergence for Information Technology (융합정보논문지) / Vol. 8, No. 4 / pp.247-253 / 2018
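  • This study examined differences in learners' attendance rates and academic achievement according to their e-learning patterns, using spatio-temporal learning data from university e-learning learners. E-learning data were collected from 13,611 students enrolled in a total of 68 e-learning courses over three years, and the data were analyzed using t-tests and two-way analysis of variance. The results are as follows. First, in the analysis of differences in attendance and academic achievement by learning space, learners who studied mainly on campus scored higher in both attendance and academic achievement than learners who studied mainly off campus, and the difference in academic achievement was statistically significant. Second, for time of day, attendance and academic achievement were highest for learners who studied mainly in the morning, followed by those who studied mainly in the afternoon and then those who studied mainly at night, and all of these differences were significant. For time of week, learners who studied mainly on weekdays showed higher attendance and academic achievement than those who studied mainly on weekends, and these differences were also statistically significant.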