• 제목/요약/키워드: Machine Learning Models

검색결과 1,358건 처리시간 0.03초

기계학습을 이용한 기업가적 혁신성 예측 모델에 관한 연구 (Machine Learning for Predicting Entrepreneurial Innovativeness)

  • 정두희;윤진섭;양성민
    • 벤처창업연구
    • /
    • 제16권3호
    • /
    • pp.73-86
    • /
    • 2021
  • 이 연구의 목적은 기업가적 혁신성을 정확하게 예측하는 고도화된 분석 모델을 탐색하는 것이다. 기업가정신 연구 분야에서는 최초로, 데이터 과학적 접근방식에 해당되는 기계학습(Machine learning)을 이용해 기업가적 혁신성(entrepreneurial innovativeness)을 예측하는 모델을 제시한다. 예측모델을 구축하기 위하여 Global Entrepreneurship Monitor(GEM)의 62개국 22,099건 데이터를 이용한다. 27개 설명변수로 이뤄진 데이터 셋을 토대로 전통적 통계방법인 다중회귀분석과, 회귀트리, 랜덤포레스트, XG부스트, 인공신경망 등 기계학습을 이용한 예측모델을 구축하고 각 모델의 성능을 비교한다. 모델의 성능 평가를 위해 RMSE(Root mean square error), MAE(Mean absolute error)와 상관관계(Correlation) 등 지표를 사용한다. 분석 결과 5가지 기계학습 기반 모델은 모두 전통적 방법에 비해 우수한 성능을 보였으며, 예측 성능이 가장 좋은 모델은 XG부스트였다. XG부스트를 통한 기업가적 혁신성 예측에 있어서 기여도가 높은 변수는 창업가의 기회인지 및 시장 확장의 교차항 변수이며, 이는 신시장에서 기회를 획득하고자 하는 유형의 창업기업이 높은 혁신성을 보인다는 점을 확인했다. 이 연구는 고도화된 분석방법인 기계학습을 이용해 새로운 예측모델을 제시, 기업가정신 연구의 시야를 확장했다는 점에서 의의를 지닌다.

Machine Learning Application to the Korean Freshwater Ecosystems

  • Jeong, Kwang-Seuk;Kim, Dong-Kyun;Chon, Tae-Soo;Joo, Gea-Jae
    • The Korean Journal of Ecology
    • /
    • 제28권6호
    • /
    • pp.405-415
    • /
    • 2005
  • This paper considers the advantage of Machine Learning (ML) implemented to freshwater ecosystem research. Currently, many studies have been carried out to find the patterns of environmental impact on dynamics of communities in aquatic ecosystems. Ecological models popularly adapted by many researchers have been a means of information processing in dealing with dynamics in various ecosystems. The up-to-date trend in ecological modelling partially turns to the application of ML to explain specific ecological events in complex ecosystems and to overcome the necessity of complicated data manipulation. This paper briefly introduces ML techniques applied to freshwater ecosystems in Korea. The manuscript provides promising information for the ecologists who utilize ML for elucidating complex ecological patterns and undertaking modelling of spatial and temporal dynamics of communities.

COVID-19 Prediction model using Machine Learning

  • Jadi, Amr
    • International Journal of Computer Science & Network Security
    • /
    • 제21권8호
    • /
    • pp.247-253
    • /
    • 2021
  • The outbreak of the deadly virus COVID-19 is said to infect 17.3Cr people around the globe since 2019. This outbreak is continuously affecting a lot of new people till this day and, most of it is said to under control. However, vaccines introduced around the world can help mitigate the risk of the virus. Apart from medical professionals, prediction models are also said to combinedly help predict the risk of infection based on given datasets. This paper is based on publication of a machine learning approach using regression models to predict the output based on dataset which have indictors grouped based on active, tested, recovered and critical cases along with regions and cities covering most of it from Dubai. Hence, the active cases are tested based on the other indicators and other attributes. The coefficient of the determination (r2) is 0.96, which is considered promising. This model can be used as an frame work, among others, to predict the resources related to the dangerous outbreak.

Improving streamflow and flood predictions through computational simulations, machine learning and uncertainty quantification

  • Venkatesh Merwade;Siddharth Saksena;Pin-ChingLi;TaoHuang
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2023년도 학술발표회
    • /
    • pp.29-29
    • /
    • 2023
  • To mitigate the damaging impacts of floods, accurate prediction of runoff, streamflow and flood inundation is needed. Conventional approach of simulating hydrology and hydraulics using loosely coupled models cannot capture the complex dynamics of surface and sub-surface processes. Additionally, the scarcity of data in ungauged basins and quality of data in gauged basins add uncertainty to model predictions, which need to be quantified. In this presentation, first the role of integrated modeling on creating accurate flood simulations and inundation maps will be presented with specific focus on urban environments. Next, the use of machine learning in producing streamflow predictions will be presented with specific focus on incorporating covariate shift and the application of theory guided machine learning. Finally, a framework to quantify the uncertainty in flood models using Hierarchical Bayesian Modeling Averaging will be presented. Overall, this presentation will highlight that creating accurate information on flood magnitude and extent requires innovation and advancement in different aspects related to hydrologic predictions.

  • PDF

머신러닝을 활용한 통계 분석 기반의 수면 호흡 장애 중증도 예측 (Severity Prediction of Sleep Respiratory Disease Based on Statistical Analysis Using Machine Learning)

  • 김준수;최병재
    • 대한임베디드공학회논문지
    • /
    • 제18권2호
    • /
    • pp.59-65
    • /
    • 2023
  • Currently, polysomnography is essential to diagnose sleep-related breathing disorders. However, there are several disadvantages to polysomnography, such as the requirement for multiple sensors and a long reading time. In this paper, we propose a system for predicting the severity of sleep-related breathing disorders at home utilizing measurable elements in a wearable device. To predict severity, the variables were refined through a three-step variable selection process, and the refined variables were used as inputs into three machine-learning models. As a result of the study, random forest models showed excellent prediction performance throughout. The best performance of the model in terms of F1 scores for the three threshold criteria of 5, 15, and 30 classified as the AHI index was about 87.3%, 90.7%, and 90.8%, respectively, and the maximum performance of the model for the three threshold criteria classified as the RDI index was approx 79.8%, 90.2%, and 90.1%, respectively.

머신러닝을 이용한 유기견 안락사 예측 (Prediction of the Shelter Dog Outcome using Machine Learning Models)

  • 이예슬;이세훈;존킨
    • 한국컴퓨터정보학회:학술대회논문집
    • /
    • 한국컴퓨터정보학회 2020년도 제62차 하계학술대회논문집 28권2호
    • /
    • pp.301-302
    • /
    • 2020
  • The number of abandoned dogs were increasing every year in South Korea. However, many dogs are euthanized in the shelter because of the lack of budget. This project predicts euthanasia of abandoned dogs using machine learning algorithm. It collects data from the public data portal where Korea government provides a public dataset as a form of open API. This project uses recent three-year data 2017 to 2019 and 263371 cases were founded. This project implements random forest and logistic regression models. This project attained an average 72% of prediction accuracy.

  • PDF

Iowa Liquor Sales Data Predictive Analysis Using Spark

  • Ankita Paul;Shuvadeep Kundu;Jongwook Woo
    • Asia pacific journal of information systems
    • /
    • 제31권2호
    • /
    • pp.185-196
    • /
    • 2021
  • The paper aims to analyze and predict sales of liquor in the state of Iowa by applying machine learning algorithms to models built for prediction. We have taken recourse of Azure ML and Spark ML for our predictive analysis, which is legacy machine learning (ML) systems and Big Data ML, respectively. We have worked on the Iowa liquor sales dataset comprising of records from 2012 to 2019 in 24 columns and approximately 1.8 million rows. We have concluded by comparing the models with different algorithms applied and their accuracy in predicting the sales using both Azure ML and Spark ML. We find that the Linear Regression model has the highest precision and Decision Forest Regression has the fastest computing time with the sample data set using the legacy Azure ML systems. Decision Tree Regression model in Spark ML has the highest accuracy with the quickest computing time for the entire data set using the Big Data Spark systems.

A Pragmatic Framework for Predicting Change Prone Files Using Machine Learning Techniques with Java-based Software

  • Loveleen Kaur;Ashutosh Mishra
    • Asia pacific journal of information systems
    • /
    • 제30권3호
    • /
    • pp.457-496
    • /
    • 2020
  • This study aims to extensively analyze the performance of various Machine Learning (ML) techniques for predicting version to version change-proneness of source code Java files. 17 object-oriented metrics have been utilized in this work for predicting change-prone files using 31 ML techniques and the framework proposed has been implemented on various consecutive releases of two Java-based software projects available as plug-ins. 10-fold and inter-release validation methods have been employed to validate the models and statistical tests provide supplementary information regarding the reliability and significance of the results. The results of experiments conducted in this article indicate that the ML techniques perform differently under the different validation settings. The results also confirm the proficiency of the selected ML techniques in lieu of developing change-proneness prediction models which could aid the software engineers in the initial stages of software development for classifying change-prone Java files of a software, in turn aiding in the trend estimation of change-proneness over future versions.

기계학습을 적용한 자기보고 증상 기반의 어혈 변증 모델 구축 (Machine Learning Approach to Blood Stasis Pattern Identification Based on Self-reported Symptoms)

  • 김현호;양승범;강연석;박영배;김재효
    • Korean Journal of Acupuncture
    • /
    • 제33권3호
    • /
    • pp.102-113
    • /
    • 2016
  • Objectives : This study is aimed at developing and discussing the prediction model of blood stasis pattern of traditional Korean medicine(TKM) using machine learning algorithms: multiple logistic regression and decision tree model. Methods : First, we reviewed the blood stasis(BS) questionnaires of Korean, Chinese, and Japanese version to make a integrated BS questionnaire of patient-reported outcomes. Through a human subject research, patients-reported BS symptoms data were acquired. Next, experts decisions of 5 Korean medicine doctor were also acquired, and supervised learning models were developed using multiple logistic regression and decision tree. Results : Integrated BS questionnaire with 24 items was developed. Multiple logistic regression models with accuracy of 0.92(male) and 0.95(female) validated by 10-folds cross-validation were constructed. By decision tree modeling methods, male model with 8 decision node and female model with 6 decision node were made. In the both models, symptoms of 'recent physical trauma', 'chest pain', 'numbness', and 'menstrual disorder(female only)' were considered as important factors. Conclusions : Because machine learning, especially supervised learning, can reveal and suggest important or essential factors among the very various symptoms making up a pattern identification, it can be a very useful tool in researching diagnostics of TKM. With a proper patient-reported outcomes or well-structured database, it can also be applied to a pre-screening solutions of healthcare system in Mibyoung stage.

Research on the application of Machine Learning to threat assessment of combat systems

  • Seung-Joon Lee
    • 한국컴퓨터정보학회논문지
    • /
    • 제28권7호
    • /
    • pp.47-55
    • /
    • 2023
  • 본 논문에서는 전투체계 위협지수를 머신러닝 모델 중 Gradient Boosting Regreesor, Suppor Vector Regressor를 통해 예측하는 방법을 제시한다. 현재 전투체계는 안전성과 신뢰성이 중시되는 소프트웨어이므로 신뢰성이 보장되지 않은 AI 기술의 적용을 정책상 제한하고 있으며, 이로 인하여 전력화된 국내 전투체계는 AI 기술을 탑재하고 있지 않다. 하지만 AI의 전력화를 목표로 하는 국방부의 정책 방향에 대응하기 위하여, 전투체계의 머신러닝 적용에 필요한 기반 기술을 확보하기 위한 연구를 실시하였다. 이 연구는 위협지수 평가에 필요한 데이터를 수집한 뒤 데이터 가공 및 정제, 머신러닝 모델 선정 및 최적의 하이퍼 파리미터를 선정하여 학습된 모델의 예측 정확도를 판단하였다. 그 결과 테스트 데이터에 대한 모델 점수가 99점 이상으로 도출되었으며 전투체계에 머신러닝 모델의 적용 가능성을 확인하였다.