• 제목/요약/키워드: Azure Machine Learning Studio

검색결과 5건 처리시간 0.018초

머신러닝 알고리즘 기반의 의료비 예측 모델 개발 (Development of Medical Cost Prediction Model Based on the Machine Learning Algorithm)

  • Han Bi KIM;Dong Hoon HAN
    • Journal of Korea Artificial Intelligence Association
    • /
    • 제1권1호
    • /
    • pp.11-16
    • /
    • 2023
  • Accurate hospital case modeling and prediction are crucial for efficient healthcare. In this study, we demonstrate the implementation of regression analysis methods in machine learning systems utilizing mathematical statics and machine learning techniques. The developed machine learning model includes Bayesian linear, artificial neural network, decision tree, decision forest, and linear regression analysis models. Through the application of these algorithms, corresponding regression models were constructed and analyzed. The results suggest the potential of leveraging machine learning systems for medical research. The experiment aimed to create an Azure Machine Learning Studio tool for the speedy evaluation of multiple regression models. The tool faciliates the comparision of 5 types of regression models in a unified experiment and presents assessment results with performance metrics. Evaluation of regression machine learning models highlighted the advantages of boosted decision tree regression, and decision forest regression in hospital case prediction. These findings could lay the groundwork for the deliberate development of new directions in medical data processing and decision making. Furthermore, potential avenues for future research may include exploring methods such as clustering, classification, and anomaly detection in healthcare systems.

Applications of Machine Learning Models on Yelp Data

  • Ruchi Singh;Jongwook Woo
    • Asia pacific journal of information systems
    • /
    • 제29권1호
    • /
    • pp.35-49
    • /
    • 2019
  • The paper attempts to document the application of relevant Machine Learning (ML) models on Yelp (a crowd-sourced local business review and social networking site) dataset to analyze, predict and recommend business. Strategically using two cloud platforms to minimize the effort and time required for this project. Seven machine learning algorithms in Azure ML of which four algorithms are implemented in Databricks Spark ML. The analyzed Yelp business dataset contained 70 business attributes for more than 350,000 registered business. Additionally, review tips and likes from 500,000 users have been processed for the project. A Recommendation Model is built to provide Yelp users with recommendations for business categories based on their previous business ratings, as well as the business ratings of other users. Classification Model is implemented to predict the popularity of the business as defining the popular business to have stars greater than 3 and unpopular business to have stars less than 3. Text Analysis model is developed by comparing two algorithms, uni-gram feature extraction and n-feature extraction in Azure ML studio and logistic regression model in Spark. Comparative conclusions have been made related to efficiency of Spark ML and Azure ML for these models.

Prediction of the number of public bicycle rental in Seoul using Boosted Decision Tree Regression Algorithm

  • KIM, Hyun-Jun;KIM, Hyun-Ki
    • 한국인공지능학회지
    • /
    • 제10권1호
    • /
    • pp.9-14
    • /
    • 2022
  • The demand for public bicycles operated by the Seoul Metropolitan Government is increasing every year. The size of the Seoul public bicycle project, which first started with about 5,600 units, increased to 3,7500 units as of September 2021, and the number of members is also increasing every year. However, as the size of the project grows, excessive budget spending and deficit problems are emerging for public bicycle projects, and new bicycles, rental office costs, and bicycle maintenance costs are blamed for the deficit. In this paper, the Azure Machine Learning Studio program and the Boosted Decision Tree Regression technique are used to predict the number of public bicycle rental over environmental factors and time. Predicted results it was confirmed that the demand for public bicycles was high in the season except for winter, and the demand for public bicycles was the highest at 6 p.m. In addition, in this paper compare four additional regression algorithms in addition to the Boosted Decision Tree Regression algorithm to measure algorithm performance. The results showed high accuracy in the order of the First Boosted Decision Tree Regression Algorithm (0.878802), second Decision Forest Regression (0.838232), third Poison Regression (0.62699), and fourth Linear Regression (0.618773). Based on these predictions, it is expected that more public bicycles will be placed at rental stations near public transportation to meet the growing demand for commuting hours and that more bicycles will be placed in rental stations in summer than winter and the life of bicycles can be extended in winter.

A study on Natural Disaster Prediction Using Multi-Class Decision Forest

  • Eom, Tae-Hyuk;Kim, Kyung-A
    • 한국인공지능학회지
    • /
    • 제10권1호
    • /
    • pp.1-7
    • /
    • 2022
  • In this paper, a study was conducted to predict natural disasters in Afghanistan based on machine learning. Natural disasters need to be prepared not only in Korea but also in other vulnerable countries. Every year in Afghanistan, natural disasters(snow, earthquake, drought, flood) cause property and casualties. We decided to conduct research on this phenomenon because we thought that the damage would be small if we were to prepare for it. The Azure Machine Learning Studio used in the study has the advantage of being more visible and easier to use than other Machine Learning tools. Decision Forest is a model for classifying into decision tree types. Decision forest enables intuitive analysis as a model that is easy to analyze results and presents key variables and separation criteria. Also, since it is a nonparametric model, it is free to assume (normality, independence, equal dispersion) required by the statistical model. Finally, linear/non-linear relationships can be searched considering interactions between variables. Therefore, the study used decision forest. The study found that overall accuracy was 89 percent and average accuracy was 97 percent. Although the results of the experiment showed a little high accuracy, items with low natural disaster frequency were less accurate due to lack of learning. By learning and complementing more data, overall accuracy can be improved, and damage can be reduced by predicting natural disasters.

A Study on Prediction of Business Status Based on Machine Learning

  • Kim, Ki-Pyeong;Song, Seo-Won
    • 한국인공지능학회지
    • /
    • 제6권2호
    • /
    • pp.23-27
    • /
    • 2018
  • Korea has a high proportion of self-employment. Many of them start the food business since it does not require high-techs and it is possible to start the business relatively easily compared to many others in business categories. However, the closure rate of the business is also high due to excessive competition and market saturation. Cafés and restaurants are examples of food business where the business analysis is highly important. However, for most of the people who want to start their own business, it is difficult to conduct systematic business analysis such as trade area analysis or to find information for business analysis. Therefore, in this paper, we predicted business status with simple information using Microsoft Azure Machine Learning Studio program. Experimental results showed higher performance than the number of attributes, and it is expected that this artificial intelligence model will be helpful to those who are self-employed because it can easily predict the business status. The results showed that the overall accuracy was over 60 % and the performance was high compared to the number of attributes. If this model is used, those who prepare for self-employment who are not experts in the business analysis will be able to predict the business status of stores in Seoul with simple attributes.