• Title/Summary/Keyword: 데이터 기반 의사결정

Search Result 783, Processing Time 0.032 seconds

Analysis of Customer Behavior and Trend of Manufacture (제조업분야의 고객 성향 및 추이 분석)

  • Lee, Byoung-Yup;Yim, Seung-Bin;Park, Yong-Hoon;Yoo, Jae-Soo
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.6
    • /
    • pp.336-343
    • /
    • 2009
  • Companies often use database for performing task more efficiently and data mining for marketing and production efficiency through analyzing of the stored database. The use of the knowledge through the data mining maintains and provides a direction of development for the company. It could be as an additional competitive power for the company when decision making is necessary. This study is designing a model that predicts a rating of existing customer and consumption pattern with using actual data of the manufacturer and data mining methodology. The objective of this model is to improve profits for the company and brand value through connecting the marketing with identifying the customer's rating and consumer behavior.

A Study on Educational Data Mining for Public Data Portal through Topic Modeling Method with Latent Dirichlet Allocation (LDA기반 토픽모델링을 활용한 공공데이터 기반의 교육용 데이터마이닝 연구)

  • Seungki Shin
    • Journal of The Korean Association of Information Education
    • /
    • v.26 no.5
    • /
    • pp.439-448
    • /
    • 2022
  • This study aims to search for education-related datasets provided by public data portals and examine what data types are constructed through classification using topic modeling methods. Regarding the data of the public data portal, 3,072 cases of file data in the education field were collected based on the classification system. Text mining analysis was performed using the LDA-based topic modeling method with stopword processing and data pre-processing for each dataset. Program information and student-supporting notifications were usually provided in the pre-classified dataset for education from the data portal. On the other hand, the characteristics of educational programs and supporting information for the disabled, parents, the elderly, and children through the perspective of lifelong education were generally indicated in the dataset collected by searching for education. The results of data analysis through this study show that providing sufficient educational information through the public data portal would be better to help the students' data science-based decision-making and problem-solving skills.

Semi-Supervised Learning to Predict Default Risk for P2P Lending (준지도학습 기반의 P2P 대출 부도 위험 예측에 대한 연구)

  • Kim, Hyun-jung
    • Journal of Digital Convergence
    • /
    • v.20 no.4
    • /
    • pp.185-192
    • /
    • 2022
  • This study investigates the effect of the semi-supervised learning(SSL) method on predicting default risk of peer-to-peer(P2P) loans. Despite its proven performance, the supervised learning(SL) method requires labeled data, which may require a lot of effort and resources to collect. With the rapid growth of P2P platforms, the number of loans issued annually that have no clear final resolution is continuously increasing leading to abundance in unlabeled data. The research data of P2P loans used in this study were collected on the LendingClub platform. This is why an SSL model is needed to predict the default risk by using not only information from labeled loans(fully paid or defaulted) but also information from unlabeled loans. The results showed that in terms of default risk prediction and despite the use of a small number of labeled data, the SSL method achieved a much better default risk prediction performance than the SL method trained using a much larger set of labeled data.

Food Exchange Table Organization Model Based on Decision Tree Using Machine Learning (머신러닝을 이용한 의사결정트리 기반의 식품교환표 구성 모델)

  • Kim, JiYun;Lee, Sangmin;Jeon, Hyeongjun;Kim, Gaeun;Kim, Ji-Hyun;Park, Naeun;Jin, ChangGyun;Kwon, Jin young;Kim Jongwan
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2020.11a
    • /
    • pp.680-684
    • /
    • 2020
  • 최근 국내에서는 식품에 대한 관심도가 높아짐에 따라 먹거리에 건강·환경·미래지향적 가치가 부여되고 있으며 식품 산업에서도 신규 식품 개발이 증가하는 추세이다. 식단을 구성할 때 기준이 되는 식품교환표는 개정과정에서 많은 인력과 시간이 소요되기 때문에 식품 섭취 변화를 신속하게 반영하기 어렵다. 본 논문에서는 식품교환표의 활용도를 높이기 위한 식품교환표 갱신 기법을 제안한다. 제안 기법은 의사결정트리 모델을 학습하여 새롭게 추가된 식품의 정보를 바탕으로 식품군을 분류하여 식품교환표를 갱신한다. 이는 영양 관리가 필요한 당뇨병 환자 등에게 실용적이며 기호성·다양성이 높은 식단을 구성하는 데 도움을 준다.

Data-driven Analysis for Future Land-use Change Prediction : Case Study on Seoul (서울 데이터 기반 필지별 용도전환 발생 예측)

  • Yun, Sung Bum;Mun, Sungchul;Park, Soon Yong;Kim, Taehyun
    • Journal of Broadcast Engineering
    • /
    • v.25 no.2
    • /
    • pp.176-184
    • /
    • 2020
  • Due to constant development and decline on Seoul areas the Seoul government is pushing various policies to regenerate declined Seoul areas. Theses various policies lead to land-use changes around numerous Seoul districts. This study aims to create prediction model which can foresee future land-use changes and while doing so, tried to derive various influential factors which leads to land-use changes. To do so, various open-data from national departments and Seoul government have been collected and implemented into random forest algorithm. The results showed promising accuracy and derived multiple influential factors which causes land-use changes around Seoul districts. The result of this study could further be implemented in policy makings for the public sectors, or could also be used as basis for studying gentrification problems happening in Seoul Area.

A Study on the Timing of Starting Pitcher Replacement Using Machine Learning (머신러닝을 활용한 선발 투수 교체시기에 관한 연구)

  • Noh, Seongjin;Noh, Mijin;Han, Mumoungcho;Um, Sunhyun;Kim, Yangsok
    • Smart Media Journal
    • /
    • v.11 no.2
    • /
    • pp.9-17
    • /
    • 2022
  • The purpose of this study is to implement a predictive model to support decision-making to replace a starting pitcher before a crisis situation in a baseball game. To this end, using the Major League Statcast data provided by Baseball Savant, we implement a predictive model that preemptively replaces starting pitchers before a crisis situation. To this end, first, the crisis situation that the starting pitcher faces in the game was derived through data exploration. Second, if the starting pitcher was replaced before the end of the inning, learning was carried out by composing a label with a replacement in the previous inning. As a result of comparing the trained models, the model based on the ensemble method showed the highest predictive performance with an F1-Score of 65%. The practical significance of this study is that the proposed model can contribute to increasing the team's winning probability by replacing the starting pitcher before a crisis situation, and the coach will be able to receive data-based strategic decision-making support during the game.

The Development of the Data Mining Agent for eCRM (eCRM을 위한 데이터마이닝 에지전트의 개발)

  • Son, Dal-Ho;Hong, Duck-Hoon
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.11 no.5
    • /
    • pp.236-244
    • /
    • 2006
  • Many attempts have been made to track the web usage patterns and provide suggestions that might help web operators get the information they need. These tracking mechanisms rely on mining web log files for usage patterns. The purpose of this study is to verify a web agent prototype that was built for mining web log files. The web agent for this paper was made by Java and ASP and the agent came into being as part of a cookie for a short-term data storage. For long-term data storage, the agent used a My-SQL as a Data Base. This agent system could inform that if the data comes from the web data mining agent, it could be a rapid information providing method rather than the case of data coming into a data mining tool. Therefore, the developed tool in this study will be helpful as a new kind of decision making system and expert system.

  • PDF

A Study on Development of the Spatial Data Visualization System Based on Statistical Data (통계 데이터 기반 공간 데이터 시각화 시스템 기술 개발에 관한 연구)

  • Jang, Kyung-Soon;Noh, Ho-Jin;Back, Yong;Lee, Chang-Sik;Kim, Byung-Gyu
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2012.11a
    • /
    • pp.1190-1193
    • /
    • 2012
  • 본 논문에서는 공간 데이터를 기반으로 한 도시 정보를 알고리즘을 통해서 시각화하고 시각화된 데이터를 지도와 사상(Mapping)하여 볼 수 있게 할 뿐만 아니라 공간적 정보를 기반으로 의사 결정을 하는 경우 활용할 수 있는 문서를 사용자의 간단한 조작으로 프로그래밍 방식에 의해 작성해주는 시스템을 제안한다. 본 시스템을 통해서 공간적 정보를 시각화를 한다면 지역의 공간적 정보를 쉽게 파악할 수 있을 것이다. 또한 분석된 정보를 기반으로 제공되고 있는 문서 자동화를 활용한다면 공간적 정보의 문서화에 필요한 많은 시간과 비용을 절감할 수 있을 것으로 기대된다.

A Study on the Node Split in Decision Tree with Multivariate Target Variables (다변량 목표변수를 갖는 의사결정나무의 노드분리에 관한 연구)

  • Kim, Seong-Jun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.13 no.4
    • /
    • pp.386-390
    • /
    • 2003
  • Data mining is a process of discovering useful patterns for decision making from an amount of data. It has recently received much attention in a wide range of business and engineering fields. Classifying a group into subgroups is one of the most important subjects in data mining. Tree-based methods, known as decision trees, provide an efficient way to finding the classification model. The primary concern in tree learning is to minimize a node impurity, which is evaluated using a target variable in the data set. However, there are situations where multiple target variable should be taken into account, for example, such as manufacturing process monitoring, marketing science, and clinical and health analysis. The purpose of this article is to present some methods for measuring the node impurity, which are applicable to data sets with multivariate target variables. For illustration, a numerical cxample is given with discussion.

A Study on Decision Support by Comparison of Environmental Performance before and after Project (사업 전후 환경성 비교를 통한 의사결정 지원 연구)

  • Kim, Gil-Ho;Yeo, Kyu-Dong;Kim, Hyun-Jung;Lee, Sang-Won
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2011.05a
    • /
    • pp.455-455
    • /
    • 2011
  • 개발로 인한 환경변화는 관련 분석모형을 통해 직접적으로 예측하기 하는 것이 가장 바람직하지만 데이터 취득의 어려움, 분석 방법론의 부재 등의 이유로 정량적 평가가 어려운 현실이다. 그렇기 때문에 수자원사업을 계획시 대부분 환경적인 영향을 매우 정성적인 형태로 평가하거나 수질과 같은 대표적인 항목에 대해서만 예측하는 수준이다. 기존의 연구 또한, 유역 또는 행정구역의 현재의 현 상황을 평가하기 위한 것이 주이며, 수자원사업과 관련성이 적은 항목도 일부 포함되어 있기 때문에 수자원사업의 특수성을 반영하기에 한계가 있다. 현 상황의 이러한 문제점을 인식하여 본 연구는 오늘날 대표적 의사결정 기법이라 할 수 있는 계층화분석과정(AHP)과 다속성효용이론(MAUT)을 활용하여 향후 수자원사업과 관련된 다기준의 사결정 과정에서의 환경성 평가방안을 제시하였다. 환경성 평가기준은 수질, 경관, 생태계 이렇게 세 가지 항목으로 구성하였고, 각 평가기준에 대한 수준을 직접적으로 대변 가능한 정량화 방안을 제시하였다. 그리고 앞서 정량화된 값을 표준화하기 위하여 MAUT 기법으로부터 평가기준별 효용함수를 도출하였다. 한편, 사업을 시행함에 따라 예상되는 환경성변화는 사업전 환경성과 사업 후 환경성을 비교하도록 하였고, 이때 해당사업의 특수성을 반영하고자 별도의 설문과정을 통해 평가기준별 가중치를 결정하였다. 본 연구는 환경성 검토시 생태학적, 물리적 분석에 기반을 둔 정량적 예측의 어려움을 보완하기 위해 정성적 예측을 추가적으로 제시하였고, 사업의 특수성과 평가항목이 갖는 일반성을 명확히 구분하여 의사결정 과정에서 주관적인 요소를 최소화하였다. 또한, 평가항목별 사업전후의 환경성을 비교, 검토함으로써 실제 사업추진 과정에서 개발로 인한 부정적 영향의 사전예방에 도움을 줄 수 있을 것으로 판단된다.

  • PDF