• Title/Summary/Keyword: support vector regression (서포트벡터회귀)

Predicting Snow Damage and Suggesting Improvement Plans Using Deep Learning (딥러닝을 이용한 대설피해액 예측 및 개선방안 제안)

  • Lee, HyeongJoo;Chung, Gunhui
    • Proceedings of the Korea Water Resources Association Conference / 2021.06a / pp.485-485 / 2021
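  • Recent global climate anomalies have not only increased the frequency of natural disasters but have also made the resulting damage increasingly diverse and large in scale. Disaster damage affects not only the area where it occurs but also the national economy as a whole. Among natural disasters in Korea, heavy snow occurs less often than other hazards but causes damage over wide areas, and the amount of damage is large relative to the damaged area. Vulnerability analyses show that the Gangwon region is currently the most vulnerable, but the vulnerable area is projected to expand in the future along an axis connecting the Gangwon, Chungcheong, and Honam regions. This study set out to develop a heavy-snow damage prediction model that predicts the amount of heavy-snow damage in Korea using machine learning techniques, which are now widely used across society. Random forest, support vector machine, and artificial neural network techniques were used, and the model variables were drawn from meteorological observation data and socioeconomic factors. The predictive power of a model developed with multiple regression in a previous study was compared with that of the models developed with the three machine learning techniques in this study, and the model with the highest predictive power was presented. If the model and the quality of the data are improved on the basis of these results, rough preparation for future heavy-snow damage is expected to become possible.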

Estimation of Software Reliability with Immune Algorithm and Support Vector Regression (면역 알고리즘 기반의 서포트 벡터 회귀를 이용한 소프트웨어 신뢰도 추정)

  • Kwon, Ki-Tae;Lee, Joon-Kil
    • Journal of Information Technology Services / v.8 no.4 / pp.129-140 / 2009
  • Accurate estimation of software reliability is important to successful software development. Until recently, models based on statistical regression analysis and machine learning methods have been used. This paper estimates software reliability using support vector regression, a machine learning technique. It also finds the best set of parameters by applying an immune algorithm while varying the number of generations, memory cells, and alleles. The proposed IA-SVR model outperforms some recent results reported in the literature.

Design of controller using Support Vector Regression (서포트 벡터 회귀를 이용한 제어기 설계)

  • Hwang, Ji-Hwan;Kwak, Hwan-Joo;Park, Gwi-Tae
    • Proceedings of the IEEK Conference / 2009.05a / pp.320-322 / 2009
  • Support vector learning has attracted great interest in the areas of pattern classification, function approximation, and abnormality detection. In this paper, we design a controller using support vector regression, which has good properties in comparison with the multi-layer perceptron or radial basis function network. The applicability of the presented method is illustrated via an example simulation.

Comparison of Methodologies for Characterizing Pedestrian-Vehicle Collisions (보행자-차량 충돌사고 특성분석 방법론 비교 연구)

  • Choi, Saerona;Jeong, Eunbi;Oh, Cheol
    • Journal of Korean Society of Transportation / v.31 no.6 / pp.53-66 / 2013
  • The major purpose of this study is to evaluate methodologies to predict the injury severity of pedestrian-vehicle collisions. Methodologies to be evaluated and compared in this study include Binary Logistic Regression(BLR), Ordered Probit Model(OPM), Support Vector Machine(SVM) and Decision Tree(DT) method. Valuable insights into applying methodologies to analyze the characteristics of pedestrian injury severity are derived. For the purpose of identifying causal factors affecting the injury severity, statistical approaches such as BLR and OPM are recommended. On the other hand, to achieve better prediction performance, heuristic approaches such as SVM and DT are recommended. It is expected that the outcome of this study would be useful in developing various countermeasures for enhancing pedestrian safety.

A Differential Pricing Model for Industrial Land based on Locational Characteristics (입지특성을 고려한 토지가격의 차등적 산정방안 - 산업시설용지 공급가격을 중심으로 -)

  • Shim, Jae Heon
    • KSCE Journal of Civil and Environmental Engineering Research / v.31 no.2D / pp.303-314 / 2011
  • This paper proposes a differential pricing model for industrial land based on locational characteristics, using Support Vector Regression (SVR) as a land pricing methodology. The initial selling price of industrial land is set based on the total cost of site development that comprises the land acquisition cost and tax, land development expense, infrastructure installation cost, labor cost, migration expense, selling and administrative expense, capital cost, and so on. However, the current industrial land pricing method unreasonably applies the same price per square meter to all parcels within an industrial complex without considering differences in price depending on the location of each parcel. Therefore, this paper proposes an empirical land pricing model to solve this irrationality and verifies its validity and applicability.

A hidden Markov model for predicting global stock market index (은닉 마르코프 모델을 이용한 국가별 주가지수 예측)

  • Kang, Hajin;Hwang, Beom Seuk
    • The Korean Journal of Applied Statistics / v.34 no.3 / pp.461-475 / 2021
  • The hidden Markov model (HMM) is a statistical model in which the system consists of two elements, hidden states and observable results. The HMM has been actively used in various fields, especially for time series data in the financial sector, since it accommodates a variety of mathematical structures. Based on HMM theory, this research applies the model to predicting the domestic KOSPI200 stock index as well as global stock indexes such as NIKKEI225, HSI, S&P500 and FTSE100. In addition, we compare the results of the HMM with those of support vector regression (SVR), which is frequently used to predict stock prices owing to recent developments in the artificial intelligence sector.

Research Trend analysis for Seismic Data Interpolation Methods using Machine Learning (머신러닝을 사용한 탄성파 자료 보간법 기술 연구 동향 분석)

  • Bae, Wooram;Kwon, Yeji;Ha, Wansoo
    • Geophysics and Geophysical Exploration / v.23 no.3 / pp.192-207 / 2020
  • We acquire seismic data with regularly or irregularly missing traces, due to economic, environmental, and mechanical problems. Since these missing data adversely affect the results of seismic data processing and analysis, we need to reconstruct the missing data before subsequent processing. However, there are economic and temporal burdens to conducting further exploration and reconstructing missing parts. Many researchers have been studying interpolation methods to accurately reconstruct missing data. Recently, various machine learning technologies such as support vector regression, autoencoder, U-Net, ResNet, and generative adversarial network (GAN) have been applied in seismic data interpolation. In this study, by reviewing these studies, we found that not only neural network models, but also support vector regression models that have relatively simple structures can interpolate missing parts of seismic data effectively. We expect that future research can improve the interpolation performance of these machine learning models by using open-source field data, data augmentation, transfer learning, and regularization based on conventional interpolation technologies.

A study on the development of severity-adjusted mortality prediction model for discharged patient with acute stroke using machine learning (머신러닝을 이용한 급성 뇌졸중 퇴원 환자의 중증도 보정 사망 예측 모형 개발에 관한 연구)

  • Baek, Seol-Kyung;Park, Jong-Ho;Kang, Sung-Hong;Park, Hye-Jin
    • Journal of the Korea Academia-Industrial cooperation Society / v.19 no.11 / pp.126-136 / 2018
  • The purpose of this study was to develop a severity-adjustment model for predicting mortality in acute stroke patients using machine learning. Using the Korean National Hospital Discharge In-depth Injury Survey from 2006 to 2015, the study population with disease codes I60-I63 (KCD 7) was extracted for further analysis. Three tools were used for the severity adjustment of comorbidity: the Charlson Comorbidity Index (CCI), the Elixhauser Comorbidity Index (ECI), and the Clinical Classification Software (CCS). The severity-adjustment models for mortality prediction in patients with acute stroke were developed using logistic regression, decision tree, neural network, and support vector machine methods. The most common comorbid diseases in stroke patients were hypertension, uncomplicated (43.8%) in the ECI, and essential hypertension (43.9%) in the CCS. Among the CCI, ECI, and CCS, the CCS had the highest AUC value and was confirmed as the best severity-adjustment tool. In addition, the AUC values for the CCS variables, including main diagnosis, gender, age, hospitalization route, and existence of surgery, were 0.808 for logistic regression, 0.785 for the decision tree, 0.809 for the neural network, and 0.830 for the support vector machine. Therefore, the best predictive power was achieved by the support vector machine technique. The results of this study can be used in the establishment of health policy in the future.

A Study on the Number of Domestic Food Delivery Services (국내 배달음식 이용건수 분석 및 예측)

  • Kwon, Jaeyoung;Kim, Sinae;Park, Eungee;Song, Jongwoo
    • The Korean Journal of Applied Statistics / v.28 no.5 / pp.977-990 / 2015
  • Food delivery services are well developed in the Republic of Korea. The increase in one-person households and the success of mobile applications influence delivery services these days. We consider a prediction model for the food delivery service based on weather and dates to predict the number of food delivery services in 2014 using various data mining techniques. We use linear regression, random forest, gradient boosting, support vector machines, neural networks, and logistic regression to find the best prediction model. There are four categories of food delivery services and we consider two methods. For the first method, we estimate the total number of delivery services and the posterior probabilities of each delivery service. For the second method, we use different models for each category and combine them to estimate the total number of delivery services. The neural network and linear regression models perform best for the first method, and the neural network performs best for the second method. The results show that we can estimate the number of deliveries accurately based on dates and weather information.

A study on entertainment TV show ratings and the number of episodes prediction (국내 예능 시청률과 회차 예측 및 영향요인 분석)

  • Kim, Milim;Lim, Soyeon;Jang, Chohee;Song, Jongwoo
    • The Korean Journal of Applied Statistics / v.30 no.6 / pp.809-825 / 2017
  • The number of TV entertainment shows is increasing. Competition among programs in the entertainment market is intensifying since cable channels air many entertainment TV shows. There is now a need for research on program ratings and the number of episodes. This study presents predictive models for entertainment TV show ratings and the number of episodes. We use various data mining techniques such as linear regression, logistic regression, LASSO, random forests, gradient boosting, and support vector machines. The analysis results show that the average program rating before the first broadcast is affected by the broadcasting company, the average rating of the previous season, the starting year, and the number of articles. The average program rating after the first broadcast is influenced by the rating of the first broadcast, the broadcasting company, and the program type. We also found that the predicted average rating, starting year, program type, and broadcasting company are important variables in predicting the number of episodes.