• Title/Summary/Keyword: Weighted Prediction

Search Result 243, Processing Time 0.03 seconds

Feature Selection and Classification of Protein CDS Using n-Block substring weighted Linear Model (N-Block substring 가중 선형모형을 이용한 단백질 CDS의 특징 추출 및 분류)

  • Choi, Seong-Yong;Kim, Jin-Su;Han, Seung-Jin;Choi, Jun-Hyeog;Rim, Kee-Wook;Lee, Jung-Hyun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.19 no.5
    • /
    • pp.730-736
    • /
    • 2009
  • It is more important to analysis of huge gemonics data in Bioinformatics. Here we present a novel datamining approach to predict structure and function using protein's primnary structure only. We propose not also to develope n-Block substring search algorithm in reducing enormous search space effectively in relation to feature selection, but to formulate weighted linear algorithm in a prediction of structure and function of a protein using primary structure. And we show efficient in protein domain characterization and classification by calculation weight value in determining domain association in each selected substring, and also reveal that more efficient results are acquired through claculated model score result in an inference about degree of association with each CDS(coding sequence) in domain.

Machine Learning Model for Predicting the Residual Useful Lifetime of the CNC Milling Insert (공작기계의 절삭용 인서트의 잔여 유효 수명 예측 모형)

  • Won-Gun Choi;Heungseob Kim;Bong Jin Ko
    • Journal of Advanced Navigation Technology
    • /
    • v.27 no.1
    • /
    • pp.111-118
    • /
    • 2023
  • For the implementation of a smart factory, it is necessary to collect data by connecting various sensors and devices in the manufacturing environment and to diagnose or predict failures in production facilities through data analysis. In this paper, to predict the residual useful lifetime of milling insert used for machining products in CNC machine, weight k-NN algorithm, Decision Tree, SVR, XGBoost, Random forest, 1D-CNN, and frequency spectrum based on vibration signal are investigated. As the results of the paper, the frequency spectrum does not provide a reliable criterion for an accurate prediction of the residual useful lifetime of an insert. And the weighted k-nearest neighbor algorithm performed best with an MAE of 0.0013, MSE of 0.004, and RMSE of 0.0192. This is an error of 0.001 seconds of the remaining useful lifetime of the insert predicted by the weighted-nearest neighbor algorithm, and it is considered to be a level that can be applied to actual industrial sites.

Spatial Hedonic Modeling using Geographically Weighted LASSO Model (GWL을 적용한 공간 헤도닉 모델링)

  • Jin, Chanwoo;Lee, Gunhak
    • Journal of the Korean Geographical Society
    • /
    • v.49 no.6
    • /
    • pp.917-934
    • /
    • 2014
  • Geographically weighted regression(GWR) model has been widely used to estimate spatially heterogeneous real estate prices. The GWR model, however, has some limitations of the selection of different price determinants over space and the restricted number of observations for local estimation. Alternatively, the geographically weighted LASSO(GWL) model has been recently introduced and received a growing interest. In this paper, we attempt to explore various local price determinants for the real estate by utilizing the GWL and its applicability to forecasting the real estate price. To do this, we developed the three hedonic models of OLS, GWR, and GWL focusing on the sales price of apartments in Seoul and compared those models in terms of model fit, prediction, and multicollinearity. As a result, local models appeared to be better than the global OLS on the whole, and in particular, the GWL appeared to be more explanatory and predictable than other models. Moreover, the GWL enabled to provide spatially different sets of price determinants which no multicollinearity exists. The GWL helps select the significant sets of independent variables from a high dimensional dataset, and hence will be a useful technique for large and complex spatial big data.

  • PDF

Nonlinear Time Series Prediction Modeling by Weighted Average Defuzzification Based on NEWFM (NEWFM 기반 가중평균 역퍼지화에 의한 비선형 시계열 예측 모델링)

  • Chai, Soo-Han;Lim, Joon-Shik
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.17 no.4
    • /
    • pp.563-568
    • /
    • 2007
  • This paper presents a methodology for predicting nonlinear time series based on the neural network with weighted fuzzy membership functions (NEWFM). The degree of classification intensity is obtained by bounded sum of weighted fuzzy membership functions extracted by NEWFM, then weighted average defuzzification is used for predicting nonlinear time series. The experimental results demonstrate that NEWFM has the classification capability of 92.22% against the target class of GDP. The time series created by NEWFM model has a relatively close approximation to the GDP which is a typical business cycle indicator, and has been proved to be a useful indicator which has the turning point forecasting capability of average 12 months in the peak point and average 6 months in the trough point during 5th to 8th cyclical period. In addition, NEWFM measures the efficiency of the economic indexes by the feature selection and enables the users to forecast with reduced numbers of 7 among 10 leading indexes while improving the classification rate from 90% to 92.22%.

Ensemble Method for Predicting Particulate Matter and Odor Intensity (미세먼지, 악취 농도 예측을 위한 앙상블 방법)

  • Lee, Jong-Yeong;Choi, Myoung Jin;Joo, Yeongin;Yang, Jaekyung
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.42 no.4
    • /
    • pp.203-210
    • /
    • 2019
  • Recently, a number of researchers have produced research and reports in order to forecast more exactly air quality such as particulate matter and odor. However, such research mainly focuses on the atmospheric diffusion models that have been used for the air quality prediction in environmental engineering area. Even though it has various merits, it has some limitation in that it uses very limited spatial attributes such as geographical attributes. Thus, we propose the new approach to forecast an air quality using a deep learning based ensemble model combining temporal and spatial predictor. The temporal predictor employs the RNN LSTM and the spatial predictor is based on the geographically weighted regression model. The ensemble model also uses the RNN LSTM that combines two models with stacking structure. The ensemble model is capable of inferring the air quality of the areas without air quality monitoring station, and even forecasting future air quality. We installed the IoT sensors measuring PM2.5, PM10, H2S, NH3, VOC at the 8 stations in Jeonju in order to gather air quality data. The numerical results showed that our new model has very exact prediction capability with comparison to the real measured data. It implies that the spatial attributes should be considered to more exact air quality prediction.

On the Geometric Anisotropy Inherent In Spatial Data (공간자료의 기하학적 비등방성 연구)

  • Go, Hye Ji;Park, Man Sik
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.5
    • /
    • pp.755-771
    • /
    • 2014
  • Isotropy is one of the main assumptions for the ease of spatial prediction (named kriging) based on some covariance models. A lack of isotropy (or anisotropy) in a spatial process necessitates that some additional parameters (angle and ratio) for anisotropic covariance model be obtained in order to produce a more reliable prediction. In this paper, we propose a new class of geometrically extended anisotropic covariance models expressed as a weighted average of some geometrically anisotropic models. The maximum likelihood estimation method is taken into account to estimate the parameters of our interest. We evaluate the performances of our proposal and compare it with an isotropic covariance model and a geometrically anisotropic model in simulation studies. We also employ extended geometric anisotropy to the analysis of real data.

A Consensus Technique for Tropical Cyclone Intensity Prediction over the Western North Pacific (북서태평양 태풍 강도 예측 컨센서스 기법)

  • Oh, Youjung;Moon, Il-Ju;Lee, Woojeong
    • Atmosphere
    • /
    • v.28 no.3
    • /
    • pp.291-303
    • /
    • 2018
  • In this study, a new consensus technique for predicting tropical cyclone (TC) intensity in the western North Pacific was developed. The most important feature of the present consensus model is to select and combine the guidance numerical models with the best performance in the previous years based on various evaluation criteria and averaging methods. Specifically, the performance of the guidance models was evaluated using both the mean absolute error and the correlation coefficient for each forecast lead time, and the number of the numerical models used for the consensus model was not fixed. In averaging multiple models, both simple and weighted methods are used. These approaches are important because that the performance of the available guidance models differs according to forecast lead time and is changing every year. In particular, this study develops both a multi-consensus model (M-CON), which constructs the best consensus models with the lowest error for each forecast lead time, and a single best consensus model (S-CON) having the lowest 72-hour cumulative mean error, through on training process. The evaluation results of the selected consensus models for the training and forecast periods reveal that the M-CON and S-CON outperform the individual best-performance guidance models. In particular, the M-CON showed the best overall performance, having advantages in the early stages of prediction. This study finally suggests that forecaster needs to use the latest evaluation results of the guidance models every year rather than rely on the well-known accuracy of models for a long time to reduce prediction error.

GA-based Normalization Approach in Back-propagation Neural Network for Bankruptcy Prediction Modeling (유전자알고리즘을 기반으로 하는 정규화 기법에 관한 연구 : 역전파 알고리즘을 이용한 부도예측 모형을 중심으로)

  • Tai, Qiu-Yue;Shin, Kyung-Shik
    • Journal of Intelligence and Information Systems
    • /
    • v.16 no.3
    • /
    • pp.1-14
    • /
    • 2010
  • The back-propagation neural network (BPN) has long been successfully applied in bankruptcy prediction problems. Despite its wide application, some major issues must be considered before its use, such as the network topology, learning parameters and normalization methods for the input and output vectors. Previous studies on bankruptcy prediction with BPN have shown that many researchers are interested in how to optimize the network topology and learning parameters to improve the prediction performance. In many cases, however, the benefits of data normalization are often overlooked. In this study, a genetic algorithm (GA)-based normalization transform, which is defined as a linearly weighted combination of several different normalization transforms, will be proposed. GA is used to extract the optimal weight for the generalization. From the results of an experiment, the proposed method was evaluated and compared with other methods to demonstrate the advantage of the proposed method.

A Comparative study on smoothing techniques for performance improvement of LSTM learning model

  • Tae-Jin, Park;Gab-Sig, Sim
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.1
    • /
    • pp.17-26
    • /
    • 2023
  • In this paper, we propose a several smoothing techniques are compared and applied to increase the application of the LSTM-based learning model and its effectiveness. The applied smoothing technique is Savitky-Golay, exponential smoothing, and weighted moving average. Through this study, the LSTM algorithm with the Savitky-Golay filter applied in the preprocessing process showed significant best results in prediction performance than the result value shown when applying the LSTM model to Bitcoin data. To confirm the predictive performance results, the learning loss rate and verification loss rate according to the Savitzky-Golay LSTM model were compared with the case of LSTM used to remove complex factors from Bitcoin price prediction, and experimented with an average value of 20 times to increase its reliability. As a result, values of (3.0556, 0.00005) and (1.4659, 0.00002) could be obtained. As a result, since crypto-currencies such as Bitcoin have more volatility than stocks, noise was removed by applying the Savitzky-Golay in the data preprocessing process, and the data after preprocessing were obtained the most-significant to increase the Bitcoin prediction rate through LSTM neural network learning.

A study on the adaptive method of control model for tandem cold rolling mill (연속냉간압연기 제어모델의 적응수정방법에 관한 연구)

  • Lee, Won-Ho;Lee, Sang-Ryong
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.21 no.7
    • /
    • pp.1030-1041
    • /
    • 1997
  • The control model in the tandem cold rolling mill consists of many mathematical theories and is used to calculate the reference values such as the roll gap and the rolling speed for good operation of rolling mill. But, the control model used presently has a problem causing inaccurate prediction of the rolling force. By the parameter identification, it was found that the main factor causing inaccurate prediction of the rolling force was incorrect modeling of the friction coefficient and the flow stress. To get rid of the erroneous factor new adaptive schemes are suggested in this work. Those are a long-time adaptation by the iterative least-square method and a short-time adaptation by the recursive weighted least-square method respectively. The new equations for the friction coefficient and the flow stress are derived by applying the suggested adaptive algorithms. Through the on-line test in an actual mill, it is proved that the rolling force predicted by the new equations is more accurate than the one by the existing equations ever used.