• Title/Summary/Keyword: Regression Curve

Search Result 810, Processing Time 0.026 seconds

Comparison of Regression Models for Estimating Ventilation Rate of Mechanically Ventilated Swine Farm (강제환기식 돈사의 환기량 추정을 위한 회귀모델의 비교)

  • Jo, Gwanggon;Ha, Taehwan;Yoon, Sanghoo;Jang, Yuna;Jung, Minwoong
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.62 no.1
    • /
    • pp.61-70
    • /
    • 2020
  • To estimate the ventilation volume of mechanically ventilated swine farms, various regression models were applied, and errors were compared to select the regression model that can best simulate actual data. Linear regression, linear spline, polynomial regression (degrees 2 and 3), logistic curve, generalized additive model (GAM), and gompertz curve were compared. Overfitting models were excluded even when the error rate was small. The evaluation criteria were root mean square error (RMSE) and mean absolute percentage error (MAPE). The evaluation results indicated that degree 3 exhibited the lowest error rate; however, an overestimation contradiction was observed in a certain section. The logistic curve was the most stable and superior to all the models. In the estimation of ventilation volume by all of the models, the estimated ventilation volume of the logistic curve was the smallest except for the model with a large error rate and the overestimated model.

Comparison of Jump-Preserving Smoothing and Smoothing Based on Jump Detector

  • Park, Dong-Ryeon
    • Communications for Statistical Applications and Methods
    • /
    • v.16 no.3
    • /
    • pp.519-528
    • /
    • 2009
  • This paper deals with nonparametric estimation of discontinuous regression curve. Quite number of researches about this topic have been done. These researches are classified into two categories, the indirect approach and direct approach. The major goal of the indirect approach is to obtain good estimates of jump locations, whereas the major goal of the direct approach is to obtain overall good estimate of the regression curve. Thus it seems that two approaches are quite different in nature, so people say that the comparison of two approaches does not make much sense. Therefore, a thorough comparison of them is lacking. However, even though the main issue of the indirect approach is the estimation of jump locations, it is too obvious that we have an estimate of regression curve as the subsidiary result. The point is whether the subsidiary result of the indirect approach is as good as the main result of the direct approach. The performance of two approaches is compared through a simulation study and it turns out that the indirect approach is a very competitive tool for estimating discontinuous regression curve itself.

Estimation of Asymmetric Bell Shaped Probability Curve using Logistic Regression (로지스틱 회귀모형을 이용한 비대칭 종형 확률곡선의 추정)

  • 박성현;김기호;이소형
    • The Korean Journal of Applied Statistics
    • /
    • v.14 no.1
    • /
    • pp.71-80
    • /
    • 2001
  • Logistic regression model is one of the most popular linear models for a binary response variable and used for the estimation of probability function. In many practical situations, the probability function can be expressed by a bell shaped curve and such a function can be estimated by a second order logistic regression model. However, when the probability curve is asymmetric, the estimation results using a second order logistic regression model may not be precise because a second order logistic regression model is a symmetric function. In addition, even if a second order logistic regression model is used, the interpretation for the effect of second order term may not be easy. In this paper, in order to alleviate such problems, an estimation method for asymmetric probabiity curve based on a first order logistic regression model and iterative bi-section method is proposed and its performance is compared with that of a second order logistic regression model by a simulation study.

  • PDF

FUZZY REGRESSION TOWARDS A GENERAL INSURANCE APPLICATION

  • Kim, Joseph H.T.;Kim, Joocheol
    • Journal of applied mathematics & informatics
    • /
    • v.32 no.3_4
    • /
    • pp.343-357
    • /
    • 2014
  • In many non-life insurance applications past data are given in a form known as the run-off triangle. Smoothing such data using parametric crisp regression models has long served as the basis of estimating future claim amounts and the reserves set aside to protect the insurer from future losses. In this article a fuzzy counterpart of the Hoerl curve, a well-known claim reserving regression model, is proposed to analyze the past claim data and to determine the reserves. The fuzzy Hoerl curve is more flexible and general than the one considered in the previous fuzzy literature in that it includes a categorical variable with multiple explanatory variables, which requires the development of the fuzzy analysis of covariance, or fuzzy ANCOVA. Using an actual insurance run-off claim data we show that the suggested fuzzy Hoerl curve based on the fuzzy ANCOVA gives reasonable claim reserves without stringent assumptions needed for the traditional regression approach in claim reserving.

Prediction of Newborn Birthweight by the Measurement of Fundal Height and Gestational Period (임신기간 및 자궁저높이를 이용한 신생아 체중 예측)

  • Cho, Moon-Suk;Park, Young-Sook
    • 모자간호학회지
    • /
    • v.1
    • /
    • pp.34-44
    • /
    • 1991
  • The purposes of this study were to predict newborn birthweight by use of gestational period and fundal height and to identify growth curve of fundal height according to gestational period and growth curve of newborn birthweight according to fundal height. The subjects for the study were 802 women who delivered the normal newborn babies at Seoul National University Hospital from Sep. 1, 1981 to Aug.31, 1986. The data were collected bit chart review and analyzed nth SPSS program. The results of study were as follows : 1. The multiple regression equation ($R^2$=0.416) used for the prediction of newborn birthweight was y=(newborn birthweight, kg)=-4.421+0.075$x_1$(fundal height, cm)+0.053$x_2$(gestational period, weeks)+0.016$x_3$(abdominal girth, cm)+0.010$x_4$(maternal height, cm) 2. The growth curve of fundal height according to gestational period was obtained by polynomial regression. The regression equation was Y(fundal height, cm)=-36.78+18.58$log_ex$(gestational period, weeks) The growth curve of newborn birth weight according to fundal height was obtained by polynomial regression. The regression equation was Y(newborn birthweight, kg)=-8.09+3.27$log_ex$ (Fundal Height, cm) 3. In the following subgroups no significant difference was found in fundal height : engaged vs. nonengaged presentation, and nulliparous vs. multiparous women.

  • PDF

Parameter Estimation and Prediction for NHPP Software Reliability Model and Time Series Regression in Software Failure Data

  • Song, Kwang-Yoon;Chang, In-Hong
    • Journal of Integrative Natural Science
    • /
    • v.7 no.1
    • /
    • pp.67-73
    • /
    • 2014
  • We consider the mean value function for NHPP software reliability model and time series regression model in software failure data. We estimate parameters for the proposed models from two data sets. The values of SSE and MSE is presented from two data sets. We compare the predicted number of faults with the actual two data sets using the mean value function and regression curve.

Development of Regression Equation for Water Quantity Estimation in a Tidal River (감조하천에서의 저수위 유량산정 다중회귀식 개발)

  • Lee, Sang Jin;Ryoo, Kyong Sik;Lee, Bae Sung;Yoon, Jong Su
    • Journal of Korean Society on Water Environment
    • /
    • v.23 no.3
    • /
    • pp.385-390
    • /
    • 2007
  • Reliable flow measurement for dry season is very important to set up the in-stream flow exactly and total maximum daily load control program in the basin. Especially, in the points which tidal current effects are dominant because reliability of the low measurement decrease. The reliable measuring methods are needed. In this study, we analysis the water surface elevation difference of water surface elevation. Quantity relationship to consider tidal currents in these regions. It is known that tidal current effects from Nakdong river barrage are dominant in Samrangjin measuring station. We developed multiple regression equation with water surface elevation, quantity, and difference of water surface elevation and compared these results water measured rating curve. All of these regression equation including linear regression equation and log regression equation fits better measured data them existing water surface elevation quantity line and Among three equations, the log regression equation is best to represent the measured the rating curve in Samrangjin point. The log regression equation is useful method to obtain the quantity in the regions which tidal currents are dominant.

Standard Curve Validation using Trendlines in Excel (Excel의 추세선을 이용한 표준곡선 검증)

  • Lee, Kyung-Hwa;Park, Hyung-Ki;Shin, Young-Man
    • The Korean Journal of Nuclear Medicine Technology
    • /
    • v.20 no.2
    • /
    • pp.69-74
    • /
    • 2016
  • Purpose Using a regression formula of the trendline near the coefficient of determination (R2) "1" by substituting the dependent variable of the standard curve to calculate the values of the independent variable. To determine the suitability of a regression equation by comparing the difference between the independent variables of the standard curve and the predicted independent variables. Materials and Methods Perkin Elmer Gamma-Counter machine was used for Standard curve of regression methods. TSH. TG-Ag (Thyroglobulin Antigen), Insulin that used materials and method test to compare the result from the Excel trendline of the regression formula. Results Each of the value of coefficient of determination ($R^2$) and Trendline $R^2=1$, Polynomial Trendline for TSH, $R^2=1$, Polynomial Trendline for TG-Ag, $R^2=0.9994$, Polynomial Trendline for Insulin. Conclusion We confirmed that IRMA immune method is found to the nearest trends elected a standard curve using polynomial trendline. The independent variables to predict the trend by using a polynomial trendline formula containing the error was a limitation.

  • PDF

Variable Selection Criteria in Regression

  • Kim, Choong-Rak
    • Journal of the Korean Statistical Society
    • /
    • v.23 no.2
    • /
    • pp.293-301
    • /
    • 1994
  • In this paper we propose a variable selection criterion minimizing influence curve in regression, and compare it with other criteria such as $C_p$(Mallows 1973) and adjusted coefficient of determination. Examples and extension to the generalized linear models are given.

  • PDF

Application Method of Logistic Regression Analysis for Annoyance Prediction Model Based on Predicted Noise Level (예측소음도를 이용한 어노이언스 예측모델을 위한 로지스틱 회귀분석의 적용방법)

  • Son, Jin-Hee;Lee, Kun;Choung, Tae-Ryang;Chang, Seo-Il
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.20 no.6
    • /
    • pp.555-561
    • /
    • 2010
  • Predicted noise level has been used to assess the annoyance response since noise map was generalized and being the normal method to assess the environmental noise. Unfortunately using predicted noise level to derive the annoyance prediction curve caused some problems. The data have to be grouped manually to use the annoyance prediction curve. The aim of this paper is to propose the method to handle the predicted noise level and the survey data for annoyance prediction curve. This paper used the percentage of persons annoyed(%A) and the percentage of persons highly annoyed as the descriptor of noise annoyance in a population. The logistic regression method was used for deriving annoyance prediction curve. It is concluded that the method of dichotomizing data and logistic regression was suitable to handle the predicted noise level and survey data.