• Title/Summary/Keyword: a Local linear regression

Search Result 156, Processing Time 0.022 seconds

Comparison of Linear and Nonlinear Regressions and Elements Analysis for Wind Speed Prediction (풍속 예측을 위한 선형회귀분석과 비선형회귀분석 기법의 비교 및 인자분석)

  • Kim, Dongyeon;Seo, Kisung
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.25 no.5
    • /
    • pp.477-482
    • /
    • 2015
  • Linear regressions and evolutionary nonlinear regression based compensation techniques for the short-range prediction of wind speed are investigated. Development of an efficient MOS(Model Output Statistics) is necessary to correct systematic errors of the model, but a linear regression based MOS is hard to manage an irregular nature of weather prediction. In order to solve the problem, a nonlinear and symbolic regression method using GP(Genetic Programming) is suggested for a development of MOS for wind speed prediction. The proposed method is compared to various linear regression methods for prediction of wind speed. Also, statistical analysis of distribution for UM elements for each method is executed. experiments are performed for KLAPS(Korea Local Analysis and Prediction System) re-analysis data from 2007 to 2013 year for Jeju Island and Busan area in South Korea.

A Local Linear Kernel Estimator for Sparse Multinomial Data

  • Baek, Jangsun
    • Journal of the Korean Statistical Society
    • /
    • v.27 no.4
    • /
    • pp.515-529
    • /
    • 1998
  • Burman (1987) and Hall and Titterington (1987) studied kernel smoothing for sparse multinomial data in detail. Both of their estimators for cell probabilities are sparse asymptotic consistent under some restrictive conditions on the true cell probabilities. Dong and Simonoff (1994) adopted boundary kernels to relieve the restrictive conditions. We propose a local linear kernel estimator which is popular in nonparametric regression to estimate cell probabilities. No boundary adjustment is necessary for this estimator since it adapts automatically to estimation at the boundaries. It is shown that our estimator attains the optimal rate of convergence in mean sum of squared error under sparseness. Some simulation results and a real data application are presented to see the performance of the estimator.

  • PDF

Regression discontinuity for survival data

  • Youngjoo Cho
    • Communications for Statistical Applications and Methods
    • /
    • v.31 no.1
    • /
    • pp.155-178
    • /
    • 2024
  • Regression discontinuity (RD) design is one of the most widely used methods in causal inference for estimation of treatment effect when the treatment is created by a cutpoint from the covariate of interest. There has been little attention to RD design, although it provides a very useful tool for analysis of treatment effect for censored data. In this paper, we define the causal effect for survival function in RD design when the treatment is assigned deterministically by the covariate of interest. We propose estimators of this causal effect for survival data by using transformation, which leads unbiased estimator of the survival function with local linear regression. Simulation studies show the validity of our approach. We also illustrate our proposed method using the prostate, lung, colorectal and ovarian (PLCO) dataset.

On a Transformation Technique for Nonparametric Regression

  • Kim, Woochul;Park, Byeong U.
    • Journal of the Korean Statistical Society
    • /
    • v.25 no.2
    • /
    • pp.217-233
    • /
    • 1996
  • This paper gives a rigorous proof of an asymptotic result about bias and variance for a transformation-based nonparametric regression estimator proposed by Park et al (1995).

  • PDF

Some Results on the Log-linear Regression Diagnostics

  • Yang, Mi-Young;Choi, Ji-Min;Kim, Choong-Rak
    • Communications for Statistical Applications and Methods
    • /
    • v.14 no.2
    • /
    • pp.401-411
    • /
    • 2007
  • In this paper we propose an influence measure for detecting potentially influential observations using the infinitesimal perturbation and the local influence in the log-linear regression model. Also, we propose a goodness-of-fit measure for variable selection. A real data set are used for illustration.

A Comparative Study Between Linear Regression and Support Vector Regression Model Based on Environmental Factors of a Smart Bee Farm

  • Rahman, A. B. M. Salman;Lee, MyeongBae;Venkatesan, Saravanakumar;Lim, JongHyun;Shin, ChangSun
    • Smart Media Journal
    • /
    • v.11 no.5
    • /
    • pp.38-47
    • /
    • 2022
  • Honey is one of the most significant ingredients in conventional food production in different regions of the world. Honey is commonly used as an ingredient in ethnic food. Beekeeping is performed in various locations as part of the local food culture and an occupation related to pollinator production. It is important to conduct beekeeping so that it generates food culture and helps regulate the regional environment in an integrated manner in preserving and improving local food culture. This study analyzes different types of environmental factors of a smart bee farm. The major goal of this study is to determine the best prediction model between the linear regression model (LM) and the support vector regression model (SVR) based on the environmental factors of a smart bee farm. The performance of prediction models is measured by R2 value, root mean squared error (RMSE), and mean absolute error (MAE). From all analysis reports, the best prediction model is the support vector regression model (SVR) with a low coefficient of variation, and the R2 values for Farm inside temperature, bee box inside temperature, and Farm inside humidity are 0.97, 0.96, and 0.44.

Optimum Draft and Phasing of Double-Flap Wavemaker (Double-Flap 조파기의 설계 및 조파판의 최적 위상관계)

  • Shin-Hyoung,Kang
    • Bulletin of the Society of Naval Architects of Korea
    • /
    • v.16 no.1
    • /
    • pp.1-6
    • /
    • 1979
  • A method to determine the optimum drafts of waveboards of single and double-flap wavemaker is presented by using a linearized first-order wave theory and a linear regression method. A linear regression is verified to be quite simpler than the local disturbance consideration or regular-wave forming distance. It is pointed out that lower hinge should be deep enough to keep the upper flap vertical for a long wave length.

  • PDF

ADDITIVE AND HETEROSIS EFFECTS ON MILK YIELD AND BIRTH WEIGHT FROM CROSSBREEDING EXPERIMENTS BETWEEN HOLSTEIN AND THE LOCAL BREED IN BANGLADESH

  • Hirooka, H.;Bhuiyan, A.K.F.H.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.8 no.3
    • /
    • pp.295-300
    • /
    • 1995
  • Data from purebred and crossbred cattle involving Holstein and the Local breed in Bangladesh were used to estimate the genetic effects on average daily milk yield and birth weight A total of 877 records on average daily milk yield for 4 types of breed groups and a total of 418 records on birth weight for 5 breed groups were analyzed. Two different methods were applied in this study; the least squares analysis of variance approach and the linear regression approach. Breed group effects were highly significant for both average daily milk yield and birth weight. The result showed that straightbred Holstein produced the highest milk yield and the 7/8 crosses ranked highest in birth weight For the two traits, the additive breed effect was highly significant, whereas the individual heterosis effect was not significant. Furthermore, this study showed a negative maternal heterosis for average daily milk yields and a positive maternal heterosis for birth weight Comparing the breed least squares means obtained from the linear regression approach revealed that straightbred Holstein produced the highest average milk yield and the 3/4 crosses were predicted to have the largest birth weight. It is indicated that the linear regression approach can adequately separate the genetic component of performance, estimate unknown crossbreeding parameters and predict unknown performance of crosses which are not include in the original data.

Estimation of Density via Local Polynomial Tegression

  • Park, B. U.;Kim, W. C.;J. Huh;J. W. Jeon
    • Journal of the Korean Statistical Society
    • /
    • v.27 no.1
    • /
    • pp.91-100
    • /
    • 1998
  • A method of estimating probability density using regression tools is presented here. It is based on equal-length binning and locally weighted approximate likelihood for bin counts. The method is particularly useful for densities with bounded supports, where it automatically corrects edge effects without using boundary kernels.

  • PDF

A Study on Predicting the demand for Public Shared Bikes using linear Regression

  • HAN, Dong Hun;JUNG, Sang Woo
    • Korean Journal of Artificial Intelligence
    • /
    • v.10 no.1
    • /
    • pp.27-32
    • /
    • 2022
  • As the need for eco-friendly transportation increases due to the deepening climate crisis, many local governments in Korea are introducing shared bicycles. Due to anxiety about public transportation after COVID-19, bicycles have firmly established themselves as the axis of daily transportation. The use of shared bicycles is spread, and the demand for bicycles is increasing by rental offices, but there are operational and management difficulties because the demand is managed under a limited budget. And unfortunately, user behavior results in a spatial imbalance of the bike inventory over time. So, in order to easily operate the maintenance of shared bicycles in Seoul, bicycles should be prepared in large quantities at a time of high demand and withdrawn at a low time. Therefore, in this study, by using machine learning, the linear regression algorithm and MS Azure ML are used to predict and analyze when demand is high. As a result of the analysis, the demand for bicycles in 2018 is on the rise compared to 2017, and the demand is lower in winter than in spring, summer, and fall. It can be judged that this linear regression-based prediction can reduce maintenance and management costs in a shared society and increase user convenience. In a further study, we will focus on shared bike routes by using GPS tracking systems. Through the data found, the route used by most people will be analyzed to derive the optimal route when installing a bicycle-only road.