• Title/Summary/Keyword: 일반화된 선형모형

Search Result 148, Processing Time 0.026 seconds

Modelling Heterogeneity in Fertility for Analysis of Variety Trials (밭의 비옥도를 고려한 품종실험 분석)

  • 윤성철;강위창;이영조;임용빈
    • The Korean Journal of Applied Statistics
    • /
    • v.11 no.2
    • /
    • pp.423-433
    • /
    • 1998
  • In agricultural field experiments, the completely randomized block design is often used for the analysis of variety trials. An important assumption is that every experimental unit in each block has the some fertility. But, in most agricultural field experiments there often exists a systematic heterogeneity in fertility among the experimental units. To account for the heterogeneity, we propose to use the hierarchical generalized linear models. We compare our analysis of the data from Scottish Agricultural colleges list with that using Markov chain Monte Carlo method.

  • PDF

The Temporal Disaggregation Model for Nonlinear Pan Evaporation Estimation (비선형 증발접시 증발량 산정을 위한 시간적 분해모형)

  • Kim, Sungwon;Kim, Jung-Hun;Park, Ki-Bum;Kim, Hung Soo
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.30 no.4B
    • /
    • pp.399-412
    • /
    • 2010
  • The goal of this research is to apply the neural networks models for the temporal disaggregation of the yearly pan evaporation (PE) data, Republic of Korea. The neural networks models consist of multilayer perceptron neural networks model (MLP-NNM) and generalized regression neural networks model (GRNNM), respectively. And, for the performances evaluation of the neural networks models, they are composed of training and test performances, respectively. The three types of data such as the historic, the generated, and the mixed data are used for the training performance. The only historic data, however, is used for the testing performance. From this research, we evaluate the application of MLP-NNM and GRNNM for the temporal disaggregation of nonlinear time series data. We should, furthermore, construct the credible monthly PE data from the temporal disaggregation of the yearly PE data, and can suggest the available data for the evaluation of irrigation and drainage networks system.

Varying coefficient model with errors in variables (가변계수 측정오차 회귀모형)

  • Sohn, Insuk;Shim, Jooyong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.5
    • /
    • pp.971-980
    • /
    • 2017
  • The varying coefficient regression model has gained lots of attention since it is capable to model dynamic changes of regression coefficients in many regression problems of science. In this paper we propose a varying coefficient regression model that effectively considers the errors on both input and response variables, which utilizes the kernel method in estimating the varying coefficient which is the unknown nonlinear function of smoothing variables. We provide a generalized cross validation method for choosing the hyper-parameters which affect the performance of the proposed model. The proposed method is evaluated through numerical studies.

Analysis of Household Overdue Loans by Using a Two-stage Generalized Linear Model (이단계 일반화 선형모형을 이용한 은행 고객의 연체성향 분석)

  • Oh, Man-Suk;Oh, Hyeon-Tak;Lee, Young-Mi
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.3
    • /
    • pp.407-419
    • /
    • 2006
  • In this paper, we analyze household overdue loans in Korea which has been causing serious social and economical problems. We consider customers of Bank A in Korea and focus on overdue cash services which have been snowballing in the past few years. From analysis of overdue loans, one can predict possible delays for current customers as well as build a credit evaluation and risk management system for future customers. As a statistical analytical tool, we propose a two-stage Generalized Linear regression Model (GLM) which assumes a logistic model for presence/non-presence of overdue and a gamma model for the amount of overdue in the case of overdue. We perform goodness of fit test for the two-stage model and select significant explanatory variables in each stage of the model. It turns out that age, the amount of credit loans from other financial companies, the amount of cash service from other companies, debit balance, the average amount of cash service, and net profit are important explanatory variables relevant to overdue credit card cash service in Korea.

A Study on the Scoring Method for the Insurance Underwriting Using Generalized Linear Model (보험사 언더라이팅 기준 설정을 위한 스코어링 기법에 관한 연구)

  • Lee, Chang-Soo;Kwon, Hyuk-Sung;Kim, Dong-Kwang
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.3
    • /
    • pp.489-498
    • /
    • 2009
  • Underwriting is the first step for the administration of an insurance contract, which may result in stable profitability or unexpected loss for insurance company. Adequacy of underwriting criteria determines underwriting result. Generally, quantitative scoring system is used for underwriting. Method of evaluating risk for the scoring system is summing up scores for risk factors of a potential policyholder in consideration. Scores for each risk factor is predetermined. Current business environment for insurance companies makes underwriting profit more important, which means that insurance companies need more efficient underwriting method. This study suggests a reasonable approach to estimate risk relativities based on generalized linear model. Real data were used to quantify risk levels of groups of insureds for the design of underwriting model. Finally, effects in business volume and profitability of reflecting estimated underwriting scoring system are explained.

A study on the multivariate sliced inverse regression (다변량 분할 역회귀모형에 관한 연구)

  • 이용구;이덕기
    • The Korean Journal of Applied Statistics
    • /
    • v.10 no.2
    • /
    • pp.293-308
    • /
    • 1997
  • Sliced inverse regression is a method for reducing the dimension of the explanatory variable X without going through any parametric or nonparametric model fitting process. This method explores the simplicity of the inverse view of regression; that is, instead of regressing the univariate output varable y against the multivariate X, we regress X against y. In this article, we propose bivariate sliced inverse regression, whose method regress the multivariate X against the bivariate output variables $y_1, Y_2$. Bivariate sliced inverse regression estimates the e.d.r. directions of satisfying two generalized regression model simultaneously. For the application of bivariate sliced inverse regression, we decompose the output variable y into two variables, one variable y gained by projecting the output variable y onto the column space of X and the other variable r through projecting the output variable y onto the space orthogonal to the column space of X, respectively and then estimate the e.d.r. directions of the generalized regression model by utilize two variables simultaneously. As a result, bivariate sliced inverse regression of considering the variable y and r simultaneously estimates the e.d.r. directions efficiently and steadily when the regression model is linear, quadratic and nonlinear, respectively.

  • PDF

Generalized kernel estimating equation for panel estimation of small area unemployment rates (소지역 실업률의 패널추정을 위한 일반화커널추정방정식)

  • Shim, Jooyong;Kim, Youngwon;Hwang, Changha
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.6
    • /
    • pp.1199-1210
    • /
    • 2013
  • The high unemployment rate is one of the major problems in most countries nowadays. Hence, the demand for small area labor statistics has rapidly increased over the past few years. However, since sample surveys for producing official statistics are mainly designed for large areas, it is difficult to produce reliable statistics at the small area level due to small sample sizes. Most of existing studies about the small area estimation are related with the estimation of parameters based on cross-sectional data. By the way, since many official statistics are repeatedly collected at a regular interval of time, for instance, monthly, quarterly, or yearly, we need an alternative model which can handle this type of panel data. In this paper, we derive the generalized kernel estimating equation which can model time-dependency among response variables and handle repeated measurement or panel data. We compare the proposed estimating equation with the generalized linear model and the generalized estimating equation through simulation, and apply it to estimating the unemployment rates of 25 areas in Gyeongsangnam-do and Ulsan for 2005.

Design of Experiment Using Design Matrix in Terms of Generalized Linear Model (일반화 선형모형의 디자인 행렬을 이용한 품질 실험 설계)

  • Choi, Sung-Woon
    • Proceedings of the Safety Management and Science Conference
    • /
    • 2009.04a
    • /
    • pp.423-427
    • /
    • 2009
  • This study proposes the generation mechanism of various design matrix using generalized linear model for design of experiment. Design generation method of GLM analysis, factorial design(FD) with center points, ANOVA design with lack-of-fit test, and response surface design are introduced. In central composite(CC) design, orthogonal blocking and fractional factorial design(FFD) are presented. We compare the design of Box-Benhken(BB) and face-centred central compsite design.

  • PDF