• Title/Summary/Keyword: 가법모형

Search Result 60, Processing Time 0.02 seconds

A Study on Applying Shrinkage Method in Generalized Additive Model (일반화가법모형에서 축소방법의 적용연구)

  • Ki, Seung-Do;Kang, Kee-Hoon
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.1
    • /
    • pp.207-218
    • /
    • 2010
  • Generalized additive model(GAM) is the statistical model that resolves most of the problems existing in the traditional linear regression model. However, overfitting phenomenon can be aroused without applying any method to reduce the number of independent variables. Therefore, variable selection methods in generalized additive model are needed. Recently, Lasso related methods are popular for variable selection in regression analysis. In this research, we consider Group Lasso and Elastic net models for variable selection in GAM and propose an algorithm for finding solutions. We compare the proposed methods via Monte Carlo simulation and applying auto insurance data in the fiscal year 2005. lt is shown that the proposed methods result in the better performance.

An Additive Stratified Quantitative Attribute Randomized Response Model (층화 가법 양적속성 확률화응답모형)

  • Lee, Gi-Sung;Ahn, Seung-Chul;Hong, Ki-Hak;Son, Chang-Kyoon
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.2
    • /
    • pp.239-247
    • /
    • 2014
  • For a sensitive survey in which the population is composed by several strata with quantitative attributes, we present an additive stratified quantitative attribute randomized response model which applied stratified random sampling instead of simple random sampling to the models of Himmelfarb-Edgell's additive quantitative attribute model and Gjestvang-Singh's. We also establish theoretical grounds to estimate the stratum mean of sensitive quantitative attributes as well as the over all mean. We deal with the proportional and optimal allocation problems in each suggested model and compare the relative efficiency of the suggested two models; subsequently, Himmelfarb-Edgell's model is more efficient than Gjestvang-Singh's model under the condition of stratified random sampling.

비가법성에 대한 Tukey의 통계량에 관하여

  • Paik, U.B.
    • Journal of the Korean Statistical Society
    • /
    • v.4 no.1
    • /
    • pp.9-17
    • /
    • 1975
  • A, B 두 요인의 영향을 받고 있다고 생각되는 rc개 측정치가 있고 이것이 다음과 같이 $r\timesc$ 이차분류표로 정리되었다고 하자. 여기에서 $$y_{ij} = \mu + \alpha_i + \beta_j + \epsilon_{ij}$$ 와 같은 가법모형을 생각한다. 그리고 $\epsilon_{ij}$는 잔여항으로써 평균이 0, 분산이 $\sigma^2$인 정규분포를 한다고 가정하는 것이 보통이다. 또 이것은 모수모형인 경우 $E(y_{ij}) = \mu + \alpha_i + \beta_i, v(y_{ij}) = \sigma^2$임을 의미하는 것으로 생각된다. 그러나 자료에 따라서는 위에서와 같은 가법적 모형을 적용한다는 것이 적당하지 못한 경우가 있다.

  • PDF

A credit classification method based on generalized additive models using factor scores of mixtures of common factor analyzers (공통요인분석자혼합모형의 요인점수를 이용한 일반화가법모형 기반 신용평가)

  • Lim, Su-Yeol;Baek, Jang-Sun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.2
    • /
    • pp.235-245
    • /
    • 2012
  • Logistic discrimination is an useful statistical technique for quantitative analysis of financial service industry. Especially it is not only easy to be implemented, but also has good classification rate. Generalized additive model is useful for credit scoring since it has the same advantages of logistic discrimination as well as accounting ability for the nonlinear effects of the explanatory variables. It may, however, need too many additive terms in the model when the number of explanatory variables is very large and there may exist dependencies among the variables. Mixtures of factor analyzers can be used for dimension reduction of high-dimensional feature. This study proposes to use the low-dimensional factor scores of mixtures of factor analyzers as the new features in the generalized additive model. Its application is demonstrated in the classification of some real credit scoring data. The comparison of correct classification rates of competing techniques shows the superiority of the generalized additive model using factor scores.

Forecasting number of student by Holt-Winters additive model (홀트-윈터스 가법모형에 의한 전국 학생수 예측)

  • Kim, Jong-Tae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.4
    • /
    • pp.685-694
    • /
    • 2009
  • The idea of this paper is to get the time series data from the number of student on the elementary, meddle and high-school for the forecasting of the numbers of student. Tow models, model A and model B, of time series data are obtained. The Holt-Winters additive methods are used for the forecasting of the numbers of student with the model A and model B until 2019 year. As the result, the abilities of forecasting on model A and B are better than those of the Korean education statistical system 2007.

  • PDF

Tackling Proximity Effects in Nonmarket Valuation Approaches : An Example of Contingent Valuation Method (비시장재화의 가치평가에 있어서 근접효과(Proximity Effects)의 검증에 관한 연구 : 조건부가치평가법을 중심으로)

  • Jeon, Chul-Hyun;Shin, Hio-Jung;Joo, Hye-Jin
    • Environmental and Resource Economics Review
    • /
    • v.19 no.1
    • /
    • pp.101-127
    • /
    • 2010
  • The purpose of the research is to tackle proximity effects (PEs) when nonmarket valuation method CVM is applied to environmental goods such as tidal flats. 1,000 households are surveyed in the ratio of national household for the research. The sample are reclassified into five areas by 30-minute distance. Log-linear are used to analysis PEs in the research. On conclusion log-linear model regarding income effects proves that PEs are apparently represented in NMVMs(${\theta}_1$ >0. ${\theta}_2$ >0 and $dum1{\neq}0$, $dum2{\neq}0$, $dum3{\neq}0$, $dum4{\neq}0$) as a result of a 5 per cent significant level of t -test and F-test, finally rejecting the null hypothesis. In addition, WTP of area I respondents shows 26 per cent more then that of area V respondents, which is from \87,969 to \64,866 in the open-ended format. Finally, the research proves that the PEs in CVM are evidently represented with the econometric model, hence the PEs have to be embedded into the questionnaire of non-market valuation methods with the environmental goods to reduce the underestimation and improve the estimation accuracy.

  • PDF

Nonparametric compositional data analysis for tourism industry in Gangwon area (강원도 관광산업에 대한 비모수적 구성비 자료 분석)

  • Seongeun Park;Jeong Min Jeon;Young Kyung Lee
    • The Korean Journal of Applied Statistics
    • /
    • v.36 no.5
    • /
    • pp.473-488
    • /
    • 2023
  • Gangwon-do is one of Korea's most popular tourist destinations, with varying tourism demands and trends across its subregions. It is crucial to identify the characteristics of tourism in each area and compare the tourism patterns over time to devise policies that revitalize tourism in each local government and promote balanced development across regions. In this paper, we classify the regions in Gangwon-do based on tourism data from the last four years and analyze the tourism pattern of each region using the non-Euclidean additive model proposed by Jeon et al. (2021). The model incorporates the proportions of visitors by age groups and the proportions of navigation searches by destination types as two covariates, and the proportions of tourism expenditure types as a response variable. We estimate the model using the smooth-backfitting method and coordinate-wise bandwidth selection. The results are visualized in ternary plots, and changes in tourism patterns over time are analyzed by comparing the ratios of prediction errors to fitting errors.

Drought index forecast using ensemble learning (앙상블 기법을 이용한 가뭄지수 예측)

  • Jeong, Jihyeon;Cha, Sanghun;Kim, Myojeong;Kim, Gwangseob;Lim, Yoon-Jin;Lee, Kyeong Eun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.5
    • /
    • pp.1125-1132
    • /
    • 2017
  • In a situation where the severity and frequency of drought events getting stronger and higher, many studies related to drought forecast have been conducted to improve the drought forecast accuracy. However it is difficult to predict drought events using a single model because of nonlinear and complicated characteristics of temporal behavior of drought events. In this study, in order to overcome the shortcomings of the single model approach, we first build various single models capable to explain the relationship between the meteorological drought index, Standardized Precipitation Index (SPI), and other independent variables such as world climate indices. Then, we developed a combined models using Stochastic Gradient Descent method among Ensemble Learnings.

Factor Analysis of Customer Loyalty in Car Insurance Using Generalized Additive Partial Linear Model (일반화가법부분선형모형을 이용한 자동차보험 충성도 요인분석)

  • Ki, Seung-Do;Kang, Kee-Hoon
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.1
    • /
    • pp.67-79
    • /
    • 2012
  • The car insurance market in Korea has already entered (or is in the process of entry) a mature market that is characterized by increased competition by market participants. Participants are expected to compete more intensively in order to survive. Together with a slowdown in market growth the goal of non-life insurers' marketing strategies is to enhance existing customer loyalty because it is easier to raise their loyalty via customer satisfaction than to attract new customers in a stagnant market. In this article, we investigate what factors affect customer loyalty, and suggest some specific ways to establish and implement marketing strategies. We use a generalized additive partial linear model in order to find some significant factors.

Using Generalized Additive Partial Linear Model for Constructing Underwriting System (언더라이팅 시스템 구축을 위한 일반화가법부분선형모형의 활용)

  • Ki, Seung-Do;Kang, Kee-Hoon
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.6
    • /
    • pp.1215-1227
    • /
    • 2009
  • Underwriting refers to the process that the insurance company measures the potential risk of the future clients and decide whether insuring them with current premium. Although the traditional underwriting system used in Korean automobile insurance market is easy to understand, it is not based on a reliable statistical procedure. In this paper, we propose to apply the generalized additive model into construction of underwriting system, which is based on statistical analysis. We use automobile insurance data in Korea and apply our approach to the data. The results from the empirical analysis would be useful even for determining the significance of each variable in calculating automobile insurance premium.