Search | Korea Science

Application of machine learning models for estimating house price (단독주택가격 추정을 위한 기계학습 모형의 응용)

Lee, Chang Ro;Park, Key Ho
Journal of the Korean Geographical Society / v.51 no.2 / pp.219-233 / 2016
In the social sciences, statistical models are used almost exclusively for causal explanation, and explanatory modeling has been the mainstream approach; predictive modeling, in contrast, has been rare. We therefore focus on constructing a predictive non-parametric model rather than an explanatory one. Gangnam-gu, Seoul was chosen as the study area, and we collected data on single-family houses sold between 2011 and 2014. We applied non-parametric models from the machine learning literature, including the generalized additive model (GAM), random forest, multivariate adaptive regression splines (MARS), and support vector machines (SVM). Recently developed models such as MARS and SVM were found to have superior predictive power for house price estimation. Finally, accounting for spatial autocorrelation in the non-parametric models further improved their predictive power. We hope this study will prompt property price estimation methodology to extend from traditional parametric models to non-parametric ones.

Generalized Weighted Linear Models Based on Distribution Functions - A Frequentist Perspective (분포함수를 기초로 일반화가중선형모형)

여인권
The Korean Journal of Applied Statistics / v.17 no.3 / pp.489-498 / 2004
In this paper, a new form of linear model, referred to as the generalized weighted linear model, is proposed. The proposed models assume that the relationship between the response variable and the explanatory variables can be modelled by a distribution function of the response mean and a weighted linear combination of distribution functions of the covariates. This form addresses a structural problem of the link function in generalized linear models, in which the parameter space may not be consistent with the space derived from the linear predictors. Maximum likelihood estimation with Lagrange's undetermined multipliers is used to estimate the parameters, and a resampling method is applied to compute confidence intervals and to test hypotheses.
https://doi.org/10.5351/KJAS.2004.17.3.489

Estimation for the generalized exponential distribution under progressive type I interval censoring (일반화 지수분포를 따르는 제 1종 구간 중도절단표본에서 모수 추정)

Cho, Youngseukm;Lee, Changsoo;Shin, Hyejung
Journal of the Korean Data and Information Science Society / v.24 no.6 / pp.1309-1317 / 2013
Various parameter estimation methods exist for the generalized exponential distribution under progressive type I interval censoring. Chen and Lio (2010) studied parameter estimation by maximum likelihood estimation, the mid-point approximation method, the expectation-maximization algorithm, and the method of moments. Among these, the mid-point approximation method has the smallest mean square error. However, with this method it is difficult to derive a closed-form solution for the parameter estimates by maximum likelihood estimation. In this paper, we propose two types of approximate maximum likelihood estimators to solve this problem. Simulation results show that the proposed estimators perform well in terms of mean square error, and that the proposed method yields closed-form solutions for the parameter estimates of the generalized exponential distribution under progressive type I interval censoring.
https://doi.org/10.7465/jkdi.2013.24.6.1309

The Study for NHPP Software Reliability Growth Model Based on Hyper-exponential Distribution (초지수분포(Hyper-exponential)를 이용한 소프트웨어 신뢰성장 모형에 관한 연구)

Kim, Hee-Cheul;Shin, Hyun-Cheul
Convergence Security Journal / v.7 no.1 / pp.45-53 / 2007
Finite failure NHPP models presented in the literature exhibit either constant, monotonically increasing, or monotonically decreasing failure occurrence rates per fault. This paper reviews the Goel-Okumoto and Yamada-Ohba-Osaki models and proposes a hyper-exponential distribution reliability model that can be applied efficiently to software reliability. The parameters are estimated using the maximum likelihood estimator and the bisection method. For model determination and selection, goodness of fit (the error sum of squares) is examined. The methodology developed in this paper is illustrated with a random software reliability data set generated from a Weibull distribution (shape 0.1, scale 1) using the Minitab (version 14) statistical package.
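The abstract names the bisection method for parameter estimation but gives no details. The following is only a minimal sketch of bisection applied to a likelihood score equation; the function name `bisect_root` and the exponential-rate example are illustrative assumptions and are not taken from the paper.

```python
def bisect_root(f, lo, hi, tol=1e-10, max_iter=200):
    """Find a root of f on [lo, hi] by bisection.

    Requires f(lo) and f(hi) to have opposite signs (a bracketing interval).
    """
    f_lo, f_hi = f(lo), f(hi)
    if f_lo == 0:
        return lo
    if f_hi == 0:
        return hi
    if f_lo * f_hi > 0:
        raise ValueError("f(lo) and f(hi) must have opposite signs")
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        # Stop when the root is hit exactly or the bracket is small enough.
        if f_mid == 0 or (hi - lo) / 2 < tol:
            return mid
        if f_lo * f_mid < 0:
            hi = mid                 # root lies in the left half
        else:
            lo, f_lo = mid, f_mid    # root lies in the right half
    return 0.5 * (lo + hi)


# Hypothetical example: the score equation for an exponential rate,
# d/d(rate) log L = n / rate - sum(x) = 0, whose exact root is n / sum(x).
data = [0.5, 1.5, 1.0]

def score(rate):
    return len(data) / rate - sum(data)

rate_hat = bisect_root(score, 0.1, 10.0)
```

Bisection needs only sign changes of the score function, so it remains usable when the likelihood equation has no closed-form solution, at the cost of requiring a bracketing interval.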

Stochastic Volatility Model vs. GARCH Model : A Comparative Study (확률적 변동성 모형과 자기회귀이분산 모형의 비교분석)

이용흔;김삼용;황선영
The Korean Journal of Applied Statistics / v.16 no.2 / pp.217-224 / 2003
Volatility in financial data is usually measured by conditional variance. The two main approaches to gauging conditional variance are the stochastic volatility (SV) model and the autoregressive-type approach (GARCH). This article conducts a comparative study of SV and GARCH using Korean Stock Prices Index (KOSPI) data. The SV model is found to be slightly better than GARCH(1,1) in analyzing the KOSPI data.
https://doi.org/10.5351/KJAS.2003.16.2.217
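For readers unfamiliar with GARCH(1,1), the conditional variance follows the textbook recursion sigma2[t] = omega + alpha * r[t-1]^2 + beta * sigma2[t-1]. The sketch below assumes zero-mean returns and starts the recursion at the unconditional variance; the function name and parameter values are illustrative assumptions, not the authors' implementation or estimates.

```python
def garch11_variance(returns, omega, alpha, beta):
    """Conditional variance path of a GARCH(1,1) model.

    Assumes zero-mean returns and alpha + beta < 1, so the unconditional
    variance omega / (1 - alpha - beta) exists and can seed the recursion.
    """
    if omega <= 0 or alpha < 0 or beta < 0 or alpha + beta >= 1:
        raise ValueError("need omega > 0, alpha, beta >= 0, alpha + beta < 1")
    if not returns:
        return []
    sigma2 = [omega / (1.0 - alpha - beta)]  # initial value (an assumption)
    for r in returns[:-1]:
        # Today's variance depends on yesterday's squared return and variance.
        sigma2.append(omega + alpha * r * r + beta * sigma2[-1])
    return sigma2


# Hypothetical parameter values, chosen only for illustration.
path = garch11_variance([0.0, 0.0, 0.0], omega=0.1, alpha=0.1, beta=0.8)
```

In the SV model, by contrast, the variance has its own random innovation, so it cannot be computed deterministically from past returns in this way.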

Two-Stage Experimental Design for Multiple Objectives (다수목적을 위한 2단계 실험)

Jang, Dae-Heung;Kim, Youngil
The Korean Journal of Applied Statistics / v.28 no.1 / pp.93-102 / 2015
The D-optimal design for a nonlinear model typically depends on the unknown parameters to be estimated. The literature therefore strongly recommends a sequential experimental design for estimating the parameters. This paper discusses two-stage experimental design under many different circumstances, including parameter estimation. The method is general enough to be applied to any mixture of objectives for any model, including linear models. A hybrid approach is suggested for handling more than two objectives in a two-stage experimental design. The design is discussed in the approximate design framework.
https://doi.org/10.5351/KJAS.2015.28.1.093

Comparison of semiparametric methods to estimate VaR and ES (조건부 Value-at-Risk와 Expected Shortfall 추정을 위한 준모수적 방법들의 비교 연구)

Kim, Minjo;Lee, Sangyeol
The Korean Journal of Applied Statistics / v.29 no.1 / pp.171-180 / 2016
The Basel Committee suggests using Value-at-Risk (VaR) and expected shortfall (ES) as measures of market risk. Various estimation methods for VaR and ES have been studied in the literature. This paper compares semi-parametric methods, such as the conditional autoregressive value at risk (CAViaR) and conditional autoregressive expectile (CARE) methods, with a Gaussian quasi-maximum likelihood estimator (QMLE)-based method through back-testing. We use unconditional coverage (UC) and conditional coverage (CC) tests for VaR, and a bootstrap test for ES, to check adequacy. A real data analysis is conducted for the S&P 500 index and Hyundai Motor Co. stock price index data sets.
https://doi.org/10.5351/KJAS.2016.29.1.171
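The abstract does not say which unconditional coverage test is used. As a sketch, and assuming the standard Kupiec proportion-of-failures likelihood ratio, the test compares the observed rate of VaR violations with the nominal level p. The function names below are hypothetical.

```python
import math


def _xlogy(x, y):
    """x * log(y) with the convention 0 * log(0) = 0."""
    return 0.0 if x == 0 else x * math.log(y)


def kupiec_uc_test(violations, p):
    """Kupiec unconditional coverage (proportion-of-failures) test.

    violations: sequence of booleans, True where the loss exceeded VaR.
    p: nominal violation probability (e.g. 0.01 for 99% VaR).
    Returns (LR statistic, p-value); LR is asymptotically chi-square(1).
    """
    n = len(violations)
    if n == 0 or not 0.0 < p < 1.0:
        raise ValueError("need a non-empty sample and 0 < p < 1")
    x = sum(1 for v in violations if v)
    pi_hat = x / n  # observed violation rate
    ll_null = _xlogy(n - x, 1.0 - p) + _xlogy(x, p)
    ll_alt = _xlogy(n - x, 1.0 - pi_hat) + _xlogy(x, pi_hat)
    lr = max(0.0, -2.0 * (ll_null - ll_alt))
    # Survival function of chi-square(1): P(Z^2 > lr) = erfc(sqrt(lr / 2)).
    p_value = math.erfc(math.sqrt(lr / 2.0))
    return lr, p_value


# Illustrative data: 30 violations in 1000 days against a 1% nominal level.
hits = [True] * 30 + [False] * 970
lr_stat, p_val = kupiec_uc_test(hits, 0.01)
```

A small p-value indicates that the violation frequency is inconsistent with the nominal level. The test ignores clustering of violations, which is what the conditional coverage test adds.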

Regression diagnostics for response transformations in a partial linear model (부분선형모형에서 반응변수변환을 위한 회귀진단)

Seo, Han Son;Yoon, Min
Journal of the Korean Data and Information Science Society / v.24 no.1 / pp.33-39 / 2013
When the response variable is transformed in partial linear models, outliers can adversely affect estimation of the transformation parameter, just as in linear models. Solving this problem requires both estimating the transformation parameter and detecting outliers, but these are difficult to carry out because of the arbitrariness of the nonparametric function included in the partial linear model. In this study, we suggest procedures for transforming the response variable that are robust to outliers in partial linear models, based on estimation of the nonparametric function and on outlier detection methods such as a sequential test and maximum trimmed likelihood estimation. The effectiveness of the proposed methods is verified and compared through a simulation study and examples.
https://doi.org/10.7465/jkdi.2013.24.1.33

Robust ridge regression for nonlinear mixed effects models with applications to quantitative high throughput screening assay data (비선형 혼합효과모형에서의 로버스트 능형회귀 방법과 정량적 고속 대량 스크리닝 자료에의 응용)

Yoo, Jiseon;Lim, Changwon
The Korean Journal of Applied Statistics / v.31 no.1 / pp.123-137 / 2018
A nonlinear mixed effects model is mainly used to analyze repeated measurement data in various fields. It consists of two stages: the first-stage individual-level model considers intra-individual variation, and the second-stage population model considers inter-individual variation. The individual-level model estimates the parameters of a nonlinear regression model. It is the same as the general nonlinear regression model and usually estimates parameters by least squares. However, least squares estimation can produce extremely large parameter estimates and standard errors when the assumed nonlinear function is not clearly revealed by the data. In this paper, a new estimation method is proposed to solve this problem by introducing a ridge regression method, recently proposed for nonlinear regression, into the first-stage individual-level model of the nonlinear mixed effects model. The performance of the proposed estimator is compared with that of the standard estimator through a simulation study. The proposed methodology is also illustrated using quantitative high throughput screening data obtained from the US National Toxicology Program.
https://doi.org/10.5351/KJAS.2018.31.1.123

An Improved Method for Phenology Model Parameterization Using Sequential Optimization (순차적인 최적화 기법에 의한 생물계절모형 모수추정 방식 개선)

Yun, Kyungdahm;Kim, Soo-Hyung
Korean Journal of Agricultural and Forest Meteorology / v.16 no.4 / pp.304-308 / 2014
Accurate prediction of peak bloom dates (PBD) of flowering cherry trees is critical for organizing local cherry festivals and other associated cultural and economic activities. A two-step phenology model is commonly used to predict flowering time from local temperatures as the result of two consecutive steps, chill accumulation followed by heat accumulation. However, the extensive computation required for parameter estimation has limited its practical use. We propose a sequential parameterization method that exploits previously unused records of development stages. With an extra constraint formed by heat accumulation between two intervening stages, each parameter can be solved sequentially in much less time than with the brute-force method. The result was found to be almost identical to the previous solution known for cherry trees (Prunus × yedoensis) in the Tidal Basin, Washington D.C.
https://doi.org/10.5532/KJAFM.2014.16.4.304

