• Title/Summary/Keyword: Nonparametric Estimation

Search Result 211, Processing Time 0.03 seconds

Nonparametric estimation of hazard rates change-point (위험률의 변화점에 대한 비모수적 추정)

  • 정광모
    • The Korean Journal of Applied Statistics
    • /
    • v.11 no.1
    • /
    • pp.163-175
    • /
    • 1998
  • The change of hazard rates at some unknown time point has been the interest of many statisticians. But it was restricted to the constant hazard rates which correspond to the exponential distribution. In this paper we generalize the change-point model in which any specific functional forms of hazard rates are net assumed. The assumed model includes various types of changes before and after the unknown time point. The Nelson estimator of cumulative hazard function is introduced. We estimate the change-point maximizing slope changes of Nelson estimator. Consistency and asymptotic distribution of bootstrap estimator are obtained using the martingale theory. Through a Monte Carlo study we check the performance of the proposed method. We also explain the proposed method using the Stanford Heart Transplant Data set.

  • PDF

Process modeling using artificial neural network in the presence of outliers

  • 고영철;박화규;봉복준;손주찬;왕지남
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 1997.10a
    • /
    • pp.177-180
    • /
    • 1997
  • Outliers, unexpected extraordinary observations that look discordant from most observation in a data set are commonplace in various kinds of data analysis. Since the effect of outliers on model identification could be serious, the aim of this paper is to present some ways of handling outliers in given data set and to specify a model in the presence of outliers. A procedure based on neural network which identifies outliers, removes their effects, and specifies a model for the underlying process is proposed. In contrast with traditional parametric methods requiring to estimate the model's structure and parameters before detecting outliers, the proposed procedure is a nonparametric method without the estimation of model's structure and parameters before handling outliers and could be applied for real problems in the presence of outliers. The proposed methodology is performed as followings. Firstly, outliers are detected and the detected outliers replace the prediction values using outliers detection neural network. The data set removing the effect of outliers is retraining using neural network. Therefore the effects of outliers are removed and the modeling precision can be improved. Experimental results show that the proposed method is suitable for predicting data set in the presence of outliers.

  • PDF

Nonparametric estimation of conditional quantile with censored data (조건부 분위수의 중도절단을 고려한 비모수적 추정)

  • Kim, Eun-Young;Choi, Hyemi
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.2
    • /
    • pp.211-222
    • /
    • 2013
  • We consider the problem of nonparametrically estimating the conditional quantile function from censored data and propose new estimators here. They are based on local logistic regression technique of Lee et al. (2006) and "double-kernel" technique of Yu and Jones (1998) respectively, which are modified versions under random censoring. We compare those with two existing estimators based on a local linear fits using the check function approach. The comparison is done by a simulation study.

Hybrid Fuzzy Least Squares Support Vector Machine Regression for Crisp Input and Fuzzy Output

  • Shim, Joo-Yong;Seok, Kyung-Ha;Hwang, Chang-Ha
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.2
    • /
    • pp.141-151
    • /
    • 2010
  • Hybrid fuzzy regression analysis is used for integrating randomness and fuzziness into a regression model. Least squares support vector machine(LS-SVM) has been very successful in pattern recognition and function estimation problems for crisp data. This paper proposes a new method to evaluate hybrid fuzzy linear and nonlinear regression models with crisp inputs and fuzzy output using weighted fuzzy arithmetic(WFA) and LS-SVM. LS-SVM allows us to perform fuzzy nonlinear regression analysis by constructing a fuzzy linear regression function in a high dimensional feature space. The proposed method is not computationally expensive since its solution is obtained from a simple linear equation system. In particular, this method is a very attractive approach to modeling nonlinear data, and is nonparametric method in the sense that we do not have to assume the underlying model function for fuzzy nonlinear regression model with crisp inputs and fuzzy output. Experimental results are then presented which indicate the performance of this method.

Estimation of conditional mean residual life function with random censored data (임의중단자료에서의 조건부 평균잔여수명함수 추정)

  • Lee, Won-Kee;Song, Myung-Unn;Jeong, Seong-Hwa
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.1
    • /
    • pp.89-97
    • /
    • 2011
  • The aims of this study were to propose a method of estimation for mean residual life function (MRLF) from conditional survival function using the Buckley and James's (1979) pseudo random variables, and then to assess the performance of the proposed method through the simulation studies. The mean squared error (MSE) of proposed method were less than those of the Cox's proportional hazard model (PHM) and Beran's nonparametric method for non-PHM case. Futhermore in the case of PHM, the MSE's of proposed method were similar to those of Cox's PHM. Finally, to evaluate the appropriateness of practical use, we applied the proposed method to the gastric cancer data. The data set consist of the 1, 192 patients with gastric cancer underwent surgery at the Department of Surgery, K-University Hospital.

Discontinuous log-variance function estimation with log-residuals adjusted by an estimator of jump size (점프크기추정량에 의한 수정된 로그잔차를 이용한 불연속 로그분산함수의 추정)

  • Hong, Hyeseon;Huh, Jib
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.2
    • /
    • pp.259-269
    • /
    • 2017
  • Due to the nonnegativity of variance, most of nonparametric estimations of discontinuous variance function have used the Nadaraya-Watson estimation with residuals. By the modification of Chen et al. (2009) and Yu and Jones (2004), Huh (2014, 2016a) proposed the estimators of the log-variance function instead of the variance function using the local linear estimator which has no boundary effect. Huh (2016b) estimated the variance function using the adjusted squared residuals by the estimated jump size in the discontinuous variance function. In this paper, we propose an estimator of the discontinuous log-variance function using the local linear estimator with the adjusted log-squared residuals by the estimated jump size of log-variance function like Huh (2016b). The numerical work demonstrates the performance of the proposed method with simulated and real examples.

An Exploratory Study of Collective E-Petitions Estimation Methodology Using Anomaly Detection: Focusing on the Voice of Citizens of Changwon City (이상탐지 활용 전자집단민원 추정 방법론에 관한 탐색적 연구: 창원시 시민의 소리 사례를 중심으로)

  • Jeong, Ha-Yeong
    • Informatization Policy
    • /
    • v.26 no.4
    • /
    • pp.85-106
    • /
    • 2019
  • Recently, there have been increasing cases of collective petitions filed in the electronic petitions system. However, there is no efficient management system, raising concerns on side effects such as increased administrative workload and mass production of social conflicts. Aimed at suggesting a methodology for estimating electronic collective petitions using anomaly detection and corpus linguistics-based content analysis, this study conducted the followings: i) a theoretical review of the concept of collective petitions, ii) estimation of electronic collective petitions using anomaly detection based on nonparametric unsupervised learning, iii) a content similarity analysis on petitions using n-gram cosine angle distance, and iv) a case study on the Voice of Citizens of Changwon City, through which the utility of the proposed methodology, policy implications and future tasks were reviewed.

Estimation of Source Apportionment for Semi-Continuous PM2.5 and Identification of Location for Local Point Sources at the St. Louis Supersite, USA (미국 St. Louis Supersite에서의 준 실시간 PM2.5에 대한 기여도 추정 및 지역 규모 오염원의 위치 파악)

  • Hwang, In-Jo
    • Journal of Korean Society for Atmospheric Environment
    • /
    • v.25 no.2
    • /
    • pp.154-166
    • /
    • 2009
  • In this study, 1-hour integrated $PM_{2.5}$ mass and chemical composition concentrations were monitored at the St. Louis-Midwest Supersite in Illinois. Time-resolved samples were collected one week in each of June 2001 (22 June to 28 June), November 2001 (7 November to 13 November), and March 2002 (19 March to 25 March). A total of 427 samples were collected by CAMM (continuous ambient mass monitor) and 15 compounds were analyzed by AAS, PILS (particle-into-liquid sampler), and TOT (thermal optical transmittance) method. PMF was applied to identify the sources and apportion the $PM_{2.5}$ mass to each source for highly time resolved data. In addition, the nonparametric regression (NPR) was applied to identify the predominant directions of local sources relative to wind direction. Also, this study performed compare the NPR analysis and location of actual local point sources at the St. Louis area. The PMF modeling identified nine sources and the average mass was apportioned to gasoline vehicle, road dust, zinc smelter, copper production, secondary sulfate, diesel emission, secondary nitrate, iron+steel, and lead smelter, respectively. These results suggested that this study results will be help for $PM_{2.5}$ source apportionment studies at similar metropolitan area, establish $PM_{2.5}$ standard, and establish effective emissions reduction strategies in Korea.

Condition Assessment for Wind Turbines with Doubly Fed Induction Generators Based on SCADA Data

  • Sun, Peng;Li, Jian;Wang, Caisheng;Yan, Yonglong
    • Journal of Electrical Engineering and Technology
    • /
    • v.12 no.2
    • /
    • pp.689-700
    • /
    • 2017
  • This paper presents an effective approach for wind turbine (WT) condition assessment based on the data collected from wind farm supervisory control and data acquisition (SCADA) system. Three types of assessment indices are determined based on the monitoring parameters obtained from the SCADA system. Neural Networks (NNs) are used to establish prediction models for the assessment indices that are dependent on environmental conditions such as ambient temperature and wind speed. An abnormal level index (ALI) is defined to quantify the abnormal level of the proposed indices. Prediction errors of the prediction models follow a normal distribution. Thus, the ALIs can be calculated based on the probability density function of normal distribution. For other assessment indices, the ALIs are calculated by the nonparametric estimation based cumulative probability density function. A Back-Propagation NN (BPNN) algorithm is used for the overall WT condition assessment. The inputs to the BPNN are the ALIs of the proposed indices. The network structure and the number of nodes in the hidden layer are carefully chosen when the BPNN model is being trained. The condition assessment method has been used for real 1.5 MW WTs with doubly fed induction generators. Results show that the proposed assessment method could effectively predict the change of operating conditions prior to fault occurrences and provide early alarming of the developing faults of WTs.

Nonparametic Kernel Regression model for Rating curve (수위-유량곡선을 위한 비매개 변수적 Kernel 회귀모형)

  • Moon, Young-Il;Cho, Sung-Jin;Chun, Si-Young
    • Journal of Korea Water Resources Association
    • /
    • v.36 no.6
    • /
    • pp.1025-1033
    • /
    • 2003
  • In common with workers in hydrologic fields, scientists and engineers relate one variable to two or more other variables for purposes of predication, optimization, and control. Statistics methods have improved to establish such relationships. Regression, as it is called, is indeed the most commonly used statistics technique in hydrologic fields; relationship between the monitored variable stage and the corresponding discharges(rating curve). Regression methods expressed in the form of mathematical equations which has parameters, so called parametric methods. some times, the establishment of parameters is complicated and uncertain. Many non-parametric regression methods which have not parameters, have been proposed and studied. The most popular of these are kernel regression method. Kernel regression offer a way of estimation the regression function without the specification of a parametric model. This paper conducted comparisons of some bandwidth selection methods which are using the least squares and cross-validation.