• Title/Summary/Keyword: Data Weights

Search Result 1,439, Processing Time 0.024 seconds

Inappropriate Survey Design Analysis of the Korean National Health and Nutrition Examination Survey May Produce Biased Results

  • Kim, Yangho;Park, Sunmin;Kim, Nam-Soo;Lee, Byung-Kook
    • Journal of Preventive Medicine and Public Health
    • /
    • v.46 no.2
    • /
    • pp.96-104
    • /
    • 2013
  • Objectives: The inherent nature of the Korean National Health and Nutrition Examination Survey (KNHANES) design requires special analysis by incorporating sample weights, stratification, and clustering not used in ordinary statistical procedures. Methods: This study investigated the proportion of research papers that have used an appropriate statistical methodology out of the research papers analyzing the KNHANES cited in the PubMed online system from 2007 to 2012. We also compared differences in mean and regression estimates between the ordinary statistical data analyses without sampling weight and design-based data analyses using the KNHANES 2008 to 2010. Results: Of the 247 research articles cited in PubMed, only 19.8% of all articles used survey design analysis, compared with 80.2% of articles that used ordinary statistical analysis, treating KNHANES data as if it were collected using a simple random sampling method. Means and standard errors differed between the ordinary statistical data analyses and design-based analyses, and the standard errors in the design-based analyses tended to be larger than those in the ordinary statistical data analyses. Conclusions: Ignoring complex survey design can result in biased estimates and overstated significance levels. Sample weights, stratification, and clustering of the design must be incorporated into analyses to ensure the development of appropriate estimates and standard errors of these estimates.

Soft Set Theory Oriented Forecast Combination Method for Business Failure Prediction

  • Xu, Wei;Xiao, Zhi
    • Journal of Information Processing Systems
    • /
    • v.12 no.1
    • /
    • pp.109-128
    • /
    • 2016
  • This paper presents a new combined forecasting method that is guided by the soft set theory (CFBSS) to predict business failures with different sample sizes. The proposed method combines both qualitative analysis and quantitative analysis to improve forecasting performance. We considered an expert system (ES), logistic regression (LR), and support vector machine (SVM) as forecasting components whose weights are determined by the receiver operating characteristic (ROC) curve. The proposed procedure was applied to real data sets from Chinese listed firms. For performance comparison, single ES, LR, and SVM methods, the combined forecasting method based on equal weights (CFBEWs), the combined forecasting method based on neural networks (CFBNNs), and the combined forecasting method based on rough sets and the D-S theory (CFBRSDS) were also included in the empirical experiment. CFBSS obtains the highest forecasting accuracy and the second-best forecasting stability. The empirical results demonstrate the superior forecasting performance of our method in terms of accuracy and stability.

Weighted Distance-Based Quantization for Distributed Estimation

  • Kim, Yoon Hak
    • Journal of information and communication convergence engineering
    • /
    • v.12 no.4
    • /
    • pp.215-220
    • /
    • 2014
  • We consider quantization optimized for distributed estimation, where a set of sensors at different sites collect measurements on the parameter of interest, quantize them, and transmit the quantized data to a fusion node, which then estimates the parameter. Here, we propose an iterative quantizer design algorithm with a weighted distance rule that allows us to reduce a system-wide metric such as the estimation error by constructing quantization partitions with their optimal weights. We show that the search for the weights, the most expensive computational step in the algorithm, can be conducted in a sequential manner without deviating from convergence, leading to a significant reduction in design complexity. Our experments demonstrate that the proposed algorithm achieves improved performance over traditional quantizer designs. The benefit of the proposed technique is further illustrated by the experiments providing similar estimation performance with much lower complexity as compared to the recently published novel algorithms.

A Model of Evaluating the Efficiency of Container Terminals for Improving Service Quality (서비스 품질 향상을 위한 컨테이너 터미널의 효율성 평가 모형에 관한 연구)

  • 임병학;한윤환
    • Journal of Korean Society for Quality Management
    • /
    • v.32 no.2
    • /
    • pp.77-92
    • /
    • 2004
  • It is difficult but very necessary to measure the productivity of container terminals as logistics service provider. It is meaningful to find the appropriate inputs and outputs of the logistics service delivery systems and to measure the relationship between these inputs and outputs. This study proposes a model of evaluating the efficiency of container terminals. The evaluation consists of three phases. First, DEA(Data Envelopment Analysis) phase, determines the efficiency score and weights of DMUs(Decision Making Unit). This phase performs through four steps : selection of DMU, selection of DEA model, determination of input and output factors, calculation of efficiency score and weights for each DMU. Secondly, CEM (Cross Evaluation Model) phase, is to calculate the cross-efficiency scores of DMUs. This phase performs through three steps: selection of CEM, determination of cross-efficiency score for each DMU and development of cross-efficiency matrix. Finally, average cross-efficiency analysis phase is to compute the average cross-efficiency score. The proposed model discriminates among DMUs and ranks DMUs, whether they are efficient or inefficient.

Choice of weights in a hybrid volatility based on high-frequency realized volatility (고빈도 금융 시계열 실현 변동성을 이용한 가중 융합 변동성의 가중치 선택)

  • Yoon, J.E.;Hwang, S.Y.
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.3
    • /
    • pp.505-512
    • /
    • 2016
  • The paper is concerned with high frequency financial time series. A weighted hybrid volatility is suggested to compute daily volatilities based on high frequency data. Various realized volatility (RV) computations are reviewed and the weights are chosen by minimizing the differences between the hybrid volatility and the realized volatility. A high frequency time series of KOSPI200 index is illustrated via QLIKE and Theil-U statistics.

A Study for Obtaining Weights in Pairwise Comparison Matrix in AHP

  • Jeong, Hyeong-Chul;Lee, Jong-Chan;Jhun, Myoung-Shic
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.3
    • /
    • pp.531-541
    • /
    • 2012
  • In this study, we consider various methods to estimate the weights of a pairwise comparison matrix in the Analytic Hierarchy Process widely applied in various decision-making fields. This paper uses a data dependent simulation to evaluate the statistical accuracy, minimum violation and minimum norm of the obtaining weight methods from a reciprocal symmetric matrix. No method dominates others in all criteria. Least squares methods perform best in point of mean squared errors; however, the eigenvectors method has an advantage in the minimum norm.

Soft Combination Schemes for Cooperative Spectrum Sensing in Cognitive Radio Networks

  • Shen, Bin;Kwak, Kyung-Sup
    • ETRI Journal
    • /
    • v.31 no.3
    • /
    • pp.263-270
    • /
    • 2009
  • This paper investigates linear soft combination schemes for cooperative spectrum sensing in cognitive radio networks. We propose two weight-setting strategies under different basic optimality criteria to improve the overall sensing performance in the network. The corresponding optimal weights are derived, which are determined by the noise power levels and the received primary user signal energies of multiple cooperative secondary users in the network. However, to obtain the instantaneous measurement of these noise power levels and primary user signal energies with high accuracy is extremely challenging. It can even be infeasible in practical implementations under a low signal-to-noise ratio regime. We therefore propose reference data matrices to scavenge the indispensable information of primary user signal energies and noise power levels for setting the proposed combining weights adaptively by keeping records of the most recent spectrum observations. Analyses and simulation results demonstrate that the proposed linear soft combination schemes outperform the conventional maximal ratio combination and equal gain combination schemes and yield significant performance improvements in spectrum sensing.

  • PDF

A CONTROLLER DESIGN OF ACTIVE SUSPENSION USING EVOLUTION STRATEGY AND NEURAL NETWORK

  • Cheon, Jong-Min;Kim, Seog-Joo;Lee, Jong-Moo;Kwon, Soon-Man
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2005.06a
    • /
    • pp.1530-1533
    • /
    • 2005
  • In this paper, we design a Linear Quadratic Gaussian controller for the active suspension. We can improve the inherent suspension problem, trade-off between the ride quality and the suspension travel by selecting appropriate weights in the LQ-objective function. Because any definite rules for selecting weights do not exist, we use an optimization-algorithm, Evolution Strategy (ES) to find the proper control gains for selected frequencies, which have major effects on the vibrations of the vehicle's state variables. The frequencies and proper control gains are used for the neural network data. During a vehicle running, the trained on-line neural network is activated and provides the proper gains for non-trained frequencies. For the full-state feedback control, Kalman filter observes the full states and Fourier transform is used to detect the frequency of the road.

  • PDF

Evaluation of Surrogate Models for Shape Optimization of Compressor Blades

  • Samad, Abdus;Kim, Kwang-Yong
    • 유체기계공업학회:학술대회논문집
    • /
    • 2006.08a
    • /
    • pp.367-370
    • /
    • 2006
  • Performances of multiple surrogate models are evaluated in a turbomachinery blade shape optimization. The basic models, i.e., Response Surface Approximation, Kriging and Radial Basis Neural Network models as well as weighted average models are tested for shape optimization. Global data based errors for each surrogates are used to calculate the weights. These weights are multiplied with the respective surrogates to get the final weighted average models. The design points are selected using three level fractional factorial D-optimal designs. The present approach can help address the multi-objective design on a rational basis with quantifiable cost-benefit analysis.

  • PDF

Fine-Grain Weighted Logistic Regression Model (가중치 세분화 기반의 로지스틱 회귀분석 모델)

  • Lee, Chang-Hwan
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.53 no.9
    • /
    • pp.77-81
    • /
    • 2016
  • Logistic regression (LR) has been widely used for predicting the relationships among variables in various fields. We propose a new logistic regression model with a fine-grained weighting method, called value weighted logistic regression, by assigning different weights to each feature value. A gradient approach is utilized to obtain the optimal weights of feature values. We conduct experiments on several data sets and the experimental results show that the proposed method shows meaningful improvement in prediction accuracy.