• Title/Summary/Keyword: Sampling studies

Search Result 1,236, Processing Time 0.029 seconds

A Stratified Unknown Repeated Trials in Randomized Response Sampling

  • Singh, Housila P.;Tarray, Tanveer Ahmad
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.6
    • /
    • pp.751-759
    • /
    • 2012
  • This paper proposes an alternative stratified randomized response model based on the model of Singh and Joarder (1997). It is shown numerically that the proposed stratified randomized response model is more efficient than Hong et al. (1994) (under proportional allocation) and Kim and Warde (2004) (under optimum allocation).

A Comparison of Ensemble Methods Combining Resampling Techniques for Class Imbalanced Data (데이터 전처리와 앙상블 기법을 통한 불균형 데이터의 분류모형 비교 연구)

  • Leea, Hee-Jae;Lee, Sungim
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.3
    • /
    • pp.357-371
    • /
    • 2014
  • There are many studies related to imbalanced data in which the class distribution is highly skewed. To address the problem of imbalanced data, previous studies deal with resampling techniques which correct the skewness of the class distribution in each sampled subset by using under-sampling, over-sampling or hybrid-sampling such as SMOTE. Ensemble methods have also alleviated the problem of class imbalanced data. In this paper, we compare around a dozen algorithms that combine the ensemble methods and resampling techniques based on simulated data sets generated by the Backbone model, which can handle the imbalance rate. The results on various real imbalanced data sets are also presented to compare the effectiveness of algorithms. As a result, we highly recommend the resampling technique combining ensemble methods for imbalanced data in which the proportion of the minority class is less than 10%. We also find that each ensemble method has a well-matched sampling technique. The algorithms which combine bagging or random forest ensembles with random undersampling tend to perform well; however, the boosting ensemble appears to perform better with over-sampling. All ensemble methods combined with SMOTE outperform in most situations.

An Economic Design of Rectifying Inspection Plans Based on a Correlated Variable (대용품질특성치를 이용한 계수선별형 샘플링 검사방식의 경제적 설계)

  • Bai, D.S.;Lee, K.T.;Choi, I.S.
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.23 no.4
    • /
    • pp.793-802
    • /
    • 1997
  • A sampling plan is presented for situations where sampling inspection is based on the quality characteristic of interest and items in rejected lots are screened based on a correlated variable. A cost model is constructed which involves the costs of misclassification errors, sampling and screening inspections. A method of finding optimal values of sample size, acceptance number and cutoff value on the correlated variable is presented, and numerical studies are given.

  • PDF

Canonical Sampling Method for Initial Conditions for Reactive Flux Calculations Using Nose-Hoover Chains

  • Lee, Song-Hi;Pak, Young-Shang
    • Bulletin of the Korean Chemical Society
    • /
    • v.25 no.4
    • /
    • pp.533-538
    • /
    • 2004
  • Canonical sampling method has been presented to generate the initial conditions for reactive flux studies of organic reactions in water. Velocity Verlet version of Nose-Hoover chain dynamics algorithm has been employed to sample the initial conditions according to canonical distribution. The unstable normal mode of a transition state has been introduced to define a dividing plane separating reactant and product regions in reaction processes. This method has been implemented and tested for the case iels-Alder reaction of methyl vinyl ketone (MVK) and cyclopentadiene (CPD) in water, providing a reliable tool for further reactive flux molecular dynamics studies in condensed media.

Generalized Ratio-Cum-Product Type Estimator of Finite Population Mean in Double Sampling for Stratification

  • Tailor, Rajesh;Lone, Hilal A.;Pandey, Rajiv
    • Communications for Statistical Applications and Methods
    • /
    • v.22 no.3
    • /
    • pp.255-264
    • /
    • 2015
  • This paper addressed the problem of estimation of finite population mean in double sampling for stratification. This paper proposed a generalized ratio-cum-product type estimator of population mean. The bias and mean square error of the proposed estimator has been obtained upto the first degree of approximation. A particular member of the proposed generalized estimator was identified and studied from a comparison point of view. It is observed that the identified particular estimator is more efficient than usual unbiased estimator and Ige and Tripathi (1987) estimators. An empirical study was conducted in support of the theoretical findings.

A review of analysis methods for secondary outcomes in case-control studies

  • Schifano, Elizabeth D.
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.2
    • /
    • pp.103-129
    • /
    • 2019
  • The main goal of a case-control study is to learn the association between various risk factors and a primary outcome (e.g., disease status). Particularly recently, it is also quite common to perform secondary analyses of the case-control data in order to understand certain associations between the risk factors of the primary outcome. It has been repeatedly documented with case-control data, association studies of the risk factors that ignore the case-control sampling scheme can produce highly biased estimates of the population effects. In this article, we review the issues of the naive secondary analyses that do not account for the biased sampling scheme, and also the various methods that have been proposed to account for the case-control ascertainment. We additionally compare the results of many of the discussed methods in an example examining the association of a particular genetic variant with smoking behavior, where the data were obtained from a lung cancer case-control study.

SIMPLE RANKED SAMPLING SCHEME: MODIFICATION AND APPLICATION IN THE THEORY OF ESTIMATION OF ERLANG DISTRIBUTION

  • RAFIA GULZAR;IRSA SAJJAD;M. YOUNUS BHAT;SHAKEEL UL REHMAN
    • Journal of applied mathematics & informatics
    • /
    • v.41 no.2
    • /
    • pp.449-468
    • /
    • 2023
  • This paper deals in the study of the estimation of the parameters of Erlang distribution based on rank set sampling and some of its modifications. Here we considered Maximum Likelihood (ML) and the Bayesian technique to estimate the shape and scale parameter of Erlang distribution based on RSS and its some modifications such as ERSS, MRSS, and MRSSu. The derivation for unknown parameters of Erlang distribution is well presented using normal approximation to the asymptotic distribution of ML estimators. But due to the complexity involves in the integral, the Bayes estimator of unknown parameters is obtained using MCMC method. Further, we compared the MSE of estimation in different sampling schemes with different set sizes and cycle size. A real-life data application is also given to illustrate the efficiency of the proposed scheme.

Ratio and Product Type Exponential Estimators of Population Mean in Double Sampling for Stratification

  • Tailor, Rajesh;Chouhan, Sunil;Kim, Jong-Min
    • Communications for Statistical Applications and Methods
    • /
    • v.21 no.1
    • /
    • pp.1-9
    • /
    • 2014
  • This paper discusses the problem of estimation of finite population mean in double sampling for stratification. In fact, ratio and product type exponential estimators of population mean are proposed in double sampling for stratification. The biases and mean squared errors of proposed estimators are obtained upto the first degree of approximation. The proposed estimators have been compared with usual unbiased estimator, ratio and product estimators in double sampling for stratification. To judge the performance of the proposed estimators an empirical study has been carried out.

Policies for Improving the Survey of Research and Development in Science and Technology: The Case of Industrial Sector (과학기술연구개발활동조사의 개선방안 -기업부문을 중심으로-)

  • 유승훈;문혜선
    • Journal of Korea Technology Innovation Society
    • /
    • v.5 no.2
    • /
    • pp.228-244
    • /
    • 2002
  • The survey of research and development (R&D) in science and technology (S&T) covers the current status of R&D activities in S&T in Korea, and provides a basis for decision making regarding S&T policy. Continuous improvement of the survey is widely needed to present reliable national basic statistics. Therefore, the purpose of the study is two-fold: to introduce sampling survey method in industrial sector and to make statistical technique to deal with non-response data from industrial sector. To these ends, first, case studies of the United States and Japan are illustrated. A new sampling design for the R&D survey is proposed and implementing stratified random sampling scheme is suggested. Moreover, statistical analysis of the non-response data is dealt with. Based on several screening criteria, we develop a new imputation method suitable for the R&D survey and also provide more detailed implementation plan. Various solutions to a problem arising from non-response item are also presented. Finally, some implications of the results are discussed.

  • PDF

Self-Collection Tools for Routine Cervical Cancer Screening: A Review

  • Othman, Nor Hayati;Zaki, Fatma Hariati Mohamad
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.20
    • /
    • pp.8563-8569
    • /
    • 2014
  • Sub-optimal participation is a major problem with cervical cancer screening in developing countries which have no organized national screening program. There are various notable factors such as 'embarrassment', 'discomfort' and 'no time' cited by women as they are often also the bread winners for the family. Implementation of self-sampling methods may increase their participation. The aim of this article was to provide a survey of various types of self-sampling tools which are commonly used in collection of cervical cells. We reviewed currently available self-sampling devices and collated the advantages and disadvantages of each in terms of its acceptance and its accuracy in giving desired results. In general, regardless of which device is used, self-sampling for cervical scrapings is highly acceptable to women in most of the studies cited.