• Title/Summary/Keyword: subsample

Search Result 74, Processing Time 0.019 seconds

THE BEST CHOICE OF SUBSAMPLE SIZE (m,k) IN 3 STAGE SAMPLING (3단계 표본 추출에 있어서 부차표본(m,k)의 최상 선택)

  • 정훈조
    • Journal of applied mathematics & informatics
    • /
    • v.3 no.1
    • /
    • pp.101-115
    • /
    • 1996
  • In this paper we extend the best choice of subsample size m in the 2-stage sampling which suggested by Mohammad(1986) to the 3-stage sampling in cases of known and of unknown cost and variance ratio. We find the subsample size m.k which ensures more than the relative efficiency 90% Also we see that the choice of 3-stage subsample size depends on the design parameters using in 2-stage sampling.

Effective sampling of estuarine fauna by a passive net in theWest Sea of Korea occurring strong tide (조류가 강한 서해 하구에서 수동어구를 이용한 하구역 유영생물의 효율적 채집)

  • Hwang, Sun-Do;Im, Yang-Jae
    • Journal of the Korean Society of Fisheries and Ocean Technology
    • /
    • v.47 no.4
    • /
    • pp.338-343
    • /
    • 2011
  • To obtain the effective sampling estuarine fauna by a passive net in the West Sea of Korea occurring strong tide, catch were collected by bag nets with various sampling trials off Ganghwa Island in November 2009. We compared the difference of community structures (on spring tide vs. neap tide, total sample vs. subsample and 4 nets vs. 1 net) with each species composition as a sampling unit by the Pearson chi-square test. Number of individual at the spring tide was more abundant than that at the neap tide (p<0.0001) although number of species at the spring tide was not significantly different with that at the neap tide (p=0.174). Both number of species (p=0.138) and number of individual (p=0.096) were not significantly different between total sample and random subsample. Number of species was not significantly different between the subsample by 1 net and the subsample by 4 nets (p=0.515), but number of individual was a little different on both samples (p=0.024). In conclusion, we suggest the subsample by 1 net at spring tide as the effective sampling estuarine fauna by a passive net in the West Sea occurring strong tide.

The Three-Stage Cluster Unrelated Question Model

  • Ahn, Seung-Chul;Lee, Gi-Sung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.1
    • /
    • pp.55-65
    • /
    • 2003
  • In this study, we systemize the theoretical validity for applying unrelated question model to three-stage cluster sampling method and derive the estimate and it's variance of sensitive parameter. We derive the minimum variance form under the optimal values of the subsample sizes when the cost are fixed. Under the some given precision, we obtain the optimal values of the subsample sizes and derive the minimum cost form by using them.

  • PDF

Unmasking Multiple Outliers in Multivariate Data

  • Yoo Jong-Young
    • Communications for Statistical Applications and Methods
    • /
    • v.13 no.1
    • /
    • pp.29-38
    • /
    • 2006
  • We proposed a procedure for detecting of multiple outliers in multivariate data. Rousseeuw and van Zomeren (1990) have suggested the robust distance $RD_i$ by using the Resampling Algorithm. But $RD_i$ are based on the assumption that X is in the general position.(X is said to be in the general position when every subsample of size p+1 has rank p) From the practical points of view, this is clearly unrealistic. In this paper, we proposed a computing method for approximating MVE, which is not subject to these problems. The procedure is easy to compute, and works well even if subsample is singular or nearly singular matrix.

The Three-Stage Cluster Randomized Response Model for Obtaining Sensitive Information

  • Lee, Gi Sung;Hong, Ki Hak;Son, Chang Kyoon;Jung, Young Mee
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.1
    • /
    • pp.247-256
    • /
    • 2003
  • In this study, we systemize the theoretical validity for applying RRM to three-stage cluster sampling method and derive the estimate and it's variance of sensitive parameter. We derive the minimum variance form under the optimal values of the subsample sizes when the costs are fixed. Under the some given precision, we obtain the optimal values of the subsample sizes and derive the minimum cost form by using them. We apply the three-stage cluster RRM to field survey and suggest some necessary points for practical use.

An analysis of risk factors for falls in the elderly by gender (노인의 성별 낙상관련 요인)

  • Kim, Jong-Min;Lee, Myung-Sun;Song, Hyun-Jong
    • Korean Journal of Health Education and Promotion
    • /
    • v.25 no.2
    • /
    • pp.1-18
    • /
    • 2008
  • Objectives: The objective of this study is to identify the gender differences of risk factors for falls among the elderly in community dwellings. Methods: We analyzed the data on 3,278(male 1,255, female 2,023) persons, including 497 persons who have experienced falls, drawn from the 2004 National Elderly Survey. We conducted a cross-tabulation analysis, $X^2$-test and hierarchical regression analysis of the impact of the socio-economical characteristics, environmental characteristics, the number of chronic diseases, usage of supplementary devices, activities of daily living, dementia, and the severity of problem behavior. Results: For the entire sample of the elderly, gender, age, the size of the cities of residence, the number of chronic diseases, and the severity of problem behavior were identified as risk factors for falls. The number of chronic diseases and the severity of problem behavior were found to be significant for the male subsample, while age, the size of cities of residence, dwelling types, and the number of chronic diseases were found to be significant for the female subsample. Conclusion: The number of chronic diseases was identified as a common risk factor for falls in the male and female elderly. Chronic diseases were also found to aggravate the risk for falls when they concur with other diseases.

Prediction Model for Gastric Cancer via Class Balancing Techniques

  • Danish, Jamil ;Sellappan, Palaniappan;Sanjoy Kumar, Debnath;Muhammad, Naseem;Susama, Bagchi ;Asiah, Lokman
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.1
    • /
    • pp.53-63
    • /
    • 2023
  • Many researchers are trying hard to minimize the incidence of cancers, mainly Gastric Cancer (GC). For GC, the five-year survival rate is generally 5-25%, but for Early Gastric Cancer (EGC), it is almost 90%. Predicting the onset of stomach cancer based on risk factors will allow for an early diagnosis and more effective treatment. Although there are several models for predicting stomach cancer, most of these models are based on unbalanced datasets, which favours the majority class. However, it is imperative to correctly identify cancer patients who are in the minority class. This research aims to apply three class-balancing approaches to the NHS dataset before developing supervised learning strategies: Oversampling (Synthetic Minority Oversampling Technique or SMOTE), Undersampling (SpreadSubsample), and Hybrid System (SMOTE + SpreadSubsample). This study uses Naive Bayes, Bayesian Network, Random Forest, and Decision Tree (C4.5) methods. We measured these classifiers' efficacy using their Receiver Operating Characteristics (ROC) curves, sensitivity, and specificity. The validation data was used to test several ways of balancing the classifiers. The final prediction model was built on the one that did the best overall.

Do the Price Limits in KOSDAQ Market change on the Volatility? (코스닥시장의 가격제한폭 확대는 변동성을 증가시키는가?)

  • Park, Jong-Hae;Jung, Dae-Sung
    • Management & Information Systems Review
    • /
    • v.33 no.2
    • /
    • pp.119-133
    • /
    • 2014
  • This Research focuses on the effect of the price limits change in KOSDAQ market change on the volatility. The sample period ranges from 22 May 2000 to 24 March 2010 for daily data. We construct two subsample periods for comparing with the effect of the change of the price limit. These limits were relaxed from 12% to 15% on March 25, 2005. The first subsample period is from 25 March 2000 to 24 March 2005. The second subsample period is from 25 March 2005. to 24 March 2010. We employee four different volatility, which are the range-based volatility of Parkinson(1980; PK), Garman and Klass(1980; GK) Rogers and Satchell(1991; RS), Yang and Zhang(2008; YZ). The empirical result as follows. The major findings are summarized as follows; First, the volatility of individual stocks in KOSDAQ market reduces significantly after the price limit change. Second, There is so high volatile especially when the volatility of stock prices is high. Third, There is no meaningful relationship between volatility and market capitalization. Fourth, the more volume stocks reduce the volatility. Our results show the volatility decreased the more large volume, the more trading amount and the high price stock.

  • PDF

Bootstrap control limits of process control charts for correlative process data

  • Suzuki Hideo
    • Proceedings of the Korean Society for Quality Management Conference
    • /
    • 1998.11a
    • /
    • pp.174-179
    • /
    • 1998
  • This research explores the application of the bootstrap methods to the construction of control limits for the x charts and the EWMA charts based on single observations with stationary autoregressive processes. The subsample means-based control chars in the presence autocorrelation are also considered. We use a technique for inferring confidence intervals using bootstrap, the percentile method. Simulation studies are conducted to compare the performance of the bootstrap method and that of standard method for constructing control charts under several conditions.

  • PDF

Information Content of Commercial Paper Credit Rating Changes In Korea (우리나라 기업어음등급평가의 정보효과 검증)

  • 박범호;최호석
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2000.10a
    • /
    • pp.89-92
    • /
    • 2000
  • The purpose of this thesis is to investigate the information content of commercial paper credit rating changes of Korean firms. The result shows neither sinificant daily abnormal returns nor significant cumulative daily abnormal returns over the test window. This ind icates that commercial paper rating changes are not informative to investors. A sensitivity analysis conducted for the portfolio of subsample shows a similar result. This thesis, however, may contribute to the better operation of Korean financial market by providing several directions to establish credit-based financial transactions.

  • PDF