• Title/Summary/Keyword: proportion data

Comparison of methods for the proportion of true null hypotheses in microarray studies

  • Kang, Joonsung
    • Communications for Statistical Applications and Methods / v.27 no.1 / pp.141-148 / 2020
  • We consider estimating the proportion of true null hypotheses in multiple testing problems. The traditional multiple testing error rate, the family-wise error rate, is too conservative for controlling type I error in multiple testing setups, whereas the false discovery rate (FDR) has received significant attention in many research areas such as GWAS data, fMRI data, and signal processing. Identifying differentially expressed genes in microarray studies involves estimating the proportion of true null hypotheses in FDR procedures. Because the genuine dependence structure of microarray data is unknown, this estimation must account for unknown dependence among genes. We compare various procedures on simulated data and real microarray data, using a hidden Markov model to generate simulated data with dependency. The Cai procedure (2007) and a sliding linear model procedure (2011) have relatively smaller bias and standard errors, making them more suitable for estimating the proportion of true null hypotheses in simulated data under various setups. Real data analysis shows that five of the nine estimation procedures give nearly the same estimated proportion of true null hypotheses in microarray data.
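
To make the quantity in this abstract concrete, the following is a minimal sketch of one simple, widely used estimator of the proportion of true null hypotheses: the lambda-threshold estimator usually attributed to Storey. It is shown only as an illustration and is not claimed to be one of the nine procedures compared in the paper. The function names, the tuning value `lam=0.5`, and the simulated p-values (independent, with a Beta-distributed alternative) are all assumptions made for the example; in particular, the sketch ignores the dependence among genes that the abstract emphasizes.

```python
import random


def storey_pi0(p_values, lam=0.5):
    """Lambda-threshold estimate of the proportion of true nulls.

    Under a true null, p-values are Uniform(0, 1), so a fraction (1 - lam)
    of them is expected above lam. Dividing the observed count above lam
    by m * (1 - lam) therefore estimates the null proportion.
    """
    if not 0.0 <= lam < 1.0:
        raise ValueError("lam must lie in [0, 1)")
    m = len(p_values)
    if m == 0:
        raise ValueError("need at least one p-value")
    tail = sum(1 for p in p_values if p > lam)
    # The raw ratio can exceed 1, so it is capped at 1.
    return min(1.0, tail / (m * (1.0 - lam)))


def simulate_p_values(m=5000, pi0=0.8, seed=0):
    """Hypothetical independent p-values: a pi0 share of nulls, the rest alternatives."""
    rng = random.Random(seed)
    p_values = []
    for _ in range(m):
        if rng.random() < pi0:
            p_values.append(rng.random())  # null: Uniform(0, 1)
        else:
            p_values.append(rng.betavariate(0.2, 5.0))  # alternative: mass near 0
    return p_values


if __name__ == "__main__":
    estimate = storey_pi0(simulate_p_values(), lam=0.5)
    print(f"estimated proportion of true nulls: {estimate:.3f}")
```

With the simulated true proportion set to 0.8, the estimate should land close to 0.8; under dependence, the variance of such an estimator can be considerably larger than this independent simulation suggests.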

A study on the proportion of plans and elevations in traditional architecture - Focused on the period of Joseon - (전통 건축에 있어서 평면 및 입면의 비례 연구 - 조선시대 중심으로 -)

  • Seo, Hyun-Su;Kong, Sung-Hoon;An, Chang-Hwan
    • Proceeding of Spring/Autumn Annual Conference of KHA / 2009.04a / pp.91-94 / 2009
  • The purpose of this study is to examine the proportions of plans and elevations in traditional architecture of the Joseon period. Data collected from traditional buildings are analyzed in terms of basic design methods, namely the horizontal and vertical factors of floor plans and front elevations. The results of the analysis are as follows. The proportions of floor plans in traditional architecture range from 1 : 3.8 to 1 : 1.21. The proportions of front elevations range from 1 : 8.1 to 1 : 1.03. The average proportion of floor plans is 1.87, and the average proportion of front elevations is 2.59.

Variance estimation for distribution rate in stratified cluster sampling with missing values

  • Heo, Sunyeong
    • Journal of the Korean Data and Information Science Society / v.28 no.2 / pp.443-449 / 2017
  • Population proportions, such as the distribution rate of LED TVs and the prevalence of a disease, are often estimated from survey sample data. A population proportion is generally regarded as a special form of population mean. In complex sampling, such as stratified multistage sampling with unequal selection probabilities, the denominator of the mean may be a random variable, in which case the mean is estimated as a ratio estimator. In this research, we examined the estimation of the distribution rate under stratified multistage sampling and obtained numerical results using stratified random sample data with about 25% missing observations. In the data used for this research, the survey weights were determined deterministically. The weights are therefore not random variables, and the population distribution rate and its variance can be estimated in the same way as a population mean. When the weights are not random variables, estimating the variance of the proportion estimator with the ratio method may inflate the variance. Therefore, when estimating the variance of a population proportion, we need to examine the structure of the data and the survey design before choosing an estimation method.

Small Domain Estimation of the Proportion Using Survey Weights

  • Kim, Dal-Ho
    • Journal of the Korean Data and Information Science Society / v.18 no.4 / pp.1179-1189 / 2007
  • In this paper, we estimate the proportion of individuals having health insurance in a given year for several small domains cross-classified by age, sex, and other demographic characteristics, using data provided by the National Center for Health Statistics (NCHS). We employ Bayesian as well as frequentist methodology to obtain small domain estimates and the associated measures of precision. One new feature of our study is that we use the survey weights along with the model to derive the small domain estimates.

A Combined Procedure of Direct Question Method and Modified Randomized Response Technique for Estimating Population Proportion

  • Kim, Hyuk-Joo
    • Journal of the Korean Data and Information Science Society / v.14 no.4 / pp.877-887 / 2003
  • A two-stage procedure is proposed to estimate the population proportion of a sensitive group. The proposed procedure is obtained by combining the direct question method and a modified randomized response technique. It is verified that the proposed procedure is more efficient than existing methods under some mild conditions.
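
For readers unfamiliar with randomized response, the sketch below shows the classic Warner design, which is a common baseline for this family of methods. It is not the two-stage combined procedure proposed in the paper, whose details the abstract does not give. In the assumed design, each respondent privately answers the sensitive question with probability `p_design` and its negation otherwise, so the probability of a "yes" is `p_design * pi + (1 - p_design) * (1 - pi)`. The function name and parameter names are hypothetical.

```python
def warner_estimate(yes_count, n, p_design):
    """Estimate a sensitive proportion under Warner's randomized response design.

    yes_count: number of "yes" answers observed
    n:         number of respondents
    p_design:  probability that the randomizing device selects the sensitive
               statement itself (must differ from 0.5, otherwise the "yes"
               rate carries no information about the proportion)
    Returns (pi_hat, var_hat): the point estimate and an estimate of its variance.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    if not 0 <= yes_count <= n:
        raise ValueError("yes_count must lie between 0 and n")
    if abs(p_design - 0.5) < 1e-12:
        raise ValueError("p_design must differ from 0.5")

    lam_hat = yes_count / n  # observed share of "yes" answers
    scale = 2.0 * p_design - 1.0
    # Invert lam = p * pi + (1 - p) * (1 - pi) for pi.
    pi_hat = (lam_hat - (1.0 - p_design)) / scale
    # Binomial variance of lam_hat, rescaled by the same linear map.
    var_hat = lam_hat * (1.0 - lam_hat) / (n * scale ** 2)
    return pi_hat, var_hat


if __name__ == "__main__":
    pi_hat, var_hat = warner_estimate(yes_count=40, n=100, p_design=0.7)
    print(f"pi_hat = {pi_hat:.3f}, var_hat = {var_hat:.4f}")
```

For example, 40 "yes" answers out of 100 with `p_design = 0.7` give an estimate of about 0.25. The variance grows quickly as `p_design` approaches 0.5, which is the efficiency cost that combined procedures of the kind described in the abstract aim to reduce.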

A Study of Reliability of Lecture Evaluation by Students

  • Kim, Jong-Tae;Lee, Jae-Man
    • Journal of the Korean Data and Information Science Society / v.15 no.1 / pp.183-191 / 2004
  • This paper shows that some extraneous factors affect students' evaluation of lectures. These factors are sex, day or night class, academic year, lecture size, and grades. The paper also analyzes the proportion of students who give the same mark on all items (the same-marked proportion).

Development of the Proportion Design Program for 40-60 MPa High Strength Concrete (40~60MPa급 고강도 콘크리트 배합설계 프로그램 개발)

  • Yoo, Seung-Yeup;Choi, Dong-Ho;Lee, Sang-Rae;Koo, Ja-Sul;Kang, Suck-Hwa
    • Proceedings of the Korea Concrete Institute Conference / 2008.04a / pp.401-404 / 2008
  • This study developed a mixture proportion design for high strength concrete in order to systematically establish quality control methods for high strength ready-mixed concrete applied in the construction field. Through experiments, it derived an estimation formula that predicts the mixture proportion for 40-60 MPa class high strength concrete. The program, built with Visual Basic and MS-SQL, provides data sharing through a server, storage and output of data, and estimation of the mixture proportion design based on the experimental results, so it may contribute to the systematic establishment of quality control methods for high strength ready-mixed concrete. Because it was produced for laboratory conditions, it can serve as fundamental data for the mixture proportion design of high strength concrete. If it is later upgraded with the mixture proportion data of each factory, it may contribute to stable quality and production of high strength ready-mixed concrete suited to the characteristics of each factory.

An exploratory study of factors related to long-term hospitalization of inpatients using the quality assessment data for long-term care hospitals (요양병원 입원급여 적정성 평가 결과를 활용한 요양병원 입원환자의 장기입원 관련 요인 탐색 연구)

  • Ji-Yoon Lee;Eun-Woo Nam;Hyoung-Sun Jeong;Min-Hee Heo;Jin-Won Noh
    • Korea Journal of Hospital Management / v.28 no.3 / pp.58-67 / 2023
  • Purpose: The purpose of this study was to analyze the factors associated with long-term hospitalized patients in long-term care hospitals using the quality assessment data for long-term care hospitals by the Health Insurance Review. Methods: For 1,376 long-term care hospitals, frequency analysis and descriptive statistics were used to describe the characteristics of the hospitals. Multiple linear regression was conducted to examine the associations between infrastructure characteristics, medical personnel characteristics, health outcomes, and the proportion of long-term hospitalized patients. Results: The number of patients per doctor, the number of patients per nurse, and the number of patients per nursing staff were positively associated with the proportion of long-term hospitalized patients. Among health outcomes, a higher proportion of patients with more than 5% weight loss compared to the previous month and a higher proportion of patients showing improvement in ADL were associated with a lower proportion of long-term hospitalized patients. However, the proportion of diabetic patients with HbA1c test results within the appropriate range was positively associated with the proportion of long-term hospitalized patients. Conclusion: The results of the present study provide fundamental data for establishing policies for long-term care hospitals. Based on this study, it is important to propose measures for screening unnecessary long-term hospitalizations, such as securing sufficient medical personnel, to improve the quality of care in long-term care hospitals. It is also necessary to clearly separate the roles of medical institutions and long-term care facilities and to implement policies that support patients' social reintegration.

A Study for Efficient EM Algorithms for Estimation of the Proportion of a Mixed Distribution (분포 혼합비율의 모수추정을 위한 효율적인 알고리즘에 관한 연구)

  • 황강진;박경탁;유희경
    • Journal of Korean Society for Quality Management / v.30 no.4 / pp.68-77 / 2002
  • The EM algorithm converges well as a numerical procedure but proceeds in very small steps. When estimating the proportion in a mixed distribution from a very large incomplete data set, or when new data arrive continuously, however, the EM algorithm depends heavily on the initial value and converges slowly. Many studies have sought to improve the convergence rate of the EM algorithm in estimating the proportion parameter of mixed data. Among them, the dynamic EM algorithm of Murray Jorgensen and the Titterington algorithm of D. M. Titterington have been shown to converge faster than the standard EM algorithm when new data are continuously added. In this paper, we apply the dynamic EM algorithm and the Titterington algorithm to the estimation of a mixed Poisson distribution and compare their convergence rates by simulation.
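
As a point of reference for the comparison described in this abstract, the sketch below implements the standard batch EM iteration for the mixing proportion of a two-component Poisson mixture. It is the baseline algorithm only, not the dynamic EM or Titterington variants studied in the paper. To keep the example short, the two component means `mu1` and `mu2` are assumed known, so the mixing proportion is the only unknown; all function names, the simulated data, and the stopping rule are assumptions made for illustration.

```python
import math
import random


def poisson_pmf(x, mu):
    """Poisson probability mass, computed on the log scale for stability."""
    return math.exp(x * math.log(mu) - mu - math.lgamma(x + 1))


def em_mixing_proportion(data, mu1, mu2, pi_init=0.5, tol=1e-8, max_iter=1000):
    """Standard EM for pi in the mixture pi * Poisson(mu1) + (1 - pi) * Poisson(mu2).

    Returns (pi_hat, iterations_used). pi_init should lie strictly in (0, 1).
    """
    pi = pi_init
    for iteration in range(1, max_iter + 1):
        # E-step: posterior probability that each point came from component 1.
        resp = []
        for x in data:
            a = pi * poisson_pmf(x, mu1)
            b = (1.0 - pi) * poisson_pmf(x, mu2)
            resp.append(a / (a + b))
        # M-step: the updated proportion is the mean responsibility.
        new_pi = sum(resp) / len(resp)
        if abs(new_pi - pi) < tol:
            return new_pi, iteration
        pi = new_pi
    return pi, max_iter


def sample_poisson(rng, mu):
    """Draw one Poisson(mu) variate with Knuth's multiplication method."""
    limit = math.exp(-mu)
    k = 0
    prod = rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def simulate_mixture(n, pi, mu1, mu2, seed=0):
    """Hypothetical data: each point comes from component 1 with probability pi."""
    rng = random.Random(seed)
    return [sample_poisson(rng, mu1 if rng.random() < pi else mu2) for _ in range(n)]


if __name__ == "__main__":
    data = simulate_mixture(n=4000, pi=0.3, mu1=2.0, mu2=10.0)
    pi_hat, iters = em_mixing_proportion(data, mu1=2.0, mu2=10.0, pi_init=0.9)
    print(f"pi_hat = {pi_hat:.3f} after {iters} iterations")
```

Each iteration makes a full pass over the data, which is why batch EM becomes costly when observations arrive continuously. Online variants update the estimate incrementally instead of recomputing every responsibility.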

Bayesian estimation for finite population proportion under selection bias via surrogate samples

  • Choi, Seong Mi;Kim, Dal Ho
    • Journal of the Korean Data and Information Science Society / v.24 no.6 / pp.1543-1550 / 2013
  • In this paper, we study Bayesian estimation of the finite population proportion in binary data under selection bias. We use a Bayesian nonignorable selection model to accommodate the selection mechanism. We compare four possible estimators of the finite population proportion based on data analysis as well as Monte Carlo simulation. It turns out that the nonignorable selection model might be useful for weakly biased samples.