• Title/Summary/Keyword: Binomial Population


A simulation study for the approximate confidence intervals of hypergeometric parameter by using actual coverage probability (실제포함확률을 이용한 초기하분포 모수의 근사신뢰구간 추정에 관한 모의실험 연구)

  • Kim, Dae-Hak
    • Journal of the Korean Data and Information Science Society / v.22 no.6 / pp.1175-1182 / 2011
  • This paper discusses properties of the exact confidence interval and of some approximate confidence intervals for the hypergeometric parameter, that is, the probability of success p in the population. The binomial distribution is a well-known and widely used discrete distribution. The hypergeometric distribution frequently replaces the binomial distribution when allowance must be made for the finiteness of the population. For example, the hypergeometric distribution arises as a probability model for the number of children attacked by an infectious disease when a fixed number of them are exposed to it. Exact confidence interval estimation of the hypergeometric parameter is reviewed. We consider the binomial and normal approximations to the hypergeometric distribution and discuss the approximate confidence intervals based on them. The exact and approximate confidence intervals are compared in terms of actual coverage probability by a small-sample Monte Carlo simulation.
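
The idea of actual coverage probability in this abstract can be illustrated with a minimal sketch. It is not the paper's procedure: it assumes a normal-approximation interval with a finite population correction, and it computes coverage by exact enumeration over the hypergeometric distribution instead of by Monte Carlo simulation. The function names and the values N = 100 and n = 20 are illustrative assumptions.

```python
from math import comb, sqrt


def hypergeom_pmf(x, N, M, n):
    """P(X = x) for n draws without replacement from N items, M of them successes."""
    if x < max(0, n - (N - M)) or x > min(n, M):
        return 0.0
    return comb(M, x) * comb(N - M, n - x) / comb(N, n)


def normal_approx_interval(x, N, n, z=1.96):
    """Normal-approximation interval for p = M/N (assumed form, requires N > 1)."""
    p_hat = x / n
    fpc = (N - n) / (N - 1)  # finite population correction
    half = z * sqrt(p_hat * (1 - p_hat) / n * fpc)
    return max(0.0, p_hat - half), min(1.0, p_hat + half)


def actual_coverage(N, M, n, z=1.96):
    """Sum the probabilities of every outcome x whose interval contains p = M/N."""
    p = M / N
    total = 0.0
    for x in range(n + 1):
        lo, hi = normal_approx_interval(x, N, n, z)
        if lo <= p <= hi:
            total += hypergeom_pmf(x, N, M, n)
    return total


# Illustrative values only: coverage of the nominal 95% interval for several M.
for M in (5, 25, 50):
    print(M, round(actual_coverage(100, M, 20), 3))
```

Comparing the printed values with the nominal 0.95 shows how far the approximate interval departs from its stated confidence level for a given population.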

A Study on Factors Influencing Floating Population using Mobile Phone Data in Urban Area (이동통신 자료를 활용한 대도시 유동인구 영향요인 분석)

  • Kwak, Ho-Chan;Song, Ji Young;Eom, Jin Ki;Kim, Kyoung Tae
    • Journal of The Korean Society For Urban Railway / v.6 no.4 / pp.373-381 / 2018
  • The floating population, an index of dynamic activity in urban areas, is important for urban railway planning, but it is of limited use in practice because it can only be collected after the fact. This study investigates the factors that influence the floating population. Floating population data collected in Seoul during December 2013 are used as the dependent variable, and negative binomial regression is used for modelling. The number of households, number of employees, number of subway stations, and number of bus lines are statistically significant predictors of the floating population.

On the actual coverage probability of hypergeometric parameter (초기하분포의 모수에 대한 신뢰구간추정)

  • Kim, Dae-Hak
    • Journal of the Korean Data and Information Science Society / v.21 no.6 / pp.1109-1115 / 2010
  • This paper discusses the exact confidence interval for the hypergeometric parameter, that is, the probability of success p in the population. The binomial distribution is a well-known and widely used discrete distribution. The hypergeometric distribution frequently replaces the binomial distribution when allowance must be made for the finiteness of the population. For example, the hypergeometric distribution arises as a probability model for the number of children attacked by an infectious disease when a fixed number of them are exposed to it. Exact confidence interval estimation of the hypergeometric parameter is reviewed. We assess the performance of the exact confidence interval estimates in terms of actual coverage probability by a small-sample Monte Carlo simulation.
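
A Monte Carlo check of an exact interval, of the kind this abstract describes, might look like the sketch below. The interval construction (inverting two one-sided tail tests at level alpha/2 over all candidate values of M) is a common choice and is an assumption here, not necessarily the construction used in the paper. The population size, sample size, repetition count, and seed are likewise illustrative.

```python
import random
from math import comb


def pmf(x, N, M, n):
    """Hypergeometric P(X = x): n draws without replacement, M successes among N."""
    if x < max(0, n - (N - M)) or x > min(n, M):
        return 0.0
    return comb(M, x) * comb(N - M, n - x) / comb(N, n)


def exact_interval(x, N, n, alpha=0.05):
    """Keep every M whose two tail probabilities at the observed x both exceed alpha/2."""
    kept = []
    for M in range(N + 1):
        lower_tail = sum(pmf(k, N, M, n) for k in range(0, x + 1))  # P(X <= x)
        upper_tail = sum(pmf(k, N, M, n) for k in range(x, n + 1))  # P(X >= x)
        if lower_tail > alpha / 2 and upper_tail > alpha / 2:
            kept.append(M)
    return kept[0] / N, kept[-1] / N


def monte_carlo_coverage(N, M, n, reps=2000, alpha=0.05, seed=1):
    """Estimate how often the exact interval contains the true p = M/N."""
    rng = random.Random(seed)
    population = [1] * M + [0] * (N - M)
    # The interval depends only on x, so compute it once per possible outcome.
    intervals = [exact_interval(x, N, n, alpha) for x in range(n + 1)]
    p = M / N
    hits = 0
    for _ in range(reps):
        x = sum(rng.sample(population, n))  # one sample without replacement
        lo, hi = intervals[x]
        hits += lo <= p <= hi
    return hits / reps


print(monte_carlo_coverage(40, 12, 10))
```

Because the interval is built by test inversion, its true coverage is at least the nominal level, so the simulated estimate should sit at or above 0.95 up to simulation noise.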

Confidence Intervals for a Low Binomial Proportion (낮은 이항 비율에 대한 신뢰구간)

  • Ryu Jae-Bok;Lee Seung-Joo
    • The Korean Journal of Applied Statistics / v.19 no.2 / pp.217-230 / 2006
  • We discuss proper confidence intervals for interval estimation of a low binomial proportion. Large sample surveys are carried out in practice to find the rates of rare diseases, specific industrial disasters, and parasitic infections. Under the conditions $0 < p \leq 0.1$ and large n, we compared six confidence intervals in terms of mean coverage probability, root mean square error, and mean expected width to find a good one for interval estimation of the population proportion p. The Mid-p confidence interval performed best, followed by the AC, score, and Jeffreys confidence intervals.
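
Two of the intervals named in this abstract, the score (Wilson) and AC (Agresti-Coull) intervals, can be sketched together with a mean coverage probability over a grid of small p. This is a minimal sketch using the standard textbook formulas. The sample size n = 200, the grid of p values, and the function names are assumptions for illustration, and the Mid-p and Jeffreys intervals are not implemented.

```python
from math import exp, lgamma, log, log1p, sqrt


def binom_pmf(x, n, p):
    """Binomial pmf computed on the log scale, valid for 0 < p < 1."""
    log_pmf = (lgamma(n + 1) - lgamma(x + 1) - lgamma(n - x + 1)
               + x * log(p) + (n - x) * log1p(-p))
    return exp(log_pmf)


def wilson(x, n, z=1.96):
    """Score (Wilson) interval."""
    p_hat = x / n
    denom = 1 + z * z / n
    center = (p_hat + z * z / (2 * n)) / denom
    half = z * sqrt(p_hat * (1 - p_hat) / n + z * z / (4 * n * n)) / denom
    return max(0.0, center - half), min(1.0, center + half)


def agresti_coull(x, n, z=1.96):
    """AC interval: a Wald interval on the adjusted counts x + z^2/2 and n + z^2."""
    n_adj = n + z * z
    p_adj = (x + z * z / 2) / n_adj
    half = z * sqrt(p_adj * (1 - p_adj) / n_adj)
    return max(0.0, p_adj - half), min(1.0, p_adj + half)


def coverage(interval, n, p, z=1.96):
    """Exact coverage probability of an interval method at a fixed p."""
    total = 0.0
    for x in range(n + 1):
        lo, hi = interval(x, n, z)
        if lo <= p <= hi:
            total += binom_pmf(x, n, p)
    return total


def mean_coverage(interval, n, grid, z=1.96):
    """Average coverage over a grid of p values."""
    return sum(coverage(interval, n, p, z) for p in grid) / len(grid)


grid = [0.005 * k for k in range(1, 21)]  # p = 0.005, ..., 0.100 (assumed grid)
print("score:", round(mean_coverage(wilson, 200, grid), 4))
print("AC:   ", round(mean_coverage(agresti_coull, 200, grid), 4))
```

Both intervals share the same center, and the AC interval is never narrower than the score interval, so its coverage at any p is at least as high.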

Binomial Sampling Plans for the Citrus Red Mite, Panonychus citri(Acari: Tetranychidae) on Satsuma Mandarin Groves in Jeju (온주밀감에서 귤응애의 이항표본조사법 개발)

  • 송정흡;이창훈;강상훈;김동환;강시용;류기중
    • Korean journal of applied entomology / v.40 no.3 / pp.197-202 / 2001
  • The density of the citrus red mite (CRM), Panonychus citri (McGregor), in commercial satsuma mandarin (Citrus unshiu L.) groves in Jeju was determined over 2 years by counting the number of CRM per leaf in leaf samples. Binomial sampling plans were developed based on the relationship between the mean density per leaf (m) and the proportion of leaves infested with less than T mites per leaf ($P_{T}$), according to the empirical model $\ln(m)=\alpha+\beta\ln(-\ln(1-P_{T}))$. T was defined as the tally threshold and was set to 1, 3, 5, and 7 mites per leaf in this study. Increasing the sample size, regardless of tally threshold, had little effect on the precision of the binomial sampling plan. Increasing the sample size also had little effect on the precision of the estimated mean, regardless of tally threshold. T=1 was chosen as the best tally threshold for estimating CRM densities based on the precision of the model. The binomial model with T=1 provided reliable predictions of the mean CRM densities observed in commercial satsuma mandarin groves. Binomial sequential sampling procedures were developed for classifying the density of CRM. A binomial sampling program for deciding the CRM population level, based on an action threshold of 2 mites per leaf, was obtained.
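
The empirical model quoted in this abstract can be evaluated directly once its coefficients are known. The sketch below only implements the formula and its inverse. The abstract does not report the fitted coefficients, so the values used for alpha and beta are hypothetical placeholders, and the function names are illustrative.

```python
from math import exp, log


def mean_density(p_t, alpha, beta):
    """Mean density m from ln(m) = alpha + beta * ln(-ln(1 - P_T))."""
    if not 0.0 < p_t < 1.0:
        raise ValueError("P_T must lie strictly between 0 and 1")
    return exp(alpha + beta * log(-log(1.0 - p_t)))


def proportion_from_mean(m, alpha, beta):
    """Inverse of the model: P_T = 1 - exp(-exp((ln(m) - alpha) / beta))."""
    if m <= 0.0:
        raise ValueError("m must be positive")
    return 1.0 - exp(-exp((log(m) - alpha) / beta))


# Hypothetical coefficients, for illustration only.
ALPHA, BETA = 0.5, 1.2
for p_t in (0.2, 0.4, 0.6):
    print(p_t, round(mean_density(p_t, ALPHA, BETA), 3))
```

The inverse function gives the proportion of leaves that corresponds to a chosen mean density, which is how an action threshold expressed in mites per leaf would be translated into a proportion for a binomial sampling plan.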


The Data-based Prediction of Police Calls Using Machine Learning (기계학습을 활용한 데이터 기반 경찰신고건수 예측)

  • Choi, Jaehun
    • The Journal of Bigdata / v.3 no.2 / pp.101-112 / 2018
  • The purpose of the study is to predict the number of police calls using a neural network, a machine learning method, and negative binomial regression, based on data on 112 police calls received by the Chungnam Provincial Police Agency from June 2016 to May 2017. The following variables, which may affect police calls, were selected for developing the prediction model: time, holiday, the day before a holiday, season, temperature, precipitation, wind speed, jurisdictional area, population, the number of foreigners, single house rate, and other house rate. Some variables show a positive correlation and others a negative one. The comparison of the methods can be summarized as follows. The neural network has a correlation coefficient of 0.7702 between predicted and actual values, with an RMSE of 2.557. Negative binomial regression, on the other hand, shows a correlation coefficient of 0.7158 with an RMSE of 2.831. The neural network has low interpretability but better predictive performance than negative binomial regression. Based on the prediction model, the police agency can allocate manpower optimally for given values of the selected variables.
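
The two comparison metrics reported in this abstract, the correlation coefficient between predicted and actual values and the RMSE, can be computed as in the following minimal sketch. The call counts are made-up placeholders, not data from the study, and the function names are illustrative.

```python
from math import sqrt


def rmse(actual, predicted):
    """Root mean square error between two equal-length sequences."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("sequences must be non-empty and of equal length")
    return sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


def pearson_r(xs, ys):
    """Pearson correlation coefficient."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Placeholder values for illustration only.
actual_calls = [3, 7, 5, 10, 8]
predicted_calls = [4, 6, 5, 9, 9]
print("r =", round(pearson_r(actual_calls, predicted_calls), 4))
print("RMSE =", round(rmse(actual_calls, predicted_calls), 4))
```

A higher correlation together with a lower RMSE on the same held-out data is the pattern the abstract reports for the neural network relative to negative binomial regression.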

Robust Bayesian Inference in Finite Population Sampling under Balanced Loss Function

  • Kim, Eunyoung;Kim, Dal Ho
    • Communications for Statistical Applications and Methods / v.21 no.3 / pp.261-274 / 2014
  • In this paper we develop Bayes and empirical Bayes estimators of the finite population mean under the balanced loss function, assuming posterior linearity rather than normality of the superpopulation. We compare the performance of the optimal Bayes estimator with that of the classical sample mean and of the usual Bayes estimator under squared error loss, with respect to posterior expected losses, risks, and Bayes risks, when the underlying distribution is normal as well as when it is binomial or Poisson.

An Acceptance Sampling Plan for Products from Production Process with Variable Fraction Defective (불량률이 가변적인 공정으로부터 생산된 제품에 대한 수명시험 샘플링 검사방식 설계)

  • 권영일
    • Journal of Korean Society for Quality Management / v.30 no.2 / pp.152-159 / 2002
  • An acceptance sampling plan is developed for products manufactured by a production process with a variable fraction defective. We consider a situation where defective products have short lifetimes and non-defective ones never fail during the technological life of the products. An acceptance criterion that guarantees the outgoing quality of accepted products is derived using prior information on the quality of the products. Numerical examples are provided.

A Study on the Influence of the Space Syntax and the Urban Characteristics on the Incidence of Crime Using Negative Binomial Regression (음이항 회귀모형을 이용한 공간구문론 및 도시특성요소가 범죄발생에 미치는 영향 연구)

  • Kim, Hyeong Jun;Choi, Yeol
    • KSCE Journal of Civil and Environmental Engineering Research / v.36 no.2 / pp.333-340 / 2016
  • This study aims to understand the characteristics of crime in Busan through an empirical analysis, based on space syntax, of the factors that determine crime occurrence. Poisson regression and negative binomial regression were used for the analysis. Eight of the 13 variables were significant. The results are summarized as follows. Among the urban characteristics, the statistically significant variables are the female ratio, the ratio of the population over 65, the administrative area, and the commercial area ratio. The more CCTVs a region has, the lower its crime rate. The study also examined whether space syntax variables can predict where crime occurs: spaces with low connectivity become a causal factor in crime because they have few related spaces and therefore a low likelihood that someone will suddenly appear and interrupt, which results in a low level of surveillance by pedestrians. The study provides basic data that can contribute to urban planning and to crime prevention measures.
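
The abstract does not say why both Poisson and negative binomial regression were used. A common reason, assumed here, is overdispersion: count data whose variance exceeds its mean. The sketch below compares the two distributions at the same mean using the NB2 parameterization, in which the variance is mu + alpha * mu^2. The values mu = 3 and alpha = 0.5 are hypothetical, and no regression is fitted.

```python
from math import exp, lgamma, log


def poisson_pmf(y, mu):
    """Poisson probability of count y with mean mu."""
    return exp(y * log(mu) - mu - lgamma(y + 1))


def neg_binom_pmf(y, mu, alpha):
    """Negative binomial (NB2) probability with mean mu and dispersion alpha > 0."""
    r = 1.0 / alpha
    log_pmf = (lgamma(y + r) - lgamma(r) - lgamma(y + 1)
               + r * log(r / (r + mu)) + y * log(mu / (r + mu)))
    return exp(log_pmf)


def moments(pmf, upper=500):
    """Mean and variance by summing over counts 0..upper-1 (tail assumed negligible)."""
    mean = sum(y * pmf(y) for y in range(upper))
    var = sum((y - mean) ** 2 * pmf(y) for y in range(upper))
    return mean, var


MU, ALPHA = 3.0, 0.5  # hypothetical values
print("Poisson (mean, var):", moments(lambda y: poisson_pmf(y, MU)))
print("NB2     (mean, var):", moments(lambda y: neg_binom_pmf(y, MU, ALPHA)))
```

The Poisson variance equals the mean, while the negative binomial variance is larger by alpha * mu^2, which is the extra flexibility that makes it a frequent choice for crime counts.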

The Design and Implementation to Teach Sampling Distributions with the Statistical Inferences (통계적 추론에서의 표집분포 개념 지도를 위한 시뮬레이션 소프트웨어 설계 및 구현)

  • Lee, Young-Ha;Lee, Eun-Ho
    • School Mathematics / v.12 no.3 / pp.273-299 / 2010
  • The purpose of the study is to design and implement 'Sampling Distributions Simulation', a computer simulation developed to help students understand the concept of sampling distributions more easily. 'Sampling Distributions Simulation' consists of 4 sessions. The first session - Confidence level and confidence intervals - checks whether the intended confidence level is actually achieved by the relative frequency with which the confidence intervals obtained from samples contain the population mean. This gives students a clearer idea of confidence levels and confidence intervals, as well as of the role that the sampling distribution of the sample mean plays in them. The second session - Sampling Distributions - helps students understand the sampling distribution of the sample mean by comparing, through simulation, the histogram of the sampling distribution with that of the population. The third session - The Central Limit Theorem - calculates the means of samples taken from a population that follows a uniform distribution or a Bernoulli distribution and then draws histograms of those means. This aids comprehension of the central limit theorem, which describes the sampling distribution of the sample mean when the sample size is very large. The fourth session - The normal approximation to the binomial distribution - helps students understand the normal approximation to the binomial distribution as an alternative version of the central limit theorem. Through practical use of the shareware 'Sampling Distributions Simulation', we expect students to gain a new perspective on the sampling distribution and to give it more emphasis. With a sound understanding of sampling distributions, more accurate and profound statistical inferences are expected, and the role of the sampling distribution in inference should be more deeply appreciated.
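
The third and fourth sessions described in this abstract can be approximated in a few lines. This is a minimal sketch and not the software from the paper: it simulates means of Bernoulli samples and compares an exact binomial probability with its normal approximation, using a continuity correction. All parameter values, the seed, and the function names are illustrative assumptions.

```python
import random
from math import comb, sqrt
from statistics import NormalDist, mean, stdev


def bernoulli_sample_means(p, n, reps, seed=0):
    """Means of `reps` independent samples of size n from a Bernoulli(p) population."""
    rng = random.Random(seed)
    return [sum(rng.random() < p for _ in range(n)) / n for _ in range(reps)]


def binom_cdf(k, n, p):
    """Exact P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, x) * p ** x * (1 - p) ** (n - x) for x in range(k + 1))


def normal_approx_cdf(k, n, p):
    """Normal approximation to P(X <= k) with a continuity correction of 0.5."""
    return NormalDist(n * p, sqrt(n * p * (1 - p))).cdf(k + 0.5)


# Central limit theorem: the sample means centre on p with spread sqrt(p(1-p)/n).
means = bernoulli_sample_means(p=0.3, n=50, reps=2000)
print("mean of sample means:", round(mean(means), 4))
print("sd of sample means:  ", round(stdev(means), 4))
print("theoretical sd:      ", round(sqrt(0.3 * 0.7 / 50), 4))

# Normal approximation to the binomial distribution.
print("exact:        ", round(binom_cdf(55, 100, 0.5), 4))
print("approximation:", round(normal_approx_cdf(55, 100, 0.5), 4))
```

Plotting a histogram of `means` for increasing n would reproduce the visual comparison that the simulation sessions are built around.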
