• Title/Summary/Keyword: 이항분포

Search Result 140, Processing Time 0.025 seconds

Randomizing Sequences of Finite Length (유한 순서열의 임의화)

  • Huh, Myung-Hoe;Lee, Yong-Goo
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.1
    • /
    • pp.189-196
    • /
    • 2010
  • It is never an easy task to physically randomize the sequence of cards. For instance, US 1970 draft lottery resulted in a social turmoil since the outcome sequence of 366 birthday numbers showed a significant relationship with the input order (Wikipedia, "Draft Lottery 1969", Retrieved 2009/05/01). We are motivated by Laplace's 1825 book titled Philosophical Essay on Probabilities that says "Suppose that the numbers 1, 2, ..., 100 are placed, according to their natural ordering, in an urn, and suppose further that, after having shaken the urn, to shuffle the numbers, one draws one number. It is clear that if the shuffling has been properly done, each number will have the same chance of being drawn. But if we fear that there are small differences between them depending on the order in which the numbers were put into the urn, we can decrease these differences considerably by placing these numbers in a second urn in the order in which they are drawn from the first urn, and then shaking the second urn to shuffle the numbers. These differences, already imperceptible in the second urn, would be diminished more and more by using a third urn, a fourth urn, &c." (translated by Andrew 1. Dale, 1995, Springer. pp. 35-36). Laplace foresaw what would happen to us in 150 years later, and, even more, suggested the possible tool to handle the problem. But he did omit the detailed arguments for the solution. Thus we would like to write the supplement in modern terms for Laplace in this research note. We formulate the problem with a lottery box model, to which Markov chain theory can be applied. By applying Markov chains repeatedly, one expects the uniform distribution on k states as stationary distribution. Additionally, we show that the probability of even-number of successes in binomial distribution with trials and the success probability $\theta$ approaches to 0.5, as n increases to infinity. Our theory is illustrated to the cases of truncated geometric distribution and the US 1970 draft lottery.

Spatial Distribution and Sampling Plan for Pink Citrus Rust Mite, Aculops pelekassi (Acari: Eriophyidae) in Citrus Orchard (감귤원에서 귤녹응애 공간분포 분석과 표본조사법 개발)

  • Song, Jeong-Heub;Hong, Soon-Yeong;Lee, Shin-Chan
    • Korean journal of applied entomology
    • /
    • v.51 no.2
    • /
    • pp.91-97
    • /
    • 2012
  • The dispersion indices, spatial pattern and sampling plan for pink citrus rust mite (PCRM), Aculops pelekassi, monitoring was investigated. Dispersion indices of PCRM indicated the aggregated spatial pattern. Taylor's power law provided better description of variance-mean relationship than Iwao's patchiness regression. Fixed-precision levels (D) of a sequential sampling plan were developed using by Taylor's power law parameters generated from PCRM on fruit sample (cumulated number of PCRM in $cm^2$ of fruit). Based on Kono-Sugino's empirical binomial the mean density per $cm^2$ could be estimated from fruit ratio with more than 12 rust mites per $cm^2$: $ln(m)=4.61+1.23ln[-ln(1-p_{12})]$. To determine the optimal tally threshold, the variance (var(lnm)) for mean (lnm) in Kono-Sugino equation was estimated. The lower and narrow ranged change of variance for esimated mean showed at a tally threshold of 12. To estimate PCRM mean density per $cm^2$ at fixed precision level 0.25, the required sample number was 13 trees, 5 fruits per tree and 2 points per fruit (total 130 samples).

Within Field Distribution Pattern and Design of a Sampling Plan for Damaged Onions by the Onion maggot, Hylemya antiqua Meigen(Diptera: Anthomyiidae) (고자리파리에 의한 양파피해(被害)의 포장내(圃場內) 분포양식(分布樣式)과 피해량(被害量) 추정(推定)을 위한 표본추출(標本抽出) 계획(計劃))

  • Park, C.G.;Hyun, J.S.;Cho, D.J.;Lee, K.S.;Hah, J.K.
    • Korean journal of applied entomology
    • /
    • v.24 no.1 s.62
    • /
    • pp.29-33
    • /
    • 1985
  • Every plant in $990m^2$ onion field was inspected for damages by the onion maggot. Maps were constructed every ten days to show which plants were infested and which were not from April 11 to May 21, 1984. The maps were sectioned into squares one of which contains 80 onion plants and the counts of damaged onions in each square were fitted to poisson and negative binomial distribution and tested by chi-square. We argue that the satisfactory fitness of the expected negative binomial $[P(x^2)>0.05]$ provided a useful description of the spatial distribution patterns of the damaged onions. Edge effect was tested by the differences of damage ratio and variance/mean ratio (${\sigma}^2/m$) between edge and center part. The result showed that the damage ratioes and variances of all the periods, ${\sigma}^2/m$ values after May 1 were greater in edge part than in center part. Again, the maps were sectioned into four blocks and the squares (sample units) were sectioned into quadrants. By application of the variance component technique, it was suggested that $2{\sim}8$ sample units for 5% sampling error and $1{\sim}2$ sample units for 10% error should be sampled randomly to estimate the damage ratio when $2{\sim}3$ quadrants were inspected.

  • PDF

Variable Selection with Log-Density in Logistic Regression Model (로지스틱회귀모형에서 로그-밀도비를 이용한 변수의 선택)

  • Kahng, Myung-Wook;Shin, Eun-Young
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.1
    • /
    • pp.1-11
    • /
    • 2012
  • We present methods to study the log-density ratio of the conditional densities of the predictors given the response variable in the logistic regression model. This allows us to select which predictors are needed and how they should be included in the model. If the conditional distributions are skewed, the distributions can be considered as gamma distributions. A simulation study shows that the linear and log terms are required in general. If the conditional distributions of xjy for the two groups overlap significantly, we need both the linear and log terms; however, only the linear or log term is needed in the model if they are well separated.

The Effects of Collaborative R&D Activity on Product and Process Innovation: A Negative Binomial Modeling Approach (기업의 공동연구개발활동이 제품혁신 및 공정혁신에 미치는 영향 - 음이항회귀모형을 활용하여 -)

  • Kim, Chanyong;Choi, Ye Seul;Lim, Up
    • Journal of the Korean Regional Science Association
    • /
    • v.31 no.4
    • /
    • pp.107-128
    • /
    • 2015
  • Technology innovation is a competitive weapon of sustainable economic growth at the urban and regional level and the growth of firms. In this study, we empirically investigate the effects of collaborative R&D activity on product innovative outputs and process innovative outputs in manufacturing firms in Korea. We analyze the links between collaborative R&D activity and two types of innovative outputs using an alternative negative binomial regression model. The major finding is that collaborative R&D activity has significant positive effects on both product and process innovation. The results also identify a positive link between all types of innovative outputs and other R&D activities including internal R&D activity, patent activity, external technology and capital goods acquisitions. To induce corporate growth that enhances the productivity of individual firms and produces prolonged economic growth, policy makers should place greater emphasis on creating effective arrangements to promote establishing collaborative R&D strategies for manufacturing firms.

A Study of Accident Models for Highway Interchange Ramps (고속도로 연결로의 교통사고 추정모형 연구)

  • Roh, Chang-Gyun;Park, Chong-Seo;Son, Bong-Soo
    • Journal of Korean Society of Transportation
    • /
    • v.26 no.4
    • /
    • pp.29-40
    • /
    • 2008
  • Although a good understanding of the relationship between highway traffic accidents and highway geometric features is fundamental in highway design and safety, the relationship is not well understood quantitatively. The overall goal of this paper is to formulate a reliable statistical model fitting to historical highway accident data. The model can be used to estimate the effect of road design elements on safety for the practical purposes of highway design applications. En route to achieving this goal, a number of specific research objectives were accomplished: investigate the major design elements affecting highway safety; review the existing modeling approaches in order to assess the relationship between safety and highway design features; and formulate a statistical model fitting to the accident data in order to estimate the interchange ramp junction accident frequency of rural highways.

A Comparison of Confidence Intervals for the Difference of Proportions (모비율 차이의 신뢰구간들에 대한 비교연구)

  • 정형철;전명식;김대학
    • The Korean Journal of Applied Statistics
    • /
    • v.16 no.2
    • /
    • pp.377-393
    • /
    • 2003
  • Several confidence interval estimates for the difference of two binomial proportions were introduced. Bootstrap confidence interval is also suggested. We examined the over estimation property of approximate intervals and under estimation trend of exact intervals for the difference of proportions. We compared these confidence intervals based on the average coverage probability, expected width and skewness measure. Particularly actual coverage probability were calculated by using the prior distribution of parameters. Monte Carlo simulation for small sample size is conducted. Some interesting contour plots of average coverage probability and marginal plots for several interval estimates are presented.

Fast block matching algorithm for constrained one-bit transform-based motion estimation using binomial distribution (이항 분포를 이용한 제한된 1비트 변환 움직임 예측의 고속 블록 정합 알고리즘)

  • Park, Han-Jin;Choi, Chang-Ryoul;Jeong, Je-Chang
    • Journal of Broadcast Engineering
    • /
    • v.16 no.5
    • /
    • pp.861-872
    • /
    • 2011
  • Many fast block-matching algorithms (BMAs) in motion estimation field reduce computational complexity by screening the number of checking points. Although many fast BMAs reduce computations, sometimes they should endure matching errors in comparison with full-search algorithm (FSA). In this paper, a novel fast BMA for constrained one-bit transform (C1BT)-based motion estimation is proposed in order to decrease the calculations of the block distortion measure. Unlike the classical fast BMAs, the proposed algorithm shows a new approach to reduce computations. It utilizes the binomial distribution based on the characteristic of binary plane which is composed of only two elements: 0 and 1. Experimental results show that the proposed algorithm keeps its peak signal-to-noise ratio (PSNR) performance very close to the FSA-C1BT while the computation complexity is reduced considerably.

Rule-Based Classification Analysis Using Entropy Distribution (엔트로피 분포를 이용한 규칙기반 분류분석 연구)

  • Lee, Jung-Jin;Park, Hae-Ki
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.4
    • /
    • pp.527-540
    • /
    • 2010
  • Rule-based classification analysis is widely used for massive datamining because it is easy to understand and its algorithm is uncomplicated. In this classification analysis, majority vote of rules or weighted combination of rules using their supports are frequently used in order to combine rules. We propose a method to combine rules by using the multinomial distribution in this paper. Iterative proportional fitting algorithm is used to estimate the multinomial distribution which maximizes entropy constrained on rules' support. Simulation experiments show that this method can compete with other well known classification models in the case of two similar populations.

Log-density Ratio with Two Predictors in a Logistic Regression Model (로지스틱 회귀모형에서 이변량 정규분포에 근거한 로그-밀도비)

  • Kahng, Myung Wook;Yoon, Jae Eun
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.1
    • /
    • pp.141-149
    • /
    • 2013
  • We present methods for studying the log-density ratio that enables the selection of the predictors and the form to be included in the logistic regression model. Under bivariate normal distributional assumptions, we investigate the form of the log-density ratio as a function of two predictors. If two covariance matrices are equal, then the crossproduct and quadratic terms are not needed. If the variables are uncorrelated, we do not need the crossproduct terms, but we still need the linear and quadratic terms. We also explore other conditions in which the crossproduct and quadratic terms are not needed in the logistic regression model.