• Title/Summary/Keyword: 이항자료

Search Result 241, Processing Time 0.021 seconds

Maximum likelihood estimation for a mixture distribution (이항-퇴화 혼합분포의 최우추정법)

  • Hwang, Seonyeong;Sohn, Seung Hye;Oh, Changhyuck
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.2
    • /
    • pp.313-322
    • /
    • 2015
  • A mixture distribution of a discrete uniform or degenerated distribution and two binomial distribution is proposed and a method of obtaining the maximum likelihood estimates of the parameters is provided. For the proposed model simulation studies were conducted to see performance of the maximum likelihood estimates and a mixture of a degenerated distribution and two binomial distributions was applied to fit a lecture evaluation data to show a good result.

Fuzzy Binomial Proportion Test by Agreement Index (동의지수에 의한 퍼지 이항비률 검정)

  • Kang, Man-Ki;Park, Young-Rye
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.19 no.1
    • /
    • pp.19-24
    • /
    • 2009
  • We propose some properties for fuzzy binomial proportion test by agreement index. First we define fuzzy probability space and fuzzy type I error and type II error for the fuzzy probability of the two type errors. Also, we show that a fuzzy power function of performance for a fuzzy hypothesis test and drawing conclusions from the test.

Development of a Binomial Sampling Plan for Bemisia tabaci in Paprika Greenhouses (파프리카온실에서 담배가루이의 이항표본조사법 개발)

  • Kang, Juwan;Choi, Wonseok;Park, Jung-Joon
    • Korean journal of applied entomology
    • /
    • v.55 no.4
    • /
    • pp.405-412
    • /
    • 2016
  • Infestation of adults and pupae of sweetpotato whitefly, Bemisia tabaci, on paprika (Capsicum annuum var. angulosum) grown in greenhouses in Jinju, Gyeongnam province during 2014was determined by counts of the number of target stage of B. tabaci per leaflet. Binomial sampling plans were developed based on the relationship between the mean density per leaflet (m) and the proportion of leaflets infested with less than T whiteflies ($P_T$), according to the empirical model $(({\ln}(m)={\alpha}+{\beta}({\ln}(-{\ln}(1-P_T))))$. T was defined as the tally threshold, and set to 1, 2, 3, 4, 5 (adults) and 1, 3, 5, 7 (pupae) per leaflet in this study. Increasing the sample size, regardless of tally threshold, had little effect on the precision of the binomial sampling plan. Based on the precision of the model, T = 1 was chosen as the best tally threshold for estimating densities of B. tabaci adults and T = 3 was best tally threshold in B. tabaci pupae. Using the results obtained in the greenhouse, a simulated validation of the developed sampling plan by RVSP (Resampling Validation for Sampling Plan) demonstrated the plan's validity. Above all, the binomial model with T = 1 and T = 3 provided reliable predictions of the mean densities of B. tabaci adults and pupae on greenhouse paprika.

CUSUM charts for monitoring type I right-censored lognormal lifetime data (제1형 우측중도절단된 로그정규 수명 자료를 모니터링하는 누적합 관리도)

  • Choi, Minjae;Lee, Jaeheon
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.5
    • /
    • pp.735-744
    • /
    • 2021
  • Maintaining the lifetime of a product is one of the objectives of quality control. In real processes, most samples are constructed with censored data because, in many situations, we cannot measure the lifetime of all samples due to time or cost problems. In this paper, we propose two cumulative sum (CUSUM) control charting procedures to monitor the mean of type I right-censored lognormal lifetime data. One of them is based on the likelihood ratio, and the other is based on the binomial distribution. Through simulations, we evaluate the performance of the two proposed procedures by comparing the average run length (ARL). The overall performance of the likelihood ratio CUSUM chart is better, especially this chart performs better when the censoring rate is low and the shape parameter value is small. Conversely, the binomial CUSUM chart is shown to perform better when the censoring rate is high, the shape parameter value is large, and the change in the mean is small.

Bayesian ordinal probit semiparametric regression models: KNHANES 2016 data analysis of the relationship between smoking behavior and coffee intake (베이지안 순서형 프로빗 준모수 회귀 모형 : 국민건강영양조사 2016 자료를 통한 흡연양태와 커피섭취 간의 관계 분석)

  • Lee, Dasom;Lee, Eunji;Jo, Seogil;Choi, Taeryeon
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.1
    • /
    • pp.25-46
    • /
    • 2020
  • This paper presents ordinal probit semiparametric regression models using Bayesian Spectral Analysis Regression (BSAR) method. Ordinal probit regression is a way of modeling ordinal responses - usually more than two categories - by connecting the probability of falling into each category explained by a combination of available covariates using a probit (an inverse function of normal cumulative distribution function) link. The Bayesian probit model facilitates posterior sampling by bringing a latent variable following normal distribution, therefore, the responses are categorized by the cut-off points according to values of latent variables. In this paper, we extend the latent variable approach to a semiparametric model for the Bayesian ordinal probit regression with nonparametric functions using a spectral representation of Gaussian processes based BSAR method. The latent variable is decomposed into a parametric component and a nonparametric component with or without a shape constraint for modeling ordinal responses and predicting outcomes more flexibly. We illustrate the proposed methods with simulation studies in comparison with existing methods and real data analysis applied to a Korean National Health and Nutrition Examination Survey (KNHANES) 2016 for investigating nonparametric relationship between smoking behavior and coffee intake.

Relationship between Interstate Highway Accidents and Heterogeneous Geometrics by Random Parameter Negative Binomial Model - A case of Interstate Highway in Washington State, USA (확률적 모수를 고려한 음이항모형에 의한 교통사고와 기하구조와의 관계 - 미국 워싱턴 주(州) 고속도로를 중심으로)

  • Park, Minho
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.33 no.6
    • /
    • pp.2437-2445
    • /
    • 2013
  • The objective of this study is finding the relationship between interstate highway accident frequencies and geometrics using Random Parameter Negative Binomial model. Even though it is impossible to take account of the same design criteria to the all segments or corridors on the road in reality, previous research estimated the fixed value of coefficients without considering each segment's characteristic. The drawback of the traditional negative binomial is not to explain the integrated variations in terms of time and the distinct characters specific segment has. This results in under-estimation of the standard error which inflates the t-value and finally, affects the modeling estimation. Therefore, this study tries to find the relationship of accident frequencies with the heterogeneous geometrics using 9-years and 7-interstate highway data in Washington State area. 16-types of geometrics are used to derive the model which is compared with the traditional negative binomial Model to understand which Model is more suitable. In addition, by calculating marginal effect and elasticity, heterogeneous variables' effect to the accidents are estimated. Hopefully, this study will help to estiblish the future policy of geometrics.

Analysis of Stress level of Korean Household Members due to Household Debt (한국국민의 가계 금융부채에 대한 체감도 분석)

  • Oh, Man-Suk;Hyun, Seung-Me
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.2
    • /
    • pp.297-307
    • /
    • 2009
  • Korean household debt is one of the main sources of the current financial crisis. This paper studies the impact of household members' attributes such as a type of housing(self-own or rent), education, age, average monthly income of the head of household, and the area of residence, on the stress level of the household members due to household debt. We analyze a real data set collected by KB Kookmin Bank in 2004. We consider low and high stress level as a binary response variable and use a logistic regression model with the attributes of household members as explanatory variables. A simple but well-fitting model is selected by backward elimination method based on the likelihood statistic for goodness-of-fit test, and the impact of the attributes on the stress level is studied from parameter estimates of the selected model. We also perform the similar analysis on a binary response variable which distinguishes households with no debt from the rest. From the analysis, the stress level tends to be low for households with self-own houses, high average monthly income, low education level, and young members.

A Composite Trend Test with Symptom Occurrence and Severity Symptom Scores (증상 발현과 증상 심각성을 병합한 추세검정법)

  • Choi, Se-Mi;Yang, Soo;Song, Hae-Hiang
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.6
    • /
    • pp.1045-1054
    • /
    • 2011
  • During clinical trials a researcher is frequently able to observe a disease symptom in a subject as well as a severity score for those who experienced a symptom after a fixed length of treatment. The traditional method to evaluate a decreasing trend in proportion, when there is an intrinsic order in the treatment groups (for example control and two or more treatment groups) is a Cochran-Armitage test, while the method to evaluate a decreasing trend in continuous non-normal data is a Jonckheere-Tersptra test. The Cochran-Armitage test emphasizes the dichotomous data of symptom occurrence and the Jonckheere-Tersptra test emphasizes the continuous non-normal data of severity symptom scores. In this paper we propose new test statistics that consider the combined evidence from a symptom occurrence and disease severity score. We illustrate these methods with example data of schizophrenic inpatients that demonstrated antipsychotic-drug induced constipation. A small-scale simulation is conducted to compare the new trend tests with other trend tests.

Generalized Linear Mixed Model for Multivariate Multilevel Binomial Data (다변량 다수준 이항자료에 대한 일반화선형혼합모형)

  • Lim, Hwa-Kyung;Song, Seuck-Heun;Song, Ju-Won;Cheon, Soo-Young
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.6
    • /
    • pp.923-932
    • /
    • 2008
  • We are likely to face complex multivariate data which can be characterized by having a non-trivial correlation structure. For instance, omitted covariates may simultaneously affect more than one count in clustered data; hence, the modeling of the correlation structure is important for the efficiency of the estimator and the computation of correct standard errors, i.e., valid inference. A standard way to insert dependence among counts is to assume that they share some common unobservable variables. For this assumption, we fitted correlated random effect models considering multilevel model. Estimation was carried out by adopting the semiparametric approach through a finite mixture EM algorithm without parametric assumptions upon the random coefficients distribution.

Estimating the Economic Value of Recreation Sea Fishing in the Yellow Sea: An Application of Count Data Model (가산자료모형을 이용한 서해 태안군 유어객의 편익추정)

  • Choi, Jong Du
    • Environmental and Resource Economics Review
    • /
    • v.23 no.2
    • /
    • pp.331-347
    • /
    • 2014
  • The purpose of this study is to estimate the economic value of the recreational sea fishing in the Yellow Sea using count data model. For estimating consumer surplus, we used several count data model of travel cost recreation demand such as a poisson model(PM), a negative binomial model(NBM), a truncated poisson model(TPM), and a truncated negative binomial model(TNBM). Model results show that there is no exist the over-dispersion problem and a NBM was statistically more suitable than the other models. All parameters estimated are statistically significant and theoretically valid. The NBM was applied to estimate the travel demand and consumer surplus. The consumer surplus pre trip was estimated to be 254,453won, total consumer surplus per person and per year 1,536,896won.