• 제목/요약/키워드: conditional distribution

검색결과 295건 처리시간 0.02초

한국어 다음절 단어의 초성, 중성, 종성단위의 음절간 조건부 확률 (Conditional Probability of a 'Choseong', a 'Jungseong', and a 'Jongseong' Between Syllables in Multi-Syllable Korean Words)

  • 이재홍;이재학
    • 전자공학회논문지B
    • /
    • 제28B권9호
    • /
    • pp.692-703
    • /
    • 1991
  • A Korean word is composed of syllables. A Korean syllable is regarded as a random variable according to its probabilistic property in occurrence. A Korean syllable is divided into 'choseong', 'jungseong', and 'jongseong' which are regarded as random variables. We can consider teh conditional probatility of syllable as an index which represents the occurrence correlation between syllables in Korean words. Since the number of syllables is enormous, we use the conditional probability of a' choseong', a 'jungseong', and a 'jongseong' between syllables as an index which represents the occurrence correlation between syllables in Korean words. The length distribution of Korean woeds is computed according to frequency and to kind. Form the cumulative frequency of a Korean syllable computed from multi-syllable Korean woeds, all probabilities and conditiona probabilities are computed for the three random variables. The conditional probabilities of 'choseong'- 'choseong', 'jungseong'- 'jungseong', 'jongseong'-'jongseong', 'jongseong'-'choseong' between adjacent syllables in multi-syllable Korean woeds are computed.

  • PDF

A Note on Performance of Conditional Akaike Information Criteria in Linear Mixed Models

  • Lee, Yonghee
    • Communications for Statistical Applications and Methods
    • /
    • 제22권5호
    • /
    • pp.507-518
    • /
    • 2015
  • It is not easy to select a linear mixed model since the main interest for model building could be different and the number of parameters in the model could not be clearly defined. In this paper, performance of conditional Akaike Information Criteria and its bias-corrected version are compared with marginal Bayesian and Akaike Information Criteria through a simulation study. The results from the simulation study indicate that bias-corrected conditional Akaike Information Criteria shows promising performance when candidate models exclude large models containing the true model, but bias-corrected one prefers over-parametrized models more intensively when a set of candidate models increases. Marginal Bayesian and Akaike Information Criteria also have some difficulty to select the true model when the design for random effects is nested.

비모수적 코플라를 이용한 반복측정 이변량 자료의 조건부 결합 분포 추정 (Estimation of the joint conditional distribution for repeatedly measured bivariate cholesterol data using nonparametric copula)

  • 곽민정
    • Journal of the Korean Data and Information Science Society
    • /
    • 제27권3호
    • /
    • pp.689-700
    • /
    • 2016
  • 본 논문에서는 이변량 경시적 자료의 조건부 결합 분포를 추정하기 위하여 회귀 모형과 코플라 모형을 연구하였다. 주변 분포의 추정을 위하여 시변 전환 모형을 고려하였고, 이변량 반응변수 각각에 대한 주변 분포를 경험 분포를 이용한 비모수적 코플라를 이용하여 결합하여 조건부 결합 분포를 추정하였다. 주변 분포 모형의 모수 추정치는 추정방정식의 해로 얻어낼 수 있으며 우리가 제안한 모형은 조건부 평균 모형만으로 자료를 설명하기 어려운 경우에 적용될 수 있다. 시변 전환 모형과 비모수적 코플라 모형을 결합한 본 논문의 방법은 반복 측정된 이변량 경시적 자료에 대한 모형화가 모형에 대한 가정에서 비교적 자유로운 장점이 있다. 우리는 본 논문의 방법을 반복 측정된 이변량 콜레스테롤 자료를 분석하는데 적용하여 보았다.

Pointwise Estimation of Density of Heteroscedastistic Response in Regression

  • Hyun, Ji-Hoon;Kim, Si-Won;Lee, Sung-Dong;Byun, Wook-Jae;Son, Mi-Kyoung;Kim, Choong-Rak
    • 응용통계연구
    • /
    • 제25권1호
    • /
    • pp.197-203
    • /
    • 2012
  • In fitting a regression model, we often encounter data sets which do not follow Gaussian distribution and/or do not have equal variance. In this case estimation of the conditional density of a response variable at a given design point is hardly solved by a standard least squares method. To solve this problem, we propose a simple method to estimate the distribution of the fitted vales under heteroscedasticity using the idea of quantile regression and the histogram techniques. Application of this method to a real data sets is given.

Tutorial: Dimension reduction in regression with a notion of sufficiency

  • Yoo, Jae Keun
    • Communications for Statistical Applications and Methods
    • /
    • 제23권2호
    • /
    • pp.93-103
    • /
    • 2016
  • In the paper, we discuss dimension reduction of predictors ${\mathbf{X}}{\in}{{\mathbb{R}}^p}$ in a regression of $Y{\mid}{\mathbf{X}}$ with a notion of sufficiency that is called sufficient dimension reduction. In sufficient dimension reduction, the original predictors ${\mathbf{X}}$ are replaced by its lower-dimensional linear projection without loss of information on selected aspects of the conditional distribution. Depending on the aspects, the central subspace, the central mean subspace and the central $k^{th}$-moment subspace are defined and investigated as primary interests. Then the relationships among the three subspaces and the changes in the three subspaces for non-singular transformation of ${\mathbf{X}}$ are studied. We discuss the two conditions to guarantee the existence of the three subspaces that constrain the marginal distribution of ${\mathbf{X}}$ and the conditional distribution of $Y{\mid}{\mathbf{X}}$. A general approach to estimate them is also introduced along with an explanation for conditions commonly assumed in most sufficient dimension reduction methodologies.

coin 패키지를 이용한 독립성 검정 (Independence tests using coin package in R)

  • 김진흠;이정동
    • Journal of the Korean Data and Information Science Society
    • /
    • 제25권5호
    • /
    • pp.1039-1055
    • /
    • 2014
  • 검정통계량의 영가설 분포는 모집단 분포에 의존하는데 모집단의 분포를 모를 때 영가설 분포를 검정통계량의 조건부 분포로 대체하여 검정하는 방법을 순열 검정이라고 한다. Strasser와 Weber (1999)는 순열 검정을 통합하는 이론을 마련하였고, Hothorn 등 (2006, 2008)은 그 이론을 R에 내장된 coin 패키지에 구현하였다. coin 패키지에서 조건부 독립성 검정은 총괄적인 형태의 함수인 independence test를 통해서 할 수 있지만 대표적인 독립성 검정은 사용자가 편리하도록 간편한 함수를 별도로 제공하고 있다. 본 논문에서는 Strasser와 Weber (1999)의 순열 검정 방법에 대해 소개하고, coin 패키지에 내장된 15개의 간편 함수에 대해 independence test 함수로 변환하는 절차를 설명하고자 한다. 또한, 정의한 independence test 함수를 써서 실제 자료의 점근 분포와 순열 검정, 정확 검정에 기초한 p-값을 서로 비교하고자 한다.

RECURRENCE RELATIONS FOR QUOTIENT MOMENTS OF GENERALIZED PARETO DISTRIBUTION BASED ON GENERALIZED ORDER STATISTICS AND CHARACTERIZATION

  • Kumar, Devendra
    • 충청수학회지
    • /
    • 제27권3호
    • /
    • pp.347-361
    • /
    • 2014
  • Generalized Pareto distribution play an important role in reliability, extreme value theory, and other branches of applied probability and statistics. This family of distribution includes exponential distribution, Pareto or Lomax distribution. In this paper, we established exact expressions and recurrence relations satised by the quotient moments of generalized order statistics for a generalized Pareto distribution. Further the results for quotient moments of order statistics and records are deduced from the relations obtained and a theorem for characterizing this distribution is presented.

QUOTIENT MOMENTS OF THE ERLANG-TRUNCATED EXPONENTIAL DISTRIBUTION BASED ON RECORD VALUES AND A CHARACTERIZATION

  • Kumar, Devendra
    • Journal of applied mathematics & informatics
    • /
    • 제32권1_2호
    • /
    • pp.7-16
    • /
    • 2014
  • Erlang-truncated exponential distribution is widely used in the field of queuing system and stochastic processes. This family of distribution include exponential distribution. In this paper we establish some exact expression and recurrence relations satisfied by the quotient moments and conditional quotient moments of the upper record values from the Erlang-truncated exponential distribution. Further a characterization of this distribution based on recurrence relations of quotient moments of record values is presented.

The Impact of COVID-19, Day-of-the-Week Effect, and Information Flows on Bitcoin's Return and Volatility

  • LIU, Ying Sing;LEE, Liza
    • The Journal of Asian Finance, Economics and Business
    • /
    • 제7권11호
    • /
    • pp.45-53
    • /
    • 2020
  • Past literatures have not studied the impact of real-world events or information on the return and volatility of virtual currencies, particularly on the COVID-19 event, day-of-the-week effect, daily high-low price spreads and information flow rate. The study uses the ARMA-GARCH model to capture Bitcoin's return and conditional volatility, and explores the impact of information flow rate on conditional volatility in the Bitcoin market based on the Mixture Distribution Hypothesis (Clark, 1973). There were 3,064 samples collected during the period from 1st of January 2012 to 20th April, 2020. Empirical results show that in the Bitcoin market, a daily high-low price spread has a significant inverse relationship for daily return, and information flow rate has a significant positive relationship for condition volatility. The study supports a significant negative relationship between information asymmetry and daily return, and there is a significant positive relationship between daily trading volume and condition volatility. When Bitcoin trades on Saturday & Sunday, there is a significant reverse relationship for conditional volatility and there exists a day-of-the-week volatility effect. Under the impact of COVID-19 event, Bitcoin's condition volatility has increased significantly, indicating the risk of price changes. Finally, the Bitcoin's return has no impact on COVID-19 events and holidays (Saturday & Sunday).

CONCAVITY OF THE CONDITIONAL MEAN SOJOURN TIME IN THE PROCESSOR-SHARING QUEUE WITH BATCH ARRIVALS

  • Kim, Jeong-Sim
    • 대한수학회보
    • /
    • 제47권6호
    • /
    • pp.1251-1258
    • /
    • 2010
  • For an M/G/1 processor-sharing queue with batch arrivals, Avrachenkov et al. [1] conjectured that the conditional mean sojourn time is concave. However, Kim and Kim [5] showed that this conjecture is not true in general. In this paper, we show that this conjecture is true if the service times have a hyperexponential distribution.