• Title/Summary/Keyword: probability distributions

Search Result 744, Processing Time 0.024 seconds

Nonstationary Frequency Analysis of Hydrologic Extreme Variables Considering of Seasonality and Trend (계절성과 경향성을 고려한 극치수문자료의 비정상성 빈도해석)

  • Lee, Jeong-Ju;Kwon, Hyun-Han;Moon, Young-Il
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2010.05a
    • /
    • pp.581-585
    • /
    • 2010
  • This study introduced a Bayesian based frequency analysis in which the statistical trend seasonal analysis for hydrologic extreme series is incorporated. The proposed model employed Gumbel and GEV extreme distribution to characterize extreme events and a fully coupled bayesian frequency model was finally utilized to estimate design rainfalls in Seoul. Posterior distributions of the model parameters in both trend and seasonal analysis were updated through Markov Chain Monte Carlo Simulation mainly utilizing Gibbs sampler. This study proposed a way to make use of nonstationary frequency model for dynamic risk analysis, and showed an increase of hydrologic risk with time varying probability density functions. In addition, full annual cycle of the design rainfall through seasonal model could be applied to annual control such as dam operation, flood control, irrigation water management, and so on. The proposed study showed advantage in assessing statistical significance of parameters associated with trend analysis through statistical inference utilizing derived posterior distributions.

  • PDF

A Note on Approximation of Bottled Water Consumption Distribution: A Mixture Model (혼합모형을 이용한 생수소비 분포의 근사화에 대한 소고(小考))

  • Yoo, Seung-Hoon
    • Environmental and Resource Economics Review
    • /
    • v.11 no.2
    • /
    • pp.321-333
    • /
    • 2002
  • Approximating bottled water consumption distribution is complicated by zero observations in the sample. To deal with the zero observations, a mixture model of bottled water consumption distributions is proposed and applied to allow a point mass at zero. The bottled water consumption distribution is specified as a mixture of two distributions, one with a point mass at zero and the other with full support on the positive half of the real line. The model is empirically verified for household bottled water consumption survey data. The mixture model can easily capture the common bimodality feature of the bottled water consumption distribution. In addition, when covariates were added to the model, it was found that the probability that a household has non-consumption significantly varies with some variables.

  • PDF

Effects on Regression Estimates under Misspecified Generalized Linear Mixed Models for Counts Data

  • Jeong, Kwang Mo
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.6
    • /
    • pp.1037-1047
    • /
    • 2012
  • The generalized linear mixed model(GLMM) is widely used in fitting categorical responses of clustered data. In the numerical approximation of likelihood function the normality is assumed for the random effects distribution; subsequently, the commercial statistical packages also routinely fit GLMM under this normality assumption. We may also encounter departures from the distributional assumption on the response variable. It would be interesting to investigate the impact on the estimates of parameters under misspecification of distributions; however, there has been limited researche on these topics. We study the sensitivity or robustness of the maximum likelihood estimators(MLEs) of GLMM for counts data when the true underlying distribution is normal, gamma, exponential, and a mixture of two normal distributions. We also consider the effects on the MLEs when we fit Poisson-normal GLMM whereas the outcomes are generated from the negative binomial distribution with overdispersion. Through a small scale Monte Carlo study we check the empirical coverage probabilities of parameters and biases of MLEs of GLMM.

Augmentation of Hidden Markov Chain for Complex Sequential Data in Context

  • Sin, Bong-Kee
    • Journal of Multimedia Information System
    • /
    • v.8 no.1
    • /
    • pp.31-34
    • /
    • 2021
  • The classical HMM is defined by a parameter triple �� = (��, A, B), where each parameter represents a collection of probability distributions: initial state, state transition and output distributions in order. This paper proposes a new stationary parameter e = (e1, e2, …, eN) where N is the number of states and et = P(|xt = i, y) for describing how an input pattern y ends in state xt = i at time t followed by nothing. It is often said that all is well that ends well. We argue here that all should end well. The paper sets the framework for the theory and presents an efficient inference and training algorithms based on dynamic programming and expectation-maximization. The proposed model is applicable to analyzing any sequential data with two or more finite segmental patterns are concatenated, each forming a context to its neighbors. Experiments on online Hangul handwriting characters have proven the effect of the proposed augmentation in terms of highly intuitive segmentation as well as recognition performance and 13.2% error rate reduction.

Different estimation methods for the unit inverse exponentiated weibull distribution

  • Amal S Hassan;Reem S Alharbi
    • Communications for Statistical Applications and Methods
    • /
    • v.30 no.2
    • /
    • pp.191-213
    • /
    • 2023
  • Unit distributions are frequently used in probability theory and statistics to depict meaningful variables having values between zero and one. Using convenient transformation, the unit inverse exponentiated weibull (UIEW) distribution, which is equally useful for modelling data on the unit interval, is proposed in this study. Quantile function, moments, incomplete moments, uncertainty measures, stochastic ordering, and stress-strength reliability are among the statistical properties provided for this distribution. To estimate the parameters associated to the recommended distribution, well-known estimation techniques including maximum likelihood, maximum product of spacings, least squares, weighted least squares, Cramer von Mises, Anderson-Darling, and Bayesian are utilised. Using simulated data, we compare how well the various estimators perform. According to the simulated outputs, the maximum product of spacing estimates has lower values of accuracy measures than alternative estimates in majority of situations. For two real datasets, the proposed model outperforms the beta, Kumaraswamy, unit Gompartz, unit Lomax and complementary unit weibull distributions based on various comparative indicators.

Wave Analysis Method for Offshore Wind Power Design Suitable for Suitable for Ulsan Area

  • Woobeom Han;Kanghee Lee;Seungjae Lee
    • New & Renewable Energy
    • /
    • v.20 no.2
    • /
    • pp.2-16
    • /
    • 2024
  • Various loads induced by marine environmental conditions, such as waves, currents, and wind, are crucial for the operation and viability of offshore wind power (OWP) systems. In particular, waves have a significant impact on the stress and fatigue load of offshore structures, and highly reliable design parameters should be derived through extreme value analysis (EVA) techniques. In this study, extreme wave analyses were conducted with various Weibull distribution models to determine the reliable design parameters of an OWP system suitable for the Ulsan area. Forty-three years of long-term hindcast data generated by a numerical wave model were adopted as the analyses data, and the least-squares method was used to estimate the parameters of the distribution function for EVA. The inverse first-order reliability method was employed as the EVA technique. The obtained results were compared among themselves under the assumption that the marginal probability distributions were 2p, 3p, and exponentiated Weibull distributions.

MODIFIED GEOMETRIC DISTRIBUTION OF ORDER k AND ITS APPLICATIONS

  • JUNGTAEK OH;KYEONG EUN LEE
    • Journal of applied mathematics & informatics
    • /
    • v.42 no.3
    • /
    • pp.709-723
    • /
    • 2024
  • We study the distributions of waiting times in variations of the geometric distribution of order k. Variation imposes length on the runs of successes and failures. We study two types of waiting time random variables. First, we consider the waiting time for a run of k consecutive successes the first time no sequence of consecutive k failures occurs prior, denoted by T(k). Next, we consider the waiting time for a run of k consecutive failures the first time no sequence of k consecutive successes occurred prior, denoted by J(k). In addition, we study the distribution of the weighted average. The exact formulae of the probability mass function, mean, and variance of distributions are also obtained.

Distribution of Runs and Patterns in Four State Trials

  • Jungtaek Oh
    • Kyungpook Mathematical Journal
    • /
    • v.64 no.2
    • /
    • pp.287-301
    • /
    • 2024
  • From the mathematical and statistical point of view, a segment of a DNA strand can be viewed as a sequence of four-state (A, C, G, T) trials. Herein, we consider the distributions of runs and patterns related to the run lengths of multi-state sequences, especially for four states (A, B, C, D). Let X1, X2, . . . be a sequence of four state independent and identically distributed trials taking values in the set 𝒢 = {A, B, C, D}. In this study, we obtain exact formulas for the probability distribution function for the discrete distribution of runs of B's of order k. We obtain longest run statistics, shortest run statistics, and determine the distributions of waiting times and run lengths.

An Evaluation of Probabilistic Strain-Life Curve in Polyacetal (폴리아세탈 소재의 확률론적 변형률-수명선도 평가)

  • Jang, Cheon-Soo;Kim, Chul-Su;Park, Bum-Gyu;Kim, Jung-Kyu
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.30 no.11 s.254
    • /
    • pp.1417-1424
    • /
    • 2006
  • In order to evaluate variation of fatigue life of mechanical components including engineering plastics, it is important to estimate probabilistic strain-life curves to accurately define the variation of fatigue characteristics. This paper intends to provide new assessment of P-$\varepsilon$-N (probabilistic strain-life curves) for considering the variation of fatigue characteristics in polyacetal. The fatigue strain controlled tests were conducted under constant 50% humidity and room temperature condition by a universal testing machine at strain ratio, R=0. A practical procedure is introduced to evaluate probabilistic strain-life curves. Three probabilistic distributions were used for generating P-$\varepsilon$-N curves such as normal, 2-parameter and 3-parameter Weibull. In this study, 3-parameter Weibull distribution was found to be most appropriate among assumed distributions when the probability distributions of the fatigue characteristic were examined using chi-square and Kolmogorov-Smirnov test. The more appropriate P-$\varepsilon$-N curves for these materials are generated by the proposed method considering 3-parameter Weibull distribution.

Reliability Analysis for Nonnormal Distributions Using Multi-Level DOE (다수준 실험계획법을 이용한 비정규 분포의 신뢰도 계산 방법)

  • Choi, Hyun-Seok;Lee, Sang-Hoon;Kwak, Byung-Man
    • Proceedings of the KSME Conference
    • /
    • 2004.11a
    • /
    • pp.840-845
    • /
    • 2004
  • The reliability analysis for nonnormal distributions using the three level DOE(design of experiments) method was developed by Seo and Kwak in 2002. Although this method estimates only up to the first four moments(mean, standard deviation, skewness, and kurtosis) of the system response function, the result and the type of probability distribution determined by using the Pearson system are shown very good. However the accuracy is low in case of nonlinear performance function and sometimes, the level calculated is outside of the region in which the random variable is defined. In this article we suggest a modified three level DOE method to overcome these weaknesses and to obtain optimum choice for 3 levels and weights to handle nonnormal distributions. Furthermore we extend it to finding the optimum choice for 5 levels and weights to increase the accuracy in case of nonlinear performance function. A systematic procedure for reliability analysis is then proposed by using the Pearson system.

  • PDF