• Title/Summary/Keyword: data distributions

Search Result 2,607, Processing Time 0.028 seconds

Inverse Estimation of Fatigue Life Parameters of Springs Based on the Bayesian Approach (베이지안 접근법을 이용한 스프링 피로 수명 파라미터의 역 추정)

  • Heo, Chan-Young;An, Da-Wn;Won, Jun-Ho;Choi, Joo-Ho
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.35 no.4
    • /
    • pp.393-400
    • /
    • 2011
  • In this study, a procedure for the inverse estimation of the fatigue life parameters of springs which utilize the field fatigue life test data is proposed to replace real test with the FEA on fatigue life prediction. The Bayesian approach is employed, in which the posterior distributions of the parameters are determined conditional on the accumulated life data that are routinely obtained from the regular tests. In order to obtain the accurate samples from the distributions, the Markov chain Monte Carlo (MCMC) technique is employed. The distributions of the parameters are used in the FEA for predicting the fatigue life in the form of a predictive interval. The results show that the actual fatigue life data are found well within the posterior predictive distributions.

A Study of Outlier Detection Using the Mixture of Extreme Distributions Based on Deep-Sea Fishery Data (원양어선 조업 데이터의 혼합 극단분포를 이용한 이상점 탐색 연구)

  • Lee, Jung Jin;Kim, Jae Kyoung
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.5
    • /
    • pp.847-858
    • /
    • 2015
  • Deep-sea fishery in the Antarctic Ocean has been actively progressed by the developed countries including Korea. In order to prevent the environmental destruction of the Antarctic Ocean, related countries have established the Commission for the Conservation of Antarctic Marine Living Resources (CCAMLR) and have monitored any illegal unreported or unregulated fishing. Fishing of tooth fish, an expensive fish, in the Antarctic Ocean has increased recently and high catches per unit effort (CPUE) of fishing boats, which is suspicious for an illegal activity, have been frequently reported. The data of CPUEs in a fishing area of the Antarctic Ocean often show an extreme Distribution or a mixture of two extreme distributions. This paper proposes an algorithm to detect an outlier of CPUEs by using the mixture of two extreme distributions. The parameters of the mixture distribution are estimated by the EM algorithm. Log likelihood value and posterior probabilities are used to detect an outlier. Experiments show that the proposed algorithm to detect outlier of the data can be adopted instead of simple criteria such as a CPUE is greater than 1.

Direction Estimation of Multiple Sound Sources Using Circular Probability Distributions (순환 확률분포를 이용한 다중 음원 방향 추정)

  • Nam, Seung-Hyon;Kim, Yong-Hoh
    • The Journal of the Acoustical Society of Korea
    • /
    • v.30 no.6
    • /
    • pp.308-314
    • /
    • 2011
  • This paper presents techniques for estimating directions of multiple sound sources ranging from $0^{\circ}$ to $360^{\circ}$ using circular probability distributions having a periodic property. Phase differences containing direction information of sources can be modeled as mixtures of multiple probability distributions and source directions can be estimated by maximizing log-likelihood functions. Although the von Mises distribution is widely used for analyzing this kind of periodic data, we define a new class of circular probability distributions from Gaussian and Laplacian distributions by adopting a modulo operation to have $2{\pi}$-periodicity. Direction estimation with these circular probability distributions is done by implementing corresponding EM (Expectation-Maximization) algorithms. Simulation results in various reverberant environments confirm that Laplacian distribution provides better performance than von Mises and Gaussian distributions.

Use of Lèvy distribution to analyze longitudinal data with asymmetric distribution and presence of left censored data

  • Achcar, Jorge A.;Coelho-Barros, Emilio A.;Cuevas, Jose Rafael Tovar;Mazucheli, Josmar
    • Communications for Statistical Applications and Methods
    • /
    • v.25 no.1
    • /
    • pp.43-60
    • /
    • 2018
  • This paper considers the use of classical and Bayesian inference methods to analyze data generated by variables whose natural behavior can be modeled using asymmetric distributions in the presence of left censoring. Our approach used a $L{\grave{e}}vy$ distribution in the presence of left censored data and covariates. This distribution could be a good alternative to model data with asymmetric behavior in many applications as lifetime data for instance, especially in engineering applications and health research, when some observations are large in comparison to other ones and standard distributions commonly used to model asymmetry data like the exponential, Weibull or log-logistic are not appropriate to be fitted by the data. Inferences for the parameters of the proposed model under a classical inference approach are obtained using a maximum likelihood estimators (MLEs) approach and usual asymptotical normality for MLEs based on the Fisher information measure. Under a Bayesian approach, the posterior summaries of interest are obtained using standard Markov chain Monte Carlo simulation methods and available software like SAS. A numerical illustration is presented considering data of thyroglobulin levels present in a group of individuals with differentiated cancer of thyroid.

Study of Methodology for Estimating PM10 Concentration of Asian Dust Using Visibility Data (시정자료를 이용한 황사의 미세먼지 농도추정 방법 연구)

  • Lee, Hyo-Jung;Lee, Eun-Hee;Lee, Sang-Sam;Kim, Seungbum
    • Atmosphere
    • /
    • v.22 no.1
    • /
    • pp.13-28
    • /
    • 2012
  • The $PM_{10}$ concentration data is useful for indentifying intensity and a transport way of Asian dust. However, it is difficult to identify them properly due to the limited spatial resolution and coverage. Therefore, a methodology to estimate $PM_{10}$ concentration using visibility data obtained from synoptic observation was developed. To derive the converting function, correlation between visibility and $PM_{10}$ concentration is investigated using visibility and $PM_{10}$ concentration data observed at 20 stations in Korea from 2005 to 2009. To minimize bias due to atmospheric moisture, data with higher relative humidity over a critical value were eliminated while deriving $PM_{10}$-visibility relationship. As a result, an exponentially decreasing function of visibility is obtained under the condition that relative humidity is less than 82%. Verification of the visibility converting function to $PM_{10}$ concentration was carried out for the dust cases in 2010. It was found that spatial distributions of $PM_{10}$ calculated by visibility are in good agreement with the observed $PM_{10}$ distribution, especially for the strong dust cases in 2010. And correlation between the derived and observed $PM_{10}$ concentration was 0.63. We applied the function to obtain distributions of $PM_{10}$ concentration over North Korea, in which concentration data are not available, and compared them with satellite derived dust index, IODI distributions for dust cases in 2010. It is shown that the visibility function estimates quite similar patterns of dust concentration with IODI image, which suggests that it can contribute for prediction by indentifying transport route of Asian dust.

Latent class model for mixed variables with applications to text data (혼합모드 잠재범주모형을 통한 텍스트 자료의 분석)

  • Shin, Hyun Soo;Seo, Byungtae
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.6
    • /
    • pp.837-849
    • /
    • 2019
  • Latent class models (LCM) are useful tools to draw hidden information from categorical data. This model can also be interpreted as a mixture model with multinomial component distributions. In some cases, however, an available dataset may contain both categorical and count or continuous data. For such cases, we can extend the LCM to a mixture model with both multinomial and other component distributions such as normal and Poisson distributions. In this paper, we consider a LCM for the data containing categorical and count data to analyze the Drug Review dataset which contains categorical responses and text review. From this data analysis, we show that we can obtain more specific hidden inforamtion than those from the LCM only with categorical responses.

Statistical analysis and probabilistic modeling of WIM monitoring data of an instrumented arch bridge

  • Ye, X.W.;Su, Y.H.;Xi, P.S.;Chen, B.;Han, J.P.
    • Smart Structures and Systems
    • /
    • v.17 no.6
    • /
    • pp.1087-1105
    • /
    • 2016
  • Traffic load and volume is one of the most important physical quantities for bridge safety evaluation and maintenance strategies formulation. This paper aims to conduct the statistical analysis of traffic volume information and the multimodal modeling of gross vehicle weight (GVW) based on the monitoring data obtained from the weigh-in-motion (WIM) system instrumented on the arch Jiubao Bridge located in Hangzhou, China. A genetic algorithm (GA)-based mixture parameter estimation approach is developed for derivation of the unknown mixture parameters in mixed distribution models. The statistical analysis of one-year WIM data is firstly performed according to the vehicle type, single axle weight, and GVW. The probability density function (PDF) and cumulative distribution function (CDF) of the GVW data of selected vehicle types are then formulated by use of three kinds of finite mixed distributions (normal, lognormal and Weibull). The mixture parameters are determined by use of the proposed GA-based method. The results indicate that the stochastic properties of the GVW data acquired from the field-instrumented WIM sensors are effectively characterized by the method of finite mixture distributions in conjunction with the proposed GA-based mixture parameter identification algorithm. Moreover, it is revealed that the Weibull mixture distribution is relatively superior in modeling of the WIM data on the basis of the calculated Akaike's information criterion (AIC) values.

Metropolis-Hastings Expectation Maximization Algorithm for Incomplete Data (불완전 자료에 대한 Metropolis-Hastings Expectation Maximization 알고리즘 연구)

  • Cheon, Soo-Young;Lee, Hee-Chan
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.1
    • /
    • pp.183-196
    • /
    • 2012
  • The inference for incomplete data such as missing data, truncated distribution and censored data is a phenomenon that occurs frequently in statistics. To solve this problem, Expectation Maximization(EM), Monte Carlo Expectation Maximization(MCEM) and Stochastic Expectation Maximization(SEM) algorithm have been used for a long time; however, they generally assume known distributions. In this paper, we propose the Metropolis-Hastings Expectation Maximization(MHEM) algorithm for unknown distributions. The performance of our proposed algorithm has been investigated on simulated and real dataset, KOSPI 200.

IMPLEMENTATION OF DATA ASSIMILATION METHODOLOGY FOR PHYSICAL MODEL UNCERTAINTY EVALUATION USING POST-CHF EXPERIMENTAL DATA

  • Heo, Jaeseok;Lee, Seung-Wook;Kim, Kyung Doo
    • Nuclear Engineering and Technology
    • /
    • v.46 no.5
    • /
    • pp.619-632
    • /
    • 2014
  • The Best Estimate Plus Uncertainty (BEPU) method has been widely used to evaluate the uncertainty of a best-estimate thermal hydraulic system code against a figure of merit. This uncertainty is typically evaluated based on the physical model's uncertainties determined by expert judgment. This paper introduces the application of data assimilation methodology to determine the uncertainty bands of the physical models, e.g., the mean value and standard deviation of the parameters, based upon the statistical approach rather than expert judgment. Data assimilation suggests a mathematical methodology for the best estimate bias and the uncertainties of the physical models which optimize the system response following the calibration of model parameters and responses. The mathematical approaches include deterministic and probabilistic methods of data assimilation to solve both linear and nonlinear problems with the a posteriori distribution of parameters derived based on Bayes' theorem. The inverse problem was solved analytically to obtain the mean value and standard deviation of the parameters assuming Gaussian distributions for the parameters and responses, and a sampling method was utilized to illustrate the non-Gaussian a posteriori distributions of parameters. SPACE is used to demonstrate the data assimilation method by determining the bias and the uncertainty bands of the physical models employing Bennett's heated tube test data and Becker's post critical heat flux experimental data. Based on the results of the data assimilation process, the major sources of the modeling uncertainties were identified for further model development.