• Title/Abstract/Keywords: Skewed Data

203 search results (processing time: 0.026 s)

Predictive Memory Allocation over Skewed Streams

  • Yun, Hong-Won
    • Journal of information and communication convergence engineering / Vol. 7, No. 2 / pp.199-202 / 2009
  • Adaptive memory management is a serious issue in data stream management. Data streams differ from the traditional stored relational model in several respects: they arrive online, are high in volume, and have skewed data distributions. Data skew is a common property of massive data streams. We propose a predictive allocation strategy, which uses predictive processing to cope with time-varying data skew. This processing includes memory usage estimation and indexing with timestamps. Our experimental study shows that the predictive strategy reduces both the required memory space and the latency for data whose skew varies over time.

Estimations in a skewed uniform distribution

  • Son, Hee-Ju;Woo, Jung-Soo
    • Journal of the Korean Data and Information Science Society / Vol. 20, No. 4 / pp.733-740 / 2009
  • We derive a skewed uniform distribution from a uniform distribution and evaluate its coefficient of skewness. We obtain the approximate maximum likelihood estimator (AML) and the moment estimator of the skew parameter in the skewed uniform distribution. We then compare the simulated mean squared errors (MSE) of those estimators, and also compare the MSEs of two proposed reliability estimators in two independent skewed uniform distributions with different skew parameters.

Estimations of the skew parameter in a skewed double power function distribution

  • Kang, Jun-Ho;Lee, Chang-Soo
    • Journal of the Korean Data and Information Science Society / Vol. 24, No. 4 / pp.901-909 / 2013
  • A skewed double power function distribution is defined from a double power function distribution. We evaluate the coefficient of skewness of the skewed double power function distribution. We obtain an approximate maximum likelihood estimator (MLE) and a moment estimator (MME) of the skew parameter in the skewed double power function distribution, and compare the simulated mean squared errors (MSEs) of those estimators. We also compare the simulated MSEs of two proposed reliability estimators in two independent skewed double power function distributions with different skew parameters.

Bayesian Analysis of a New Skewed Multivariate Probit for Correlated Binary Response Data

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society / Vol. 30, No. 4 / pp.613-635 / 2001
  • This paper proposes a skewed multivariate probit model for analyzing correlated binary response data with covariates. The proposed model is formulated by introducing an asymmetric link based upon a skewed multivariate normal distribution. The model, connected to the asymmetric multivariate link, allows for flexible modeling of the correlation structure among binary responses and straightforward interpretation of the parameters. However, the complex likelihood function of the model prevents us from fitting and analyzing the model analytically. Simulation-based Bayesian inference methodologies are provided to overcome this problem. We examine the suggested methods on two data sets in order to demonstrate their performance.

Determining the Appropriate Installation Angle of Skewed Sensor to Measure Vehicle Wandering

  • 오주삼;장경찬;김민성;장진환
    • 한국도로학회논문집 (International Journal of Highway Engineering) / Vol. 10, No. 3 / pp.79-86 / 2008
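  • This study proposes an appropriate installation angle for the skewed sensor used to measure vehicle wandering, that is, the position at which a vehicle's dynamic load acts on the road. To this end, wandering-measurement equipment was developed using tape switch sensors, and evaluation data were collected with the developed equipment and a test vehicle. Analysis of the collected data showed that the error in the wandering data decreased as the installation angle of the skewed sensor increased, and that this reduction was statistically significant. However, when the skewed sensor was installed at $30^{\circ}$ or more, erroneous data were collected because of the dimensions of tandem axles. Therefore, taking domestic vehicle dimensions into account, this study proposes $20^{\circ}{\sim}25^{\circ}$ as the appropriate installation angle for the skewed sensor for wandering measurement.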

A SKEWED GENERALIZED t DISTRIBUTION

  • NADARAJAH SARALEES
    • Journal of the Korean Statistical Society / Vol. 34, No. 4 / pp.311-329 / 2005
  • Skewed t distributions have attracted significant attention in the last few years. In this paper, a generalization, referred to as the skewed generalized t distribution, with the pdf $f(x) = 2g(x)G(\lambda x)$ is introduced, where $g(\cdot)$ and $G(\cdot)$ are taken, respectively, to be the pdf and the cdf of the generalized t distribution due to McDonald and Newey (1984, 1988). Several particular cases of this distribution are identified and various representations for its moments are derived. An application is provided to rainfall data from Orlando, Florida.
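
The construction $f(x) = 2g(x)G(\lambda x)$ in the abstract above can be illustrated with a minimal Python sketch. It is not the paper's code. The parameterization of the generalized t pdf (scale `sigma`, shape parameters `p` and `q`) is the commonly quoted McDonald-Newey form and is an assumption here. The cdf is obtained by simple numerical integration, which is a choice made for this sketch, and all function names are hypothetical.

```python
import math


def gt_pdf(x, sigma=1.0, p=2.0, q=2.0):
    """Generalized t pdf g(x); the (sigma, p, q) parameterization is assumed."""
    beta = math.gamma(1.0 / p) * math.gamma(q) / math.gamma(1.0 / p + q)
    norm = p / (2.0 * sigma * q ** (1.0 / p) * beta)
    return norm * (1.0 + abs(x) ** p / (q * sigma ** p)) ** -(q + 1.0 / p)


def _simpson(f, a, b, n):
    """Composite Simpson rule on [a, b] with an even number n of intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(a + i * h)
    return total * h / 3.0


def gt_cdf(x, sigma=1.0, p=2.0, q=2.0):
    """Generalized t cdf G(x), using symmetry of g about zero."""
    if x == 0.0:
        return 0.5
    # Keep the step size at or below sigma / 50 so accuracy holds for large |x|.
    n = 2 * max(100, int(50 * abs(x) / sigma))
    half = _simpson(lambda t: gt_pdf(t, sigma, p, q), 0.0, abs(x), n)
    return 0.5 + math.copysign(half, x)


def skewed_gt_pdf(x, lam, sigma=1.0, p=2.0, q=2.0):
    """Skewed density f(x) = 2 g(x) G(lam * x)."""
    return 2.0 * gt_pdf(x, sigma, p, q) * gt_cdf(lam * x, sigma, p, q)


if __name__ == "__main__":
    for x in (-2.0, -1.0, 0.0, 1.0, 2.0):
        print(x, skewed_gt_pdf(x, lam=3.0))
```

With `lam = 0` the factor $2G(0)$ equals 1, so the sketch reduces to the symmetric density $g$. A positive `lam` shifts mass to the right, a negative `lam` to the left.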

Relationship Between the Mean and Median in a Skewed Frequency Distribution

  • Shin, Mi-Young;Cho, Tae Kyoung
    • Communications for Statistical Applications and Methods / Vol. 11, No. 3 / pp.513-518 / 2004
  • The well-known mode-mean-median inequality for a unimodal population distribution does not always hold for a frequency distribution. Yet many elementary statistics textbooks merely mention that the relative location of the mean and median can be used to determine whether a distribution is positively or negatively skewed. In this paper we introduce a method for generating data that is positively skewed but mean
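
The claim that the relative location of the mean and median need not match the sign of the skewness can be checked on a small frequency distribution. The data set below is hypothetical, chosen by hand for illustration. It is not produced by the paper's generating method, which the abstract does not describe. Skewness is measured here by the moment coefficient $m_3 / m_2^{3/2}$, which is also an assumption of this sketch.

```python
import statistics


def moment_skewness(data):
    """Moment coefficient of skewness m3 / m2**1.5 (population moments)."""
    n = len(data)
    mean = sum(data) / n
    m2 = sum((x - mean) ** 2 for x in data) / n
    m3 = sum((x - mean) ** 3 for x in data) / n
    return m3 / m2 ** 1.5


# Hypothetical frequency distribution: value 1 four times, 2 five times, 4 once.
data = [1] * 4 + [2] * 5 + [4] * 1

mean = statistics.mean(data)
median = statistics.median(data)
skew = moment_skewness(data)

if __name__ == "__main__":
    print("mean   =", mean)
    print("median =", median)
    print("skew   =", skew)
```

The single large value makes the third central moment positive, while the four observations below the median pull the mean under it. The sample is therefore positively skewed by this measure even though the mean is smaller than the median.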

BAYESIAN HIERARCHICAL MODEL WITH SKEWED ELLIPTICAL DISTRIBUTION

  • Chung, Youn-Shik;Dipak K. Dey;Yang, Tae-Young;Jang, Jung-Hoon
    • Journal of the Korean Statistical Society / Vol. 32, No. 4 / pp.425-448 / 2003
  • Meta-analysis refers to quantitative methods for combining results from independent studies in order to draw overall conclusions. We consider hierarchical models, including selection models, under a skewed heavy-tailed error distribution originally proposed by Chen et al. (1999) and Branco and Dey (2001). These rich classes of models combine the information from independent studies, allow investigation of variability both between and within studies, and incorporate a weight function. Here, testing for the skewness parameter is discussed. The score test statistic for such a test can be expressed in terms of posterior expectations. We also consider the detailed computational scheme under skewed normal and skewed Student-t distributions using the MCMC method. Finally, we introduce one example from the real data of Johnson (1993) and apply our proposed methodology. We investigate the sensitivity of our results under different skewed errors and under different prior distributions.

Binary regression model using skewed generalized t distributions

  • 김미정
    • 응용통계연구 (The Korean Journal of Applied Statistics) / Vol. 30, No. 5 / pp.775-791 / 2017
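  • Binary data are frequently encountered in everyday life. Logistic, probit, cauchit, and complementary log-log models are commonly used for regression analysis of binary data. Other approaches include the robit model based on the t distribution proposed by Liu (2004) and the generalized t-link model proposed by Kim et al. (2008). Building on the observation that a flexible distribution yields a flexible regression model, this paper introduces a binary regression model that maximizes the likelihood function using the skewed generalized t distribution of Theodossiou (1998). We show how the proposed method can be carried out in R by linking the skewed generalized t distribution to the R glm function and the R sgt package, and we analyze the Pima Indian data.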

New Family of the Exponential Distributions for Modeling Skewed Semicircular Data

  • Kim, Hyoung-Moon
    • 응용통계연구 (The Korean Journal of Applied Statistics) / Vol. 22, No. 1 / pp.205-220 / 2009
  • For modeling skewed semicircular data, we derive a new family of exponential distributions. We extend it to the l-axial exponential distribution by a transformation for modeling any arc of arbitrary length. It is straightforward to generate samples from the l-axial exponential distribution. Asymptotic results reveal two things. The first is that the linear exponential distribution can be used to approximate the l-axial exponential distribution. The second is that the l-axial exponential distribution has the asymptotic memoryless property, though it does not have the strict memoryless property. Some trigonometric moments are also derived in closed form. Maximum likelihood estimation is adopted to estimate the model parameters. Some hypothesis tests and confidence intervals are also developed. The Kolmogorov-Smirnov test is adopted as a goodness-of-fit test for the l-axial exponential distribution. Finally, we obtain a bivariate version of two kinds of l-axial exponential distributions.