• 제목/요약/키워드: Histogram Statistics

검색결과 66건 처리시간 0.027초

Problems Occurred with Histogram and a Resolution

  • Park, Byeong Uk;Park, Hong Nae;Song, Moon Sup;Song, Jae Kee
    • 품질경영학회지
    • /
    • 제18권2호
    • /
    • pp.127-133
    • /
    • 1990
  • In this article, several problems inherent in histogram estimate of unknown probability density function are discussed. Those include so called sharp comers and bin edge effect. A resolution for these problems occurred with histogram is discussed. The resulting estimate is called kernel density estimate which is most widely used by data analysts. One of the most recent and reliable data-based choices of scale factor (bandwidth) of the estimate, which has been known to be most crucial, is also discussed.

  • PDF

Double monothetic clustering for histogram-valued data

  • Kim, Jaejik;Billard, L.
    • Communications for Statistical Applications and Methods
    • /
    • 제25권3호
    • /
    • pp.263-274
    • /
    • 2018
  • One of the common issues in large dataset analyses is to detect and construct homogeneous groups of objects in those datasets. This is typically done by some form of clustering technique. In this study, we present a divisive hierarchical clustering method for two monothetic characteristics of histogram data. Unlike classical data points, a histogram has internal variation of itself as well as location information. However, to find the optimal bipartition, existing divisive monothetic clustering methods for histogram data consider only location information as a monothetic characteristic and they cannot distinguish histograms with the same location but different internal variations. Thus, a divisive clustering method considering both location and internal variation of histograms is proposed in this study. The method has an advantage in interpreting clustering outcomes by providing binary questions for each split. The proposed clustering method is verified through a simulation study and applied to a large U.S. house property value dataset.

히스토그램 변환에서 기준분포의 표준편차 변경에 따른 강인한 화자인증 성능 개선 (Performance Improvement of Robust Speaker Verification According to Various Standard Deviations of a Reference Distribution in Histogram Transformation)

  • 권철홍
    • 말소리와 음성과학
    • /
    • 제2권3호
    • /
    • pp.127-134
    • /
    • 2010
  • Additive noise and channel mismatch strongly degrade the performance of speaker verification systems, as they distort the features of speech. In this paper a histogram transformation technique is presented to improve the robustness of text-independent speaker verification systems. The technique transforms the features extracted from speech such that their histogram is conformed to a reference distribution. The effect of different standard deviations for the reference distribution is investigated. Experimental results indicate that, in channel mismatched environments, the proposed technique offers significant improvements over existing techniques. We also verify performance improvement of the proposed method using statistics.

  • PDF

Technique According to the Calculation of Thresholds of Histogram Based on Overlap Areas for Reducing

  • An, Young-Eun;Bae, Sang-Hyun;Kim, Tae-Yeun
    • 통합자연과학논문집
    • /
    • 제13권2호
    • /
    • pp.83-86
    • /
    • 2020
  • In In this study, technique has been suggested according to the calculation of thresholds of histogram based on overlap areas for reducing noise while analyzing the functions of them. Suggested algorithm is to convert histogram extracted from color images to gray level and select overlap areas from extracted histogram. In addition, feature table is configured after extracting histogram in the relevant overlap area while comparing and retrieving for query and database video images by using this feature table. Suggested retrieval system has been confirmed to be more outstanding with retrieval function in video images with more noises than the system that only used color histogram.

연속확률변수 개념의 직관적 이해에 관한 고찰 (A Study on the Intuitive Understanding Concept of Continuous Random Variable)

  • 박영희
    • 대한수학교육학회지:학교수학
    • /
    • 제4권4호
    • /
    • pp.677-688
    • /
    • 2002
  • The context and intuitive understanding is very important in Statistics Education. Especially, there is a need to mitigate student's difficulty in studying probability density function. One of teaching method this concept is to using relative frequency histogram. But, as using this method, we should know several problems included in that. This study investigate problems in the method for teaching probability density function as gradual meaning of histogram. Also, as alternative approach, this thesis introduce the density curve concept. The application of four methods to teach the concept of the probability density function and analysis of the survey result is done in this research.

  • PDF

운영 위험 관련 손실 분포 - 퍼지 히스토그램의 효과 (Fuzzy histogram in estimating loss distributions for operational risk)

  • 박노진
    • Journal of the Korean Data and Information Science Society
    • /
    • 제20권4호
    • /
    • pp.705-712
    • /
    • 2009
  • 히스토그램이 활용의 간편성과 자료의 전체적 구조를 한 눈에 볼 수 있는 정보량을 제공하지만 히스토그램의 계급 구간의 설정에 따라 그 표현이 달라 질 수 있는 문제가 있다. 이러한 문제를 해결하기 위해 퍼지 개념을 활용한 히스토그램이 제안되었고 그 효과가 제시되었다 (Loquin과 Strauss, 2008). 히스토그램이 다양한 분야에서 사용되지만 요즘 운영 위험과 관련된 손실 분포를 추정함에 있어서 유용하게 사용되고 있다. 그런데, 임계치를 활용한 극단치 확률 함수 추정에 사용함에 있어 임계치의 선택에 따른 히스토그램의 모양 변화는 그 활용을 어렵게 하는 경향이 있다. 본 연구는 퍼지히스토그램을 손실에 대한 극단치 분포를 추정에 사용할 경우 임계치의 선택에 따른 전체적 모양의 차이가 일반적인 히스토그램 보다 크지 않아 상대적으로 안정된 분포를 추정할 수 있음을 보였다.

  • PDF

Constrained Bayes and Empirical Bayes Estimator Applications in Insurance Pricing

  • Kim, Myung Joon;Kim, Yeong-Hwa
    • Communications for Statistical Applications and Methods
    • /
    • 제20권4호
    • /
    • pp.321-327
    • /
    • 2013
  • Bayesian and empirical Bayesian methods have become quite popular in the theory and practice of statistics. However, the objective is to often produce an ensemble of parameter estimates as well as to produce the histogram of the estimates. For example, in insurance pricing, the accurate point estimates of risk for each group is necessary and also proper dispersion estimation should be considered. Well-known Bayes estimates (which is the posterior means under quadratic loss) are underdispersed as an estimate of the histogram of parameters. The adjustment of Bayes estimates to correct this problem is known as constrained Bayes estimators, which are matching the first two empirical moments. In this paper, we propose a way to apply the constrained Bayes estimators in insurance pricing, which is required to estimate accurately both location and dispersion. Also, the benefit of the constrained Bayes estimates will be discussed by analyzing real insurance accident data.

Pointwise Estimation of Density of Heteroscedastistic Response in Regression

  • Hyun, Ji-Hoon;Kim, Si-Won;Lee, Sung-Dong;Byun, Wook-Jae;Son, Mi-Kyoung;Kim, Choong-Rak
    • 응용통계연구
    • /
    • 제25권1호
    • /
    • pp.197-203
    • /
    • 2012
  • In fitting a regression model, we often encounter data sets which do not follow Gaussian distribution and/or do not have equal variance. In this case estimation of the conditional density of a response variable at a given design point is hardly solved by a standard least squares method. To solve this problem, we propose a simple method to estimate the distribution of the fitted vales under heteroscedasticity using the idea of quantile regression and the histogram techniques. Application of this method to a real data sets is given.

FREQUENCY HISTOGRAM MODEL FOR LINE TRANSECT DATA WITH AND WITHOUT THE SHOULDER CONDITION

  • EIDOUS OMAR
    • Journal of the Korean Statistical Society
    • /
    • 제34권1호
    • /
    • pp.49-60
    • /
    • 2005
  • In this paper we introduce a nonparametric method for estimating the probability density function of detection distances in line transect sampling. The estimator is obtained using a frequency histogram density estimation method. The asymptotic properties of the proposed estimator are derived and compared with those of the kernel estimator under the assumption that the data collected satisfy the shoulder condition. We found that the asymptotic mean square error (AMSE) of the two estimators have about the same convergence rate. The formula for the optimal histogram bin width is derived which minimizes AMSE. Moreover, the performances of the corresponding k-nearest-neighbor estimators are studied through simulation techniques. In the absence of our knowledge whether the shoulder condition is valid or not a new semi-parametric model is suggested to fit the line transect data. The performances of the proposed two estimators are studied and compared with some existing nonparametric and semiparametric estimators using simulation techniques. The results demonstrate the superiority of the new estimators in most cases considered.

강인한 화자 확인을 위한 히스토그램 개선 기법 (Histogram Enhancement for Robust Speaker Verification)

  • 최재길;권철홍
    • 대한음성학회지:말소리
    • /
    • 제63호
    • /
    • pp.153-170
    • /
    • 2007
  • It is well known that when there is an acoustic mismatch between the speech obtained during training and testing, the accuracy of speaker verification systems drastically deteriorates. This paper presents the use of MFCCs' histogram enhancement technique in order to improve the robustness of a speaker verification system. The technique transforms the features extracted from speech within an utterance such that their statistics conform to reference distributions. The reference distributions proposed in this paper are uniform distribution and beta distribution. The transformation modifies the contrast of MFCCs' histogram so that the performance of a speaker verification system is improved both in the clean training and testing environment and in the clean training and noisy testing environment.

  • PDF