• 제목/요약/키워드: distance between two distributions

검색결과 73건 처리시간 0.023초

A Study on Volume of Difference of Two Joint pdf′s, Focused on the Relation to Normal Theory LR Tests

  • Lee, Kwangjin
    • Communications for Statistical Applications and Methods
    • /
    • 제10권3호
    • /
    • pp.749-764
    • /
    • 2003
  • In this paper we explain that normal theory likelihood-ratio tests(z, t, $x^2$. F) for mean(s) or variance(s) can be geometrically related to volume of difference of two joint pdf's. One is an estimated joint pdf under null parameter space $\omega$ and the other is an estimated joint pdf under full parameter space $\Omega$. For explanations, ‘distance between two distributions’ is defined. We study properties of it, and derive some results on the distance between two multivariate normal distributions.

Statistical Fingerprint Recognition Matching Method with an Optimal Threshold and Confidence Interval

  • Hong, C.S.;Kim, C.H.
    • 응용통계연구
    • /
    • 제25권6호
    • /
    • pp.1027-1036
    • /
    • 2012
  • Among various biometrics recognition systems, statistical fingerprint recognition matching methods are considered using minutiae on fingerprints. We define similarity distance measures based on the coordinate and angle of the minutiae, and suggest a fingerprint recognition model following statistical distributions. We could obtain confidence intervals of similarity distance for the same and different persons, and optimal thresholds to minimize two kinds of error rates for distance distributions. It is found that the two confidence intervals of the same and different persons are not overlapped and that the optimal threshold locates between two confidence intervals. Hence an alternative statistical matching method can be suggested by using nonoverlapped confidence intervals and optimal thresholds obtained from the distributions of similarity distances.

연관성 기반 비유사성을 활용한 범주형 자료 군집분석 (Categorical Data Clustering Analysis Using Association-based Dissimilarity)

  • 이창기;정욱
    • 품질경영학회지
    • /
    • 제47권2호
    • /
    • pp.271-281
    • /
    • 2019
  • Purpose: The purpose of this study is to suggest a more efficient distance measure taking into account the relationship between categorical variables for categorical data cluster analysis. Methods: In this study, the association-based dissimilarity was employed to calculate the distance between two categorical data observations and the distance obtained from the association-based dissimilarity was applied to the PAM cluster algorithms to verify its effectiveness. The strength of association between two different categorical variables can be calculated using a mixture of dissimilarities between the conditional probability distributions of other categorical variables, given these two categorical values. In particular, this method is suitable for datasets whose categorical variables are highly correlated. Results: The simulation results using several real life data showed that the proposed distance which considered relationships among the categorical variables generally yielded better clustering performance than the Hamming distance. In addition, as the number of correlated variables was increasing, the difference in the performance of the two clustering methods based on different distance measures became statistically more significant. Conclusion: This study revealed that the adoption of the relationship between categorical variables using our proposed method positively affected the results of cluster analysis.

겨울철 도시지역 대기 수용성 에어로졸 입자의 크기 분포를 결정하는 주요 인자 (Major factors determining the size distributions of atmospheric water-soluble aerosol particles at an urban site during winter)

  • 박승식
    • 한국입자에어로졸학회지
    • /
    • 제17권3호
    • /
    • pp.43-54
    • /
    • 2021
  • Size distributions of atmospheric particulate matter (PM) and its water-soluble organic and inorganic components were measured between January and February 2021 at an urban site in Gwangju in order to identify the major factors that determine their size distributions. Their size distributions during the study period were mainly divided into two groups. In the first group, PM, NO3-, SO42-, NH4+ and water-soluble organic carbon (WSOC) exhibited bi-modal size distributions with a dominant condensation mode at a particle size of 0.32 ㎛. This group was dominated by local production of secondary water-soluble components under atmospheric stagnation and low relative humidity (RH) conditions, rather than long-range transportation of aerosol particles from China. On the other hand, in the second group, they showed tri-modal size distributions with a very pronounced droplet mode at a diameter of 1.0 ㎛. These size distributions were attributable to the local generation and accumulation of secondary aerosol particles under atmospheric conditions such as atmospheric stagnation and high RH, and an increase in the influx of atmospheric aerosol particles by long-distance transportation abroad. Contributions of droplet mode NO3-, SO42-, NH4+ and WSOC to fine particles in the second group were significantly higher than those in the first group period. However, their condensation mode contributions were about two-fold higher in the first group than in the second group. The significant difference in the size distribution of the accumulation mode of the WSOC and secondary ionic components between the two groups was due to the influx of aerosol particles with a long residence time by long-distance transport from China and local weather conditions (e.g., RH).

Direct Divergence Approximation between Probability Distributions and Its Applications in Machine Learning

  • Sugiyama, Masashi;Liu, Song;du Plessis, Marthinus Christoffel;Yamanaka, Masao;Yamada, Makoto;Suzuki, Taiji;Kanamori, Takafumi
    • Journal of Computing Science and Engineering
    • /
    • 제7권2호
    • /
    • pp.99-111
    • /
    • 2013
  • Approximating a divergence between two probability distributions from their samples is a fundamental challenge in statistics, information theory, and machine learning. A divergence approximator can be used for various purposes, such as two-sample homogeneity testing, change-point detection, and class-balance estimation. Furthermore, an approximator of a divergence between the joint distribution and the product of marginals can be used for independence testing, which has a wide range of applications, including feature selection and extraction, clustering, object matching, independent component analysis, and causal direction estimation. In this paper, we review recent advances in divergence approximation. Our emphasis is that directly approximating the divergence without estimating probability distributions is more sensible than a naive two-step approach of first estimating probability distributions and then approximating the divergence. Furthermore, despite the overwhelming popularity of the Kullback-Leibler divergence as a divergence measure, we argue that alternatives such as the Pearson divergence, the relative Pearson divergence, and the $L^2$-distance are more useful in practice because of their computationally efficient approximability, high numerical stability, and superior robustness against outliers.

TYPE SPACES AND WASSERSTEIN SPACES

  • Song, Shichang
    • 대한수학회지
    • /
    • 제55권2호
    • /
    • pp.447-469
    • /
    • 2018
  • Types (over parameters) in the theory of atomless random variable structures correspond precisely to (conditional) distributions in probability theory. Moreover, the logic (resp. metric) topology on the type space corresponds to the topology of weak (resp. strong) convergence of distributions. In this paper, we study metrics between types. We show that type spaces under $d^{\ast}-metric$ are isometric to Wasserstein spaces. Using optimal transport theory, two formulas for the metrics between types are given. Then, we give a new proof of an integral formula for the Wasserstein distance, and generalize some results in optimal transport theory.

Application of Cluster Distributions to Energy Transfer in Two-Dimensional Choleic Acid Crystals

  • 박치헌;송추윤;우희권;최용국;국성근
    • Bulletin of the Korean Chemical Society
    • /
    • 제16권7호
    • /
    • pp.630-634
    • /
    • 1995
  • The cluster distributions for different concentrations of 1,4-dibromonaphthalene (DBN) in 4,4'-dibromobenzophenone (DBBP)/1,4-dibromonaphthalene (DBN) choleic acid were determined by a computer simulation in order to model the energy transfer dynamics. The results of the simulation indicate that long range interaction between molecules further apart than nearest does not occur and energy transfer efficiency is restricted by single range interaction. The results also demonstrate that the trapping is diffusion limited. The energy transfer rate is reduced by a factor of 15 in DBBP/DBN choleic acid realtive to that in DBBP/DBN doped into polystyrene due to the larger distance between molecules.

근접하여 회전하는 두 원통 사이의 윤활유동해석 (Analysis for Lubrication between Two Close Rotating Cylinders)

  • 이승재;정호열;정재택
    • Tribology and Lubricants
    • /
    • 제17권5호
    • /
    • pp.391-398
    • /
    • 2001
  • Two dimensional slow viscous flow around two counter-rotating equal cylinders is investigated based on Stokes'approximation. An exact formal expression of the stream function is obtained by using the bipolar cylinder coordinates and Fourier series expansion. From the stream function obtained, the streamline patterns around the cylinders are shown and the pressure distribution in the flow field is determined. By integrating the stress distributions on the cylinder, the force and the moment exerted on the cylinder are calculated. The flow rate through the gap between the two cylinders is also determined as the distance between two cylinders varies. Special attention is directed to the case of very small distance between two cylinders concerned with the lubrication theory and the minimum pressure is calculated to explain a possible cavitation.

단방향 연속 섬유 복합재 횡단면에서 섬유 배열에 따른 응력 분포 변화 (Effects of Fiber Arrangements on Stress Distributions over the Transverse Cross Section of Unidirectionally Continuous Fiber-reinforced Composites)

  • 최수훈;지우석
    • Composites Research
    • /
    • 제33권1호
    • /
    • pp.30-37
    • /
    • 2020
  • 단방향 연속 섬유 강화 복합소재에 대하여 섬유 배열에 따른 응력 분포 양상을 연구하기 위해 단면 형상을 대표하는 체적 요소를 생성하였다. 대표 체적 요소에 횡방향 하중을 가하였을 때, 섬유와 기지재 강성의 차이로 인해 섬유 둘레에서 응력 집중 현상이 발생하며, 섬유 간 좁은 간격 때문에 집중된 응력이 중첩되며 섬유 주변에서 높은 응력이 구해질 것이라 쉽게 예측할 수 있다. 본 연구에서는 섬유 둘레 응력 증감이 단순히 섬유 간 간격 뿐 아니라 섬유의 상대적 위치가 하중 방향과 이루는 각도에 의해서도 결정됨을 보여준다. 정규 육각 구조를 가지는 대표 체적 요소의 중앙에 위치한 섬유를 다양한 방향으로 이동시키며 횡방향 하중을 가하여, 섬유 주변 응력이 증가하거나 감소하는 양상을 유한요소해석 기법을 이용해 측정하였다. 섬유 간 거리가 최소이면서 두 섬유의 중심을 잇는 선분의 방향이 하중 방향과 일치할 때 응력이 최대로 증가하였으며, 섬유 간 거리가 최소라 하더라도 하중 방향에 수직일 때 최대 응력은 오히려 감소한다는 것을 보여준다.

도심지 마이크로 셀 환경에서의 단구간 페이딩 특성 분석 (Analysis of short-term feding characteristics in urban microcellular environment)

  • 송기홍;김종호;함영권;김제영
    • 한국통신학회논문지
    • /
    • 제22권8호
    • /
    • pp.1652-1658
    • /
    • 1997
  • 본 논문에서는 도심지 마이크로 셀 환경에서의 수신신호에 대한 단구간 페이딩 분포 특성을 분석하였다. 페이딩 특성을 분석하기 위햐어 거리별 페이딩 신호 분포, 전파의 도래 각에 따른 수신 전력 패턴 및 두 수신 안테나사이의 이격 거리에 따른 페이딩 신호의 공간적 상관특성을 보였다. 또한 여러가지 경우에서의 Rician 파라미터 K를 구하여 페이딩 신호의 분포를 비교하였다. 분석 결과, 마이크로 셀에서는 송수신 거리에 따라 페이딩 발생 주기 및 변동폭이 다르게 나타났으며, 가시거리 영역에서 보다 비가시거리 영역에서 페이딩 신호의 발생 주기는 짧아지고 변동폭은 깊게 나타났다. 비교 분석에 이용된 데이터는 전파경로추적 방법(ray tracing technique)에 의한 시뮬레이션을 통하여 얻었다.

  • PDF