• Title/Summary/Keyword: Similarity Measures

Search Result 305, Processing Time 0.022 seconds

Empirical Comparison of Word Similarity Measures Based on Co-Occurrence, Context, and a Vector Space Model

  • Kadowaki, Natsuki;Kishida, Kazuaki
    • Journal of Information Science Theory and Practice
    • /
    • v.8 no.2
    • /
    • pp.6-17
    • /
    • 2020
  • Word similarity is often measured to enhance system performance in the information retrieval field and other related areas. This paper reports on an experimental comparison of values for word similarity measures that were computed based on 50 intentionally selected words from a Reuters corpus. There were three targets, including (1) co-occurrence-based similarity measures (for which a co-occurrence frequency is counted as the number of documents or sentences), (2) context-based distributional similarity measures obtained from a latent Dirichlet allocation (LDA), nonnegative matrix factorization (NMF), and Word2Vec algorithm, and (3) similarity measures computed from the tf-idf weights of each word according to a vector space model (VSM). Here, a Pearson correlation coefficient for a pair of VSM-based similarity measures and co-occurrence-based similarity measures according to the number of documents was highest. Group-average agglomerative hierarchical clustering was also applied to similarity matrices computed by individual measures. An evaluation of the cluster sets according to an answer set revealed that VSM- and LDA-based similarity measures performed best.

A New Class of Similarity Measures for Fuzzy Sets

  • Omran Saleh;Hassaballah M.
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.6 no.2
    • /
    • pp.100-104
    • /
    • 2006
  • Fuzzy techniques can be applied in many domains of computer vision community. The definition of an adequate similarity measure for measuring the similarity between fuzzy sets is of great importance in the field of image processing, image retrieval and pattern recognition. This paper proposes a new class of the similarity measures. The properties, sensitivity and effectiveness of the proposed measures are investigated and tested on real data. Experimental results show that these similarity measures can provide a useful way for measuring the similarity between fuzzy sets.

Operations on the Similarity Measures of Fuzzy Sets

  • Omran, Saleh;Hassaballah, M.
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.7 no.3
    • /
    • pp.205-208
    • /
    • 2007
  • Measuring the similarity between fuzzy sets plays a vital role in several fields. However, none of all well-known similarity measure methods is all-powerful, and all have the localization of its usage. This paper defines some operations on the similarity measures of fuzzy sets such as summation and multiplication of two similarity measures. Also, these operations will be generalized to any number of similarity measures. These operations will be very useful especially in the field of computer vision, and data retrieval because these fields need to combine and find some relations between similarity measures.

A similarity measure of fuzzy sets

  • Kwon, Soon H.
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.11 no.3
    • /
    • pp.270-274
    • /
    • 2001
  • Conventional similarity measures suggested so far can be classified into three categories: (i) geometric similarity measures, (ij) set-theoretic similarity measures, and (iii) matching function-based similarity measures. On the basis of the characteristics of the conventional similarity measures, in this paper, we propose a new similarity measure of fuzzy sets and investigate its properLies. Finally, numelical examples are provided for the comparison of characteristics of the proposed similarity measure and other previous similarity measures.

  • PDF

APPLICATIONS OF SIMILARITY MEASURES FOR PYTHAGOREAN FUZZY SETS BASED ON SINE FUNCTION IN DECISION-MAKING PROBLEMS

  • ARORA, H.D.;NAITHANI, ANJALI
    • Journal of applied mathematics & informatics
    • /
    • v.40 no.5_6
    • /
    • pp.897-914
    • /
    • 2022
  • Pythagorean fuzzy sets (PFSs) are capable of modelling information with more uncertainties in decision-making problems. The essential feature of PFSs is that they are described by three parameters: membership function, non-membership function and hesitant margin, with the total of the squares of each parameter equal to one. The purpose of this article is to suggest some new similarity measures and weighted similarity measures for PFSs. Numerical computations have been carried out to validate our proposed measures. Applications of these measures have been applied to some real-life decision-making problems of pattern detection and medicinal investigations. Moreover, a descriptive illustration is employed to compare the results of the proposed measures with the existing analogous similarity measures to show their effectiveness.

Similarity Classifier based on Schweizer & Sklars t-norms

  • Luukka, P.;Sampo, J.
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2004.08a
    • /
    • pp.1053-1056
    • /
    • 2004
  • In this article we have applied Schweizer & Sklars t-norm based similarity measures to classification task. We will compare results to fuzzy similarity measure based classification and show that sometimes better results can be found by using these measures than fuzzy similarity measure. We will also show that classification results are not so sensitive to p values with Schweizer & Sklars measures than when fuzzy similarity is used. This is quite important when one does not have luxury of tuning these kind of parameters but needs good classification results fast.

  • PDF

Effective and Efficient Similarity Measures for Purchase Histories Considering Product Taxonomy

  • Yang, Yu-Jeong;Lee, Ki Yong
    • Journal of Information Processing Systems
    • /
    • v.17 no.1
    • /
    • pp.107-123
    • /
    • 2021
  • In an online shopping site or offline store, products purchased by each customer over time form the purchase history of the customer. Also, in most retailers, products have a product taxonomy, which represents a hierarchical classification of products. Considering the product taxonomy, the lower the level of the category to which two products both belong, the more similar the two products. However, there has been little work on similarity measures for sequences considering a hierarchical classification of elements. In this paper, we propose new similarity measures for purchase histories considering not only the purchase order of products but also the hierarchical classification of products. Unlike the existing methods, where the similarity between two elements in sequences is only 0 or 1 depending on whether two elements are the same or not, the proposed method can assign any real number between 0 and 1 considering the hierarchical classification of elements. We apply this idea to extend three existing representative similarity measures for sequences. We also propose an efficient computation method for the proposed similarity measures. Through various experiments, we show that the proposed method can measure the similarity between purchase histories very effectively and efficiently.

Evaluation of Positioning Effectiveness Based on the Preference and Similarity Data Derived from Consumers' Choice from Different Choice Sets (선택집합의 변화를 통하여 도출된 선호도 및 유사성 정보를 활용한 포지셔닝 우위 평가)

  • Won, Jee-Sung
    • Korean Management Science Review
    • /
    • v.28 no.1
    • /
    • pp.61-74
    • /
    • 2011
  • Not only the preference data but also the similarity data can be used for developing effective marketing strategies. Hahn et al.[10] proposes a methodology of representing a brand(focal brand)'s competitors in a single map called the Preference-Similarity Map, according to their relative preference to and similarity with the focal brand. They also proposes a way to derive the relative preference and similarity values from the survey collecting the choice data from differing choice sets. This study identifies the limitations of the preference and similarity measures proposed by Hahn et al.[10] and shows how these measures can be revised. This study also proposes how to implement the revised measures and analyze brands' positioning strategies. Based on the results of the previous studies on the effect of inter brand similarity on brand evaluations, this study assumes that it is important to analyze how much a specific brand is preferred to its close competitors when evaluating the effectiveness of the brand's positioning in the market. This study applies the proposed measures to the data used in Hahn et al.[10] and also show how the proposed measures are related to the parameters of the choice model proposed by Batsell and Polking[1].

Fuzzy similarity measure in Hypergraph

  • Lee, H.-Kwang
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 1998.06a
    • /
    • pp.549-551
    • /
    • 1998
  • For a fuzzy system modeled by a fuzzy hypergraph, two fuzzy similarity measures are proposed : one for the fuzzy similarity between fuzzy sets and the other between elements in fuzzy sets. The propose measures can represent the realistic similarities which can not be given by the existing measures. With and example, it is shown that it can be used in the behavior analysis in an organization.

  • PDF

Information Quantification Application to Management with Fuzzy Entropy and Similarity Measure

  • Wang, Hong-Mei;Lee, Sang-Hyuk
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.10 no.4
    • /
    • pp.275-280
    • /
    • 2010
  • Verification of efficiency in data management fuzzy entropy and similarity measure were discussed and verified by applying reliable data selection problem and numerical data similarity evaluation. In order to calculate the certainty or uncertainty fuzzy entropy and similarity measure are designed and proved. Designed fuzzy entropy and similarity are considered as dissimilarity measure and similarity measure, and the relation between two measures are explained through graphical illustration. Obtained measures are useful to the application of decision theory and mutual information analysis problem. Extension of data quantification results based on the proposed measures are applicable to the decision making and fuzzy game theory.