• Title/Summary/Keyword: SIMILARITY ANALYSIS

Sentence Similarity Analysis using Ontology Based on Cosine Similarity (코사인 유사도를 기반의 온톨로지를 이용한 문장유사도 분석)

  • Hwang, Chi-gon;Yoon, Chang-Pyo;Yun, Dai Yeol
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2021.05a / pp.441-443 / 2021
  • Sentence (or text) similarity measures how similar two sentences are. Techniques for measuring text similarity include Jaccard similarity, cosine similarity, Euclidean similarity, and Manhattan similarity. Cosine similarity is currently the most widely used, but because it is based only on the occurrence or frequency of words in a sentence, it does not adequately capture semantic relationships. We therefore aim to improve sentence similarity analysis by using an ontology to define relations between words and by including semantic similarity when extracting the words that two sentences have in common.
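
To make the limitation described in this abstract concrete, here is a minimal sketch of word-based Jaccard and cosine similarity. The whitespace tokenizer, the example sentences, and the one-entry `TOY_ONTOLOGY` synonym table are assumptions for illustration only; they are not the authors' ontology or method.

```python
import math
from collections import Counter

# Hypothetical one-entry "ontology": maps a word to a canonical synonym.
TOY_ONTOLOGY = {"automobile": "car"}

def tokenize(sentence, ontology=None):
    # Lowercase and split on whitespace; optionally map words to canonical forms.
    words = sentence.lower().split()
    if ontology:
        words = [ontology.get(w, w) for w in words]
    return words

def jaccard_similarity(a, b, ontology=None):
    # Shared distinct words divided by all distinct words.
    set_a, set_b = set(tokenize(a, ontology)), set(tokenize(b, ontology))
    if not set_a and not set_b:
        return 1.0
    return len(set_a & set_b) / len(set_a | set_b)

def cosine_similarity(a, b, ontology=None):
    # Cosine of the angle between the two word-frequency vectors.
    freq_a = Counter(tokenize(a, ontology))
    freq_b = Counter(tokenize(b, ontology))
    dot = sum(freq_a[w] * freq_b[w] for w in freq_a.keys() & freq_b.keys())
    norm_a = math.sqrt(sum(c * c for c in freq_a.values()))
    norm_b = math.sqrt(sum(c * c for c in freq_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

s1 = "the car is fast"
s2 = "the automobile is fast"

print(jaccard_similarity(s1, s2))               # 3 shared / 5 distinct words = 0.6
print(cosine_similarity(s1, s2))                # 3 / (2 * 2) = 0.75
print(cosine_similarity(s1, s2, TOY_ONTOLOGY))  # synonyms merged: 1.0
```

Without the synonym table, "car" and "automobile" count as unrelated words, which is the gap in purely frequency-based measures that the abstract points out.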

Comparison of Code Similarity Analysis Performance of funcGNN and Siamese Network (funcGNN과 Siamese Network의 코드 유사성 분석 성능비교)

  • Choi, Dong-Bin;Jo, In-su;Park, Young B.
    • Journal of the Semiconductor & Display Technology / v.20 no.3 / pp.113-116 / 2021
  • As artificial intelligence technologies, including deep learning, develop, they are being introduced to code similarity analysis. The traditional analysis method converts the source code into a control flow graph (CFG) and then calculates the graph edit distance (GED). Some studies instead calculate the GED from the converted CFG through a trained graph neural network (GNN), and methods that analyze code similarity through a CNN by imaging the CFG are also being studied. In this paper, to determine which approach will be effective and efficient for future research on AI-based code similarity analysis, code similarity is measured with funcGNN, which measures code similarity using a GNN, and with a Siamese network, which is an image similarity analysis model, and the accuracy of the two is compared. The analysis shows that the error rate of the Siamese network (0.0458) was higher than that of funcGNN (0.0362).

Similarity Analysis Between Fuzzy Set and Crisp Set

  • Park, Hyun-Jeong;Lee, Sang-Hyuk.
    • International Journal of Fuzzy Logic and Intelligent Systems / v.7 no.4 / pp.295-300 / 2007
  • Similarity analysis is carried out for pairs of fuzzy sets and pairs of crisp sets. A similarity measure based on a distance measure is derived and proved. The proposed similarity measure is examined by analyzing the uncertain and certain parts of the membership functions. Its usefulness is verified by computing the similarity between a fuzzy set and a crisp set and between two fuzzy sets. The results are also compared with those of a previous similarity measure based on fuzzy numbers.
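
As a rough illustration of a distance-based similarity measure, the sketch below uses the normalized Hamming distance between membership vectors and defines similarity as one minus that distance. This is a common textbook construction chosen here as an assumption; the abstract does not state the measure the authors actually derive, and the membership values are made up.

```python
def hamming_distance(mu_a, mu_b):
    # Normalized Hamming distance between two membership vectors in [0, 1].
    if len(mu_a) != len(mu_b):
        raise ValueError("membership vectors must have the same length")
    return sum(abs(x - y) for x, y in zip(mu_a, mu_b)) / len(mu_a)

def similarity(mu_a, mu_b):
    # Similarity defined as the complement of the distance.
    return 1.0 - hamming_distance(mu_a, mu_b)

# Illustrative membership values over a four-element universe.
fuzzy_set = [0.0, 0.25, 0.75, 1.0]  # graded membership
crisp_set = [0, 0, 1, 1]            # membership is only 0 or 1

print(similarity(fuzzy_set, crisp_set))  # 1 - (0.25 + 0.25) / 4 = 0.875
print(similarity(fuzzy_set, fuzzy_set))  # identical sets: 1.0
```

A crisp set is simply a fuzzy set whose memberships are all 0 or 1, so the same function covers the fuzzy/crisp and fuzzy/fuzzy cases.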

Feasibility Study on Similarity Principle in Discrete Element Analysis (이산요소법을 이용한 수치해석에서의 상사성 이론의 적용성 검토)

  • Yun, Taeyoung;Park, Hee Mun
    • International Journal of Highway Engineering / v.18 no.2 / pp.51-60 / 2016
  • PURPOSES: The applicability of the mechanics-based similarity concept (suggested by Feng et al.) for determining scaled variables, including length and load, via laboratory-scale tests and discrete element analysis, was evaluated. METHODS: Several studies on the similarity concept were reviewed. The exact scaling approach, a similarity concept described by Feng, was applied in order to determine an analytical solution of a free-falling ball. This solution can be considered one of the simplest conditions for discrete element analysis. RESULTS: The results revealed that 1) the exact scaling approach can be used to determine the scale of variables in laboratory tests and numerical analysis, 2) applying only a scale factor, via the exact scaling approach, is inadequate for the error-free replacement of small particles by large ones during discrete element analysis, 3) the level of continuity of flowable materials such as SCC and cement mortar seems to be an important criterion for evaluating the applicability of the similarity concept, and 4) additional conditions, such as the kinetics of particles, contact model, and geometry, must be taken into consideration to achieve the maximum radius of replacement particles during discrete element analysis. CONCLUSIONS: The concept of similarity is a convenient tool for evaluating how well a scaled laboratory test or numerical analysis corresponds to physical conditions. However, to achieve excellent correspondence, additional factors, such as the kinetics of particles, contact model, and geometry, must be taken into consideration.

Efficient Similarity Analysis Methods for Same Open Source Functions in Different Versions (서로 다른 버전의 동일 오픈소스 함수 간 효율적인 유사도 분석 기법)

  • Kim, Yeongcheol;Cho, Eun-Sun
    • Journal of KIISE / v.44 no.10 / pp.1019-1025 / 2017
  • Binary similarity analysis is used in vulnerability analysis, malicious code analysis, and plagiarism detection. Proving through similarity analysis that a function is equivalent to a different version of a well-known safe function can improve the efficiency of both malicious-behavior analysis of binary code and vulnerability analysis. However, few studies have addressed similarity analysis of the same function across different versions. In this paper, we analyze similarity at the function level through various methods based on function information extractable from binary code, and identify a way to perform the analysis efficiently in less time. In particular, we perform a comparative analysis of different versions of the OpenSSL library to determine how similar functions can be detected even when the versions differ.

Antecedents of consumers' decision postponement on purchasing fast fashion brands (패스트 패션 브랜드에 대한 소비자 의사결정 연기의 선행변수)

  • Park, Hye-Jung
    • The Research Journal of the Costume Culture / v.22 no.5 / pp.743-759 / 2014
  • The purpose of this study is to identify the antecedents of consumers' decision postponement on purchasing fast fashion brands. Ongoing search behavior, overchoice confusion, and similarity confusion were considered as antecedents. It was hypothesized that ongoing search behavior influences decision postponement both directly and indirectly through overchoice confusion and similarity confusion. Data were gathered by surveying university students in Seoul, using convenience sampling. Three hundred five questionnaires were used in the statistical analysis, which consisted of exploratory factor analysis using SPSS and confirmatory factor analysis and path analysis using AMOS. Factor analysis showed that ongoing search behavior, overchoice confusion, similarity confusion, and decision postponement were each unidimensional. Tests of the hypothesized paths showed that ongoing search behavior influences decision postponement indirectly through overchoice confusion. In addition, similarity confusion influences decision postponement. The results suggest some confusion reduction strategies for marketers of fast fashion brands. Suggestions for future study are also discussed.

Comparison Analysis of Co-authorship Network and Citation Based Network for Author Research Similarity Exploration

  • Jeeyoung, Yoon;Min, Song
    • Journal of the Korean Society for Library and Information Science / v.56 no.4 / pp.269-284 / 2022
  • Exploring the research similarity of researchers offers insight into research communities and potential interactions among scholars. While co-authorship is a popular measure for studying research similarity, it cannot provide insight into authors who have not yet collaborated. In this work, we present a novel approach that captures the research similarity of authors using citation information. An extensive study is conducted on DATA & KNOWLEDGE ENGINEERING (DKE) publications to demonstrate the suggested approach and compare it with a co-authorship-based approach. The analysis shows that the proposed approach distinguishes author relationships that are not shown in the co-authorship network.

Parentage Identification of 'Daebong' Grape (Vitis spp.) Using RAPD Analysis

  • Kim, Seung-Heui;Jeong, Jae-Hun;Kim, Seon-Kyu;Paek, Kee-Yoeup
    • Journal of Plant Biotechnology / v.4 no.2 / pp.67-70 / 2002
  • The RAPD data were used to assess genetic similarity among 7 grape cultivars. Of the 100 random primers tested on genomic DNA, 10 primers could be selected for genetic analysis, and the selected primers generated a total of 115 distinct amplification fragments. A similarity matrix was constructed on the basis of the presence or absence of bands. The 7 grape cultivars analyzed with UPGMA were clustered into two groups, A and B. The similarity coefficient values among the cultivars were high. The mean similarity index for all pairwise comparisons was 0.851, and ranged from 0.714 ('Rosaki' and 'Black Olympia') to 0.988 ('Kyoho' and 'Daebong'). After due consideration of differences in the cultural and morphological characteristics of these two theoretically identical cultivars, it could be deduced that 'Daebong' is a bud sport of the 'Kyoho' cultivar.
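
The step of building a similarity matrix from band presence or absence can be sketched as follows. The abstract does not name the similarity coefficient, so the Jaccard coefficient is used here as an assumption; the cultivar labels (`cv1` to `cv3`) and band patterns are made-up placeholders, not data from the study.

```python
from itertools import combinations

def band_similarity(x, y):
    # Jaccard coefficient on binary band vectors (1 = band present, 0 = absent):
    # bands present in both samples divided by bands present in at least one.
    shared = sum(1 for a, b in zip(x, y) if a == 1 and b == 1)
    either = sum(1 for a, b in zip(x, y) if a == 1 or b == 1)
    return shared / either if either else 1.0

# Hypothetical band patterns for three placeholder cultivars.
bands = {
    "cv1": [1, 1, 0, 1, 1],
    "cv2": [1, 1, 0, 1, 0],
    "cv3": [0, 1, 1, 0, 1],
}

# Pairwise similarity matrix stored as a dict keyed by cultivar pair.
matrix = {
    (p, q): band_similarity(bands[p], bands[q])
    for p, q in combinations(sorted(bands), 2)
}

for pair, value in matrix.items():
    print(pair, round(value, 3))
```

A matrix like this is the input that a clustering step such as UPGMA would then consume, usually after converting each similarity s to a distance 1 - s.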

A Study on Detecting Changes in Injection Molding Process through Similarity Analysis of Mold Vibration Signal Patterns (금형 기반 진동 신호 패턴의 유사도 분석을 통한 사출성형공정 변화 감지에 대한 연구)

  • Jong-Sun Kim
    • Design & Manufacturing / v.17 no.3 / pp.34-40 / 2023
  • In this study, mold vibration signals were collected in real time during injection molding through IoT devices installed on the mold surface. To analyze changes in the collected vibration signals, injection molding was performed under six different process conditions. Analysis of the mold vibration signals according to process conditions revealed distinct trends and patterns. Based on this result, cosine similarity was applied to compare pattern changes in the mold vibration signals. The similarity between the collected data was analyzed in the time and acceleration vector space. The results showed that, for all six process settings, the cosine similarity between signals collected under identical conditions remained around 0.92±0.07. However, when different process conditions were applied, the cosine similarity decreased to around 0.47±0.07. Based on these results, a cosine similarity threshold of 0.60 to 0.70 was established. When applied to the analysis of mold vibration signals, this threshold made it possible to determine whether the molding process was stable or whether variations had occurred due to changes in process conditions. This establishes the potential use of cosine similarity based on mold vibration signals in future applications for real-time monitoring of molding process changes and anomaly detection.
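
The thresholding idea in this abstract can be sketched as below. The cutoff of 0.65 is an assumed midpoint of the reported 0.60 to 0.70 range, and the short signal vectors are synthetic stand-ins, not measured vibration data; `process_changed` is a hypothetical helper name.

```python
import math

def cosine_similarity(u, v):
    # Cosine of the angle between two equal-length signal vectors.
    if len(u) != len(v):
        raise ValueError("signals must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

# Assumed cutoff inside the 0.60 to 0.70 range reported in the abstract.
THRESHOLD = 0.65

def process_changed(reference, current, threshold=THRESHOLD):
    # Flag a process change when similarity to the reference drops below the cutoff.
    return cosine_similarity(reference, current) < threshold

# Synthetic acceleration samples (illustrative only).
reference = [0.0, 1.0, 0.0, -1.0, 0.0, 1.0]
same_condition = [0.1, 0.9, 0.0, -1.1, 0.1, 1.0]    # small noise on the same pattern
other_condition = [1.0, 0.0, -1.0, 0.0, 1.0, 0.0]   # pattern shifted in time

print(process_changed(reference, same_condition))   # similarity well above the cutoff
print(process_changed(reference, other_condition))  # similarity is 0, below the cutoff
```

In practice the cutoff would need to be tuned on signals from the specific mold and sensor setup.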

Analysis of Image Similarity Index of Woven Fabrics and Virtual Fabrics - Application of Textile Design CAD System and Shuttle Loom - (직물과 가상소재의 화상 유사성 분석 연구 - 수직기 및 텍스타일 CAD시스템 활용 -)

  • Yoon, Jung-Won;Kim, Jong-Jun
    • Fashion & Textile Research Journal / v.15 no.6 / pp.1010-1017 / 2013
  • The global textile and fashion industries have gradually shifted their focus to high value-added, high-sensibility, and multi-functional products based on new human-friendly and sustainable growth technologies. Textile design CAD systems have been developed in conjunction with advances in computer hardware and software. This study compares the patterns or images of actual woven fabrics with those of virtual fabrics prepared with a textile design CAD system. Several weave structures (such as fancy yarn weaves and patterns) were prepared with a shuttle loom, and images of the woven textiles were taken using a CCD camera. The same weave structure data and yarn data were fed into a textile design CAD system in order to simulate fabric images as closely as possible. Similarity index analysis methods were used to compare each actual fabric specimen with the simulated image of the corresponding fabric. The results showed that weaves with small repeated patterns yield higher similarity index values than fancy yarn weaves, which show some irregularities due to the attributes of fancy yarn. The Complex Wavelet Structural Similarity (CW-SSIM) index gave better values than other methods, such as Multi-Scale (MS) SSIM and Feature Similarity (FS) SSIM, across the fabric specimen images. A correlation analysis between the image-based similarity index and a similarity evaluation by panel members was also carried out.