• 제목/요약/키워드: Similarity index

검색결과 657건 처리시간 0.032초

A Study on the Performance of Similarity Indices and its Relationship with Link Prediction: a Two-State Random Network Case

  • Ahn, Min-Woo;Jung, Woo-Sung
    • Journal of the Korean Physical Society
    • /
    • 제73권10호
    • /
    • pp.1589-1595
    • /
    • 2018
  • Similarity index measures the topological proximity of node pairs in a complex network. Numerous similarity indices have been defined and investigated, but the dependency of structure on the performance of similarity indices has not been sufficiently investigated. In this study, we investigated the relationship between the performance of similarity indices and structural properties of a network by employing a two-state random network. A node in a two-state network has binary types that are initially given, and a connection probability is determined from the state of the node pair. The performances of similarity indices are affected by the number of links and the ratio of intra-connections to inter-connections. Similarity indices have different characteristics depending on their type. Local indices perform well in small-size networks and do not depend on whether the structure is intra-dominant or inter-dominant. In contrast, global indices perform better in large-size networks, and some such indices do not perform well in an inter-dominant structure. We also found that link prediction performance and the performance of similarity are correlated in both model networks and empirical networks. This relationship implies that link prediction performance can be used as an approximation for the performance of the similarity index when information about node type is unavailable. This relationship may help to find the appropriate index for given networks.

협력필터링의 데이터 희소성 해결을 위한 자카드 지수 반영의 유사도 성능 분석 (Performance Analysis of Similarity Reflecting Jaccard Index for Solving Data Sparsity in Collaborative Filtering)

  • 이수정
    • 컴퓨터교육학회논문지
    • /
    • 제19권4호
    • /
    • pp.59-66
    • /
    • 2016
  • 협력 필터링 시스템에서 데이터 희소성 문제의 해결을 위해 공통평가항목수를 반영하는 방법이 연구되었다. 이러한 방법으로 널리 알려진 자카드 지수는 기존의 유사도 척도와 결합되어 성능을 개선할 수 있었다. 그러나, 다양한 데이터 환경에서 여러 유사도 척도들과 각각 결합했을 때의 성능 개선 효과에 대한 분석 연구는 미미하므로, 본 연구는 이에 대한 분석을 목적으로 한다. 우선 자카드 지수 자체를 유사도 척도로 사용했을때 희소한 데이터셋 상에서 전통적인 척도들보다 월등한 예측 성능을 보였고 추천 성능도 매우 우수하였다. 자카드 지수를 결합함으로써 기존 유사도 척도는 데이터 특성에 상관없이 성능이 대개 향상되었고, 특히 코사인 유사도는 희소한 데이터셋에서 가장 큰 향상을 이루었으나, 평균차이 제곱(Mean Squared Difference)의 유사도는 밀집된 데이터셋에서 오히려 저하된 예측 성능을 보였다. 따라서, 자카드 지수를 결합하여 사용하기 위해 데이터 환경 특성과 유사도 척도를 고려할 필요가 있다.

직물과 가상소재의 화상 유사성 분석 연구 - 수직기 및 텍스타일 CAD시스템 활용 - (Analysis of Image Similarity Index of Woven Fabrics and Virtual Fabrics - Application of Textile Design CAD System and Shuttle Loom -)

  • 윤정원;김종준
    • 한국의류산업학회지
    • /
    • 제15권6호
    • /
    • pp.1010-1017
    • /
    • 2013
  • Current global textiles and fashion industries have gradually shifted focus to high value-added, high sensibility, and multi-functional products based on new human-friendliness and sustainable growth technologies. Textile design CAD systems have been developed in conjunction with computer hardware and software sector advances. This study compares the patterns or images of actual woven fabrics and virtual fabrics prepared with a textile design CAD system. In this study, several weave structures (such as fancy yarn weave and patterns) were prepared with a shuttle loom. The woven textile images were taken using a CCD camera. The same weave structure data and yarn data were fed into a textile design CAD system in order to simulate fabric images as similarly as possible. Similarity Index analysis methods allowed for an analysis of the index between the actual fabric specimen and the simulated image of the corresponding fabric. The results showed that repeated small pattern weaves provide superior similarity index values than those of a fancy yarn weave that indicate some irregularities due to fancy yarn attributes. A Complex Wavelet Structural Similarity(CW-SSIM) index resulted in a better index than other methods such as Multi-Scale(MS) SSIM, and Feature Similarity(FS) SSIM, across fabric specimen images. A correlation analysis of the similarity index based on an image analysis and a similarity evaluation by panel members was also implemented.

한.일의 대미 수출경쟁력에 관한 연구 (An Analysis of Export Competitiveness of Korea and Japan in the USA)

  • 심재희
    • 통상정보연구
    • /
    • 제11권1호
    • /
    • pp.139-155
    • /
    • 2009
  • This study investigates empirically the export competitiveness of Korea and Japan in America by calculating 4 indexes such as market share index(MSI), export similarity index(ESI), market comparative adventage index(MCAI) and market share expansion ratio(MSER)-export similarity deepening ratio(ESDR). The empirical finding of this analysis shows that Korea is competitive in the labor-intensive products and Japan in the technology-intensive products. This result also meets the general understandings that Japan is superior to Korea in the export competitiveness such as value added of goods, etc. Therefore, in order to strengthen the export competitiveness of Korea in the US market, it's desirable for our firms and government to improve the quality of product ranges by developing technologies focused on the higher value-added products.

  • PDF

GB-색인: 고차원 데이타의 복합 유사 질의 및 적합성 피드백을 위한 색인 기법 (GB-Index: An Indexing Method for High Dimensional Complex Similarity Queries with Relevance Feedback)

  • 차광호
    • 한국정보과학회논문지:데이타베이스
    • /
    • 제32권4호
    • /
    • pp.362-371
    • /
    • 2005
  • 멀티미디어 데이타베이스와 같은 고차원 응용에서 유사 색인과 검색은 어려운 문제이며, 특히, 다수의 특성을 함께 색인하는 경우에는 더욱 어렵다. 본 논문에서는 고차원 이미지 데이타베이스에서 복합 유사 질의 및 적합성 피드백을 효율적으로 처리하기 위한 새로운 색인 기법인 GB-색인을 제시한다. GB-색인은 각 특성 차원을 독립적으로 처리함으로써 다수의 특성과 다수의 질의 객체를 유연하게 제어한다. 아울러, 비트맵 색인을 통해 데이타베이스에 있는 모든 객체를 비트맵의 집합으로 표현하여 질의를 효율적으로 처리한다. GB-색인의 기술적인 주된 공헌은 다음과 같다: (1) 고차원 데이타를 위한 효율적인 색인, (2) 효율적인 복합 유사 질의 처리, (3) 적합성 피드백을 위한 분리형 질의의 효과적 처리. 실험 결과에 따르면 GB-색인은 순차 탐색 및 VA-파일에 비해 큰 성능 향상을 보였다.

재구성된 광간섭단층 영상의 구조적 유사성을 이용한 수치 목표 평가 (Numerical Objective Assessment Using Structural Similarity for Diffuse Optical Reconstructed Images)

  • 비키 무댕;최세운
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2021년도 추계학술대회
    • /
    • pp.658-660
    • /
    • 2021
  • 본 연구의 목표는 확산 광학 단층 촬영에 대한 기준 영상을 사용하여 동질성과 이질성을 분리하기 위한 재구성된 영상들간의 수치적 평가를 위해 구조적 유사성 지수에 기초한 알고리즘을 개발한다. 글로벌 지오메트리 및 관심 영역 평가는 유사성을 산출하기 위해 측정되었으며, 그 결과 구조적 유사성 지수의 평균이 모델 내부에 가시적 포함 여부를 판단할 수 있는 잠재적 성능을 나타낸다는 것을 알 수 있으며, 구조적 유사성 지수는 유방 구조 정보를 평가하기 위한 이미지 평가를 지원 가능한 것으로 확인 되었다.

  • PDF

Graph based KNN for Optimizing Index of News Articles

  • Jo, Taeho
    • Journal of Multimedia Information System
    • /
    • 제3권3호
    • /
    • pp.53-61
    • /
    • 2016
  • This research proposes the index optimization as a classification task and application of the graph based KNN. We need the index optimization as an important task for maximizing the information retrieval performance. And we try to solve the problems in encoding words into numerical vectors, such as huge dimensionality and sparse distribution, by encoding them into graphs as the alternative representations to numerical vectors. In this research, the index optimization is viewed as a classification task, the similarity measure between graphs is defined, and the KNN is modified into the graph based version based on the similarity measure, and it is applied to the index optimization task. As the benefits from this research, by modifying the KNN so, we expect the improvement of classification performance, more graphical representations of words which is inherent in graphs, the ability to trace more easily results from classifying words. In this research, we will validate empirically the proposed version in optimizing index on the two text collections: NewsPage.com and 20NewsGroups.

CBR을 이용한 Setup Planning에서의 Similarity Index 결정에 관한 연구 (A Study on the Case-Based Reasoning Setup Planning: Focused on the Similarity Index)

  • 한만철;박선주;하성도
    • 한국정밀공학회지
    • /
    • 제23권9호
    • /
    • pp.119-126
    • /
    • 2006
  • This paper addresses the methodology development far the automated machining setup planning system using case-based reasoning(CBR). The case-based reasoning is used to develop a setup planning system. which consists of part input and representation module, case retrieval module, and case adaptation module. We present new approaches in the part input and representation module and the case retrieval module focusing on the similarity index determination. An illustrative example is included to demonstrate the proposed method.

품질지표기반 정치 후원금 지원을 위한 국회의원 추천시스템 연구 (Quality Indicator Based Recommendation System of the National Assembly Members for Political Sponsors)

  • 정현우;윤형준;이시은;박솔희;손소영
    • 품질경영학회지
    • /
    • 제49권1호
    • /
    • pp.17-29
    • /
    • 2021
  • Purpose: During 2015-2019, the average amount of political donation to the national assembly members in Korea was 1,000 won per person. Despite its benefits such as receiving tax credits, the donation system has not been actively practiced. This paper aims to promote political donations by suggesting a recommendation system of national assembly members by analysing the bills they proposed. Methods: In this paper, we propose a recommendation system based on two aspects: how similar the newly proposed or ammended bills are to the sponsors' interest (similarity index) and how much effort national assembly members put into those bills (intensity index). More than 25,000 bills were used to measure the recommendation quality index consisted with both the similarity and the intensity indices. Word2vec was used to calculate the similarity index of the bills proposed by the national assembly member to the sponsor's interest. The intensity index is calculated by diving the number of newly proposed or entirely revised bills with the number of senators who took part in those bills. Subsequently, we multiply the similarity index by the intensity index to obtain the recommendation quality index that can assist sponsors to identify potential assembly members for their donation. Results: We apply the proposed recommendation system to personas for illustration. The recommendation system showed an average f1 score about 0.69. The analysis results provide insights in recommendation for donation. Conclusion: n this study, the recommendation system was proposed to promote a political donation for national assembly members by creating the recommendation quality index based on the similarity and the intensity indices. We expect that the system presented in this paper will lower user barriers to political information, thereby boosting political sponsorship and increasing political participation.

동물 및 임상 시험의 시계열 프로파일 데이터 비교를 위한 유사성 지수 개발 (Development of a New Similarity Index to Compare Time-series Profile Data for Animal and Human Experiments)

  • 이예경;이현정;장현애;신상문
    • 품질경영학회지
    • /
    • 제49권2호
    • /
    • pp.145-159
    • /
    • 2021
  • Purpose: A statistical similarity evaluation to compare pharmacokinetics(PK) profile data between nonclinical and clinical experiments has become a significant issue on many drug development processes. This study proposes a new similarity index by considering important parameters, such as the area under the curve(AUC) and the time-series profile of various PK data. Methods: In this study, a new profile similarity index(PSI) by using the concept of a process capability index(Cp) is proposed in order to investigate the most similar animal PK profile compared to the target(i.e., Human PK profile). The proposed PSI can be calculated geometric and arithmetic means of all short term similarity indices at all time points on time-series both animal and human PK data. Designed simulation approaches are demonstrated for a verification purpose. Results: Two different simulation studies are conducted by considering three variances(i.e., small, medium, and large variances) as well as three different characteristic types(smaller the better, larger the better, nominal the best). By using the proposed PSI, the most similar animal PK profile compare to the target human PK profile can be obtained in the simulation studies. In addition, a case study represents differentiated results compare to existing simple statistical analysis methods(i.e., root mean squared error and quality loss). Conclusion: The proposed PSI can effectively estimate the level of similarity between animal, human PK profiles. By using these PSI results, we can reduce the number of animal experiments because we only focus on the significant animal representing a high PSI value.