• Title/Abstract/Keywords: similarity calculation

208 search results, processing time 0.024 s

Clustering method for similar user with Mixed Data in SNS

  • Song, Hyoung-Min;Lee, Sang-Joon;Kwak, Ho-Young
    • 한국컴퓨터정보학회논문지 / Vol. 20, No. 11 / pp.25-30 / 2015
  • The enormous growth of data that has accompanied the development of information technology makes it hard for internet users to find information tailored to their needs. In this changing environment, information filtering methods, which provide sorted-out information to users, are becoming important. Data on the internet exist in various types. However, the similarity calculation algorithms frequently used in existing collaborative filtering methods tend to suit numeric data only. For categorical data, moreover, they yield extreme similarity values, as in Boolean algebra. In this paper, we compute the similarity of SNS users' information, which consists of mixed data, using Gower's similarity coefficient, and we suggest a method that is softer than the all-or-nothing values of 0 or 1 for categorical data. A clustering method using this algorithm can be utilized in SNS or in various recommendation systems.
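
To make the coefficient named in this abstract concrete, here is a minimal sketch of the standard Gower similarity for records that mix numeric and categorical attributes. It is not the paper's softened variant: categorical attributes still score exactly 0 or 1 here. The function name, the `kinds` and `ranges` parameters, and the user profiles are assumptions made for illustration.

```python
def gower_similarity(a, b, kinds, ranges):
    """Standard Gower similarity between two mixed-type records.

    kinds[k] is "num" or "cat"; ranges[k] is the range (max - min) of
    numeric attribute k over the data set (unused for categorical ones).
    """
    scores = []
    for x, y, kind, r in zip(a, b, kinds, ranges):
        if kind == "num":
            # Numeric: 1 minus the range-normalised absolute difference.
            scores.append(1.0 - abs(x - y) / r if r else 1.0)
        else:
            # Categorical: an exact match scores 1, anything else 0.
            scores.append(1.0 if x == y else 0.0)
    # Unweighted mean over all attributes.
    return sum(scores) / len(scores)


# Hypothetical user profiles: (age, city, posts per week)
kinds = ["num", "cat", "num"]
ranges = [40, None, 20]
u1 = (25, "Jeju", 10)
u2 = (35, "Jeju", 5)
print(gower_similarity(u1, u2, kinds, ranges))  # (0.75 + 1 + 0.75) / 3
```

The resulting pairwise similarities could then be fed, as `1 - similarity`, to any distance-based clustering routine.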

A Text Similarity Measurement Method Based on Singular Value Decomposition and Semantic Relevance

  • Li, Xu;Yao, Chunlong;Fan, Fenglong;Yu, Xiaoqiang
    • Journal of Information Processing Systems / Vol. 13, No. 4 / pp.863-875 / 2017
  • Traditional text similarity measurement methods based on word frequency vectors ignore the semantic relationships between words, which, together with the high dimensionality and sparsity of document vectors, has become an obstacle to text similarity calculation. To address these problems, an improved singular value decomposition is used to reduce the dimensionality and remove noise from the text representation model. The optimal number of singular values is analyzed, and the semantic relevance between words can be calculated in the constructed semantic space. An inverted index construction algorithm and similarity definitions between vectors are proposed to calculate the similarity between two documents at the semantic level. Experimental results on a benchmark corpus demonstrate that the proposed method improves the F-measure.
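
The general idea of comparing documents in an SVD-reduced semantic space can be sketched as follows. This is plain truncated SVD (latent semantic analysis), not the improved decomposition or the inverted index proposed in the paper. The function name, the toy term-document matrix and the choice `k=2` are assumptions for illustration.

```python
import numpy as np


def lsa_document_similarity(term_doc, k):
    """Cosine similarity between documents in a rank-k SVD space."""
    # term_doc has one row per term and one column per document.
    U, s, Vt = np.linalg.svd(term_doc, full_matrices=False)
    # Document coordinates in the k-dimensional latent space (one row each).
    docs = (np.diag(s[:k]) @ Vt[:k]).T
    norms = np.linalg.norm(docs, axis=1, keepdims=True)
    docs = docs / np.where(norms == 0, 1.0, norms)
    return docs @ docs.T


# Toy counts. Rows: car, auto, engine, flower. Columns: doc0, doc1, doc2.
term_doc = np.array([
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [1.0, 1.0, 0.0],
    [0.0, 0.0, 2.0],
])
sim = lsa_document_similarity(term_doc, k=2)
# doc0 and doc1 share only "engine" (raw cosine 0.5), yet they land on the
# same latent direction, while doc2 stays orthogonal to both.
print(np.round(sim, 3))
```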

PCB Board Impedance Analysis Using Similarity Transform for Transmission Matrix

  • 서영석
    • 한국정보통신학회논문지 / Vol. 13, No. 10 / pp.2052-2058 / 2009
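  • As the operating frequency of digital systems increases and the voltage swing decreases, accurate and fast analysis of PCB boards has become important. The transmission matrix method, which uses repeated multiplication of unit-column matrices, is the fastest method for PCB board analysis. This paper proposes a new method for calculating PCB board impedance. First, the eigenvalues and eigenvectors of the transmission matrix for a unit column of the PCB are computed, and the unit-column transmission matrix is then transformed by a matrix similarity transform to reduce the number of multiplications of matrix elements. This similarity transform method can greatly reduce computation time compared with the existing method. The proposed method was applied to a PCB measuring 1.3 inches wide by 1.9 inches long and reduced computation time by roughly a factor of 10. The proposed method can be applied to PCB design that requires repeated calculation of board impedance.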
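
The saving from a similarity transform comes from a standard linear-algebra identity: if a matrix is diagonalisable as T = V diag(w) V^-1, then T^n = V diag(w^n) V^-1, so repeated matrix products reduce to powers of scalars. A minimal sketch of that identity follows, assuming a small hypothetical 2x2 matrix rather than a real PCB transmission matrix. The function name is also an assumption.

```python
import numpy as np


def matrix_power_by_eig(T, n):
    """Compute T^n through the similarity transform T = V diag(w) V^-1.

    Assumes T is diagonalisable.
    """
    w, V = np.linalg.eig(T)
    # Only the eigenvalues are raised to the n-th power.
    return V @ np.diag(w ** n) @ np.linalg.inv(V)


# Hypothetical stand-in for a unit-column matrix.
T = np.array([[2.0, 1.0],
              [1.0, 3.0]])
direct = np.linalg.matrix_power(T, 20)   # repeated multiplication
via_eig = matrix_power_by_eig(T, 20)     # similarity-transform route
print(np.allclose(direct, via_eig))
```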

Similarity Relations of Resin Flow in Resin Transfer Molding Process

  • Um, Moon-Kwang;Byun, Joon-Hyung;Daniel, Isaac M.
    • Advanced Composite Materials / Vol. 18, No. 2 / pp.135-152 / 2009
  • Liquid molding processes, such as resin transfer molding, involve resin flow through a porous medium inside a mold cavity. Numerical analysis of resin flow and mold filling is a very useful means of optimizing the manufacturing process. However, the numerical analysis is quite time consuming and requires a great deal of effort, since a separate numerical calculation is needed for every set of material properties, part size and injection conditions. This effort can be appreciably reduced if similarity solutions are used instead of repeated numerical calculations. In this study, similarity relations for pressure, resin velocity and flow front propagation are proposed so that a desired case can be obtained from an existing numerical result. In other words, the model gives a correlation of flow-induced variables between two different cases. The model was verified by comparing results obtained by the similarity relation with those of an independent numerical simulation.

Assessment of performance of machine learning based similarities calculated for different English translations of Holy Quran

  • Al Ghamdi, Norah Mohammad;Khan, Muhammad Badruddin
    • International Journal of Computer Science & Network Security / Vol. 22, No. 4 / pp.111-118 / 2022
  • This research article presents work on applying different machine learning based similarity techniques to religious text in order to identify similarities and differences among its various translations. The dataset includes 10 different English translations of the verses (Arabic: Ayah) of two Surahs (chapters), namely Al-Humazah and An-Nasr. The quantitative similarity values for different translations of the same verse were calculated using cosine similarity and semantic similarity. The corpus went through two series of experiments: before pre-processing and after pre-processing. To determine the performance of the machine learning based similarities, human annotated similarities between translations of the two Surahs were recorded to construct the ground truth. The average difference between the human annotated similarity and the cosine similarity for Surah Al-Humazah was found to be 1.38 per verse (Ayah) per pair of translations. After pre-processing, the average difference increased to 2.24. Moreover, the average difference between the human annotated similarity and the semantic similarity for Surah Al-Humazah was found to be 0.09 per verse (Ayah) per pair of translations. After pre-processing, it increased to 0.78. For Surah An-Nasr, before pre-processing, the average difference between the human annotated similarity and the cosine similarity was found to be 1.93 per verse (Ayah) per pair of translations. After pre-processing, the average difference further increased to 2.47. The average difference between the human annotated similarity and the semantic similarity for Surah An-Nasr before pre-processing was found to be 0.93, and after pre-processing it was reduced to 0.87 per verse (Ayah) per pair of translations. The results showed that, as expected, semantic similarity is a better measure for calculating word meaning.
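
For readers unfamiliar with the cosine similarity used as one of the two measures in this abstract, a minimal bag-of-words sketch is shown below. The tokenisation (lower-casing and whitespace splitting), the function name and the sample sentences are assumptions, and the abstract does not say how the paper's vectors were built.

```python
import math
from collections import Counter


def cosine_similarity(text_a, text_b):
    """Cosine similarity between the word-count vectors of two texts."""
    a = Counter(text_a.lower().split())
    b = Counter(text_b.lower().split())
    # The dot product only needs the words both texts share.
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    norm = norm_a * norm_b
    return dot / norm if norm else 0.0


# Hypothetical sentences; three of four words are shared.
print(cosine_similarity("the quick brown fox", "the quick red fox"))  # 0.75
```

Because this measure only counts shared surface words, two translations that use different words for the same meaning score low, which is consistent with the abstract's finding that semantic similarity tracked the human annotations more closely.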

An Identification Method of Detrimental Video Images Using Color Space Features

  • 김성균;김창근;정대율
    • 한국산학기술학회논문지 / Vol. 12, No. 6 / pp.2807-2814 / 2011
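  • This paper develops an algorithm that identifies detrimental videos using color space features and verifies its efficiency through experiments. The identification algorithm is based on a 2D projection map. In extracting the color features of video images, the 2D projection map is applied to extract candidate frames effectively. The study first computes the similarity between the extracted frames and reference images using the proposed similarity calculation algorithm, identifies candidate frames of detrimental video through similarity evaluation, and then applies a threshold to make the final decision. Experiments with the proposed algorithm showed that, for finding detrimental videos, the technique based on the 2D projection map proposed in this study outperforms the color histogram in both computation speed and identification ability.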

A Study on Prescription Similarity Analysis for Efficiency Improvement

  • 黃秀敬;禹東賢;金基郁;李丙旭
    • 대한한의학원전학회지 / Vol. 35, No. 4 / pp.1-9 / 2022
  • Objectives : This study aims to increase the efficiency of the prescription similarity analysis method that uses drug composition ratio. Methods : A controlled experiment compared result generation time, generated data quantity, and accuracy of results between the previous and new analysis methods on 12,598 formulas and 61 prescription groups. Results : The control group took 346 seconds on average and generated 768,478 results, while the test group took 24 seconds and generated 241,739 results. The test group adopted a selective calculation method that used only the overlapping data between two formulas instead of analyzing all possible cases. This simplified the data processing and reduced the quantity of data to be processed, making the system up to 14.47 times faster than the previous analysis method with equal results. Conclusions : The efficiency of similarity analysis could be improved by reducing the data span and simplifying the calculation process.

Semantic Similarity Calculation based on Siamese TRAT

  • 육성잠;조인휘
    • 한국정보처리학회:학술대회논문집 / 한국정보처리학회 2021 Spring Conference / pp.397-400 / 2021
  • To solve the problem that existing computing methods cannot adequately represent the semantic features of sentences, Siamese TRAT, a semantic feature extraction model based on the Transformer encoder, is proposed. The Transformer model is used to fully extract the semantic information within sentences and to carry out deep semantic coding of sentences. In addition, an interactive attention mechanism is introduced to extract the similar features of the association between two sentences, which makes the model better at capturing the important semantic information inside a sentence. As a result, it improves the semantic understanding and generalization ability of the model. The experimental results show that the proposed model significantly improves accuracy on English and Chinese semantic similarity calculation tasks and is more effective than existing methods.

Similarity Measurement using Gabor Energy Feature and Mutual Information for Image Registration

  • Ye, Chul-Soo
    • 대한원격탐사학회지 / Vol. 27, No. 6 / pp.693-701 / 2011
  • Image registration is an essential process for analyzing time series of satellite images for the purpose of image fusion and change detection. Mutual Information (MI) is commonly used as a similarity measure for image registration because of its robustness to noise. Because of radiometric differences, it is not easy to apply MI to multi-temporal satellite images by directly using pixel intensity. Image features for MI are obtained more abundantly by employing a Gabor filter that varies adaptively with filter characteristics such as filter size, frequency and orientation for each pixel. In this paper we employed Bidirectional Gabor Filter Energy (BGFE), defined by Gabor filter features, and applied the BGFE to the similarity measure calculation as an image feature for MI. The experimental results show that the proposed method is more robust than the conventional MI method combined with intensity or gradient magnitude.
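
The conventional intensity-based MI that this abstract uses as its baseline can be estimated from a joint histogram, as in the sketch below. It does not implement the BGFE feature proposed in the paper. The function name, the bin count and the random test images are assumptions for illustration.

```python
import numpy as np


def mutual_information(img_a, img_b, bins=16):
    """Histogram estimate of mutual information (in nats) of two images."""
    joint, _, _ = np.histogram2d(img_a.ravel(), img_b.ravel(), bins=bins)
    pxy = joint / joint.sum()                 # joint distribution
    px = pxy.sum(axis=1, keepdims=True)       # marginal of img_a, (bins, 1)
    py = pxy.sum(axis=0, keepdims=True)       # marginal of img_b, (1, bins)
    nz = pxy > 0                              # skip empty cells, avoid log(0)
    return float(np.sum(pxy[nz] * np.log(pxy[nz] / (px @ py)[nz])))


# Hypothetical data: an image shares far more information with itself
# than with an unrelated random image of the same size.
rng = np.random.default_rng(0)
a = rng.random((64, 64))
b = rng.random((64, 64))
print(mutual_information(a, a) > mutual_information(a, b))
```

In a registration loop, one image would be shifted or warped over a range of candidate transforms and the transform that maximises this score would be kept.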

The study of the relationship between the similarity of cognitive map and the mental workload

  • 유승동;박범
    • 대한인간공학회지 / Vol. 21, No. 3 / pp.47-58 / 2002
  • The similarity of interface shape between the human cognitive map and the real product is an important factor in determining human performance. Nevertheless, the degree of similarity between the two has not been defined quantitatively in recent studies. Therefore, in this study, the cognitive map and the mental workload were measured by SMM (Sketch Map Method) and RNASA-TLX (Revision of NASA-Task Load Index). A numerical expression of the accuracy point was also suggested for the quantitative calculation of relative positional similarity between the cognitive map and the real product. Nine subjects participated in the experiment, and two kinds of vehicles were used. Mental workload was measured immediately after the road test. The analysis of the relationship between accuracy and mental workload shows that a negative correlation exists for each vehicle, and that, of the two vehicles, the one with the higher accuracy score has the lower mental workload score.