• Title/Summary/Keyword: search similarity

Search Result 530, Processing Time 0.027 seconds

Efficient Searching Technique for Nearest Neighbor Object in High-Dimensional Data (고차원 데이터의 효율적인 최근접 객체 검색 기법)

  • Kim, Jin-Ho;Park, Young-Bae
    • The KIPS Transactions:PartD
    • /
    • v.11D no.2
    • /
    • pp.269-280
    • /
    • 2004
  • The Pyramid-Technique is based on mapping n-dimensional space data into one-dimensional data and expresses it as a B+-tree. By solving the problem of search time complexity the pyramid technique also prevents the effect of "phenomenon of dimensional curse" which is caused by treatment of hypercube range query in n-dimensional data space. The SPY-TEC applies the space division strategy in pyramid method and uses spherical range query suitable for similarity search so that Improves the search performance. However, nearest neighbor query is more efficient than range query because it is difficult to specify range in similarity search. Previously proposed index methods perform well only in the specific distribution of data. In this paper, we propose an efficient searching technique for nearest neighbor object using PdR-Tree suggested to improve the search performance for high dimensional data such as multimedia data. Test results, which uses simulation data with various distribution as well as real data, demonstrate that PdR-Tree surpasses both the Pyramid-Technique and SPY-TEC in views of search performance.rformance.

Similarity-Based Subsequence Search in Image Sequence Databases (이미지 시퀀스 데이터베이스에서의 유사성 기반 서브시퀀스 검색)

  • Kim, In-Bum;Park, Sang-Hyun
    • The KIPS Transactions:PartD
    • /
    • v.10D no.3
    • /
    • pp.501-512
    • /
    • 2003
  • This paper proposes an indexing technique for fast retrieval of similar image subsequences using the multi-dimensional time warping distance. The time warping distance is a more suitable similarity measure than Lp distance in many applications where sequences may be of different lengths and/or different sampling rates. Our indexing scheme employs a disk-based suffix tree as an index structure and uses a lower-bound distance function to filter out dissimilar subsequences without false dismissals. It applies the normaliration for an easier control of relative weighting of feature dimensions and the discretization to compress the index tree. Experiments on medical and synthetic image sequences verify that the proposed method significantly outperforms the naive method and scales well in a large volume of image sequence databases.

A Design for Efficient Similar Subsequence Search with a Priority Queue and Suffix Tree in Image Sequence Databases (이미지 시퀀스 데이터베이스에서 우선순위 큐와 접미어 트리를 이용한 효율적인 유사 서브시퀀스 검색의 설계)

  • 김인범
    • Journal of the Korea Computer Industry Society
    • /
    • v.4 no.4
    • /
    • pp.613-624
    • /
    • 2003
  • This paper proposes a design for efficient and accurate retrieval of similar image subsequences using the multi-dimensional time warping distance as similarity evaluation tool in image sequence database after building of two indexing structures implemented with priority queue and suffix tree respectively. Receiving query image sequence, at first step, the proposed method searches the candidate set of similar image subsequences in priory queue index structure. If it can not get satisfied results, it retrieves another candidate set in suffix tree index structure at second step. The using of the low-bound distance function can remove the dissimilar subsequence without false dismissals during similarity evaluating process between query image sequence and stored sequences in two index structures.

  • PDF

Sequence Data Indexing Method based on Minimum DTW Distance (최소 DTW 거리 기반의 데이터 시퀀스 색인 기법)

  • Khil, Ki-Jeong;Song, Seok-Il;Song, Chai-Jong;Lee, Seok-Pil;Jang, Sei-Jin;Lee, Jong-Seol
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.12
    • /
    • pp.52-59
    • /
    • 2011
  • In this paper, we propose an indexing method to support efficient similarity search for sequence databases. We present a new distance measurement called minimum DTW distance to enhance the filtering effects. The minimum DTW distance is to measure the minimum distance between a sequence data and the group of similar sequences. It enables similarity search through hierarchical index structure by filtering sequence databases. Finally, we show the superiority of our method through some experiments.

Similarity Measurement of Part Specifications based on Ontology and ELECTRE IS (온톨로지와 ELECTRE IS을 활용한 사양 기반 부품 유사도 측정 방법)

  • Mun, Du-Hwan;Hwang, Ho-Jin
    • Korean Journal of Computational Design and Engineering
    • /
    • v.15 no.2
    • /
    • pp.144-156
    • /
    • 2010
  • When existing parts are re-used for the development of a new product or business-to-business transactions, a method for searching parts from a part database that meets user's requirements is necessary. To this end, it is important to develop a part search method which is able to measure similarity between parts and user's input data with generality as well as robustness. In this paper, the authors suggest a method for measuring part similarity using ontology and multi-criteria decision making method and address its technical details. The proposed method ensures the interoperability with existing engineering information management systems, represents part specifications systematically, and has generality in the procedure for comparing part specifications. The case study for ejector pins having been conducted to demonstrate the proposed method is also discussed.

Feature-based Similarity Assessment for Re-using CAD Models (CAD 모델 재사용을 위한 특징형상기반 유사도 측정에 관한 연구)

  • Park, Byoung-Keon;Kim, Jay-Jung
    • Korean Journal of Computational Design and Engineering
    • /
    • v.16 no.1
    • /
    • pp.21-30
    • /
    • 2011
  • Similarity assessment of a CAD model is one of important issues from the aspect of model re-using. In real practice, many new mechanical parts are designed by modifying existing ones. The reuse of part enables to save design time and efforts for the designers. Design time would be further reduced if there were an efficient way to search for existing similar designs. This paper proposes an efficient algorithm of similarity assessment for mechanical part model with design history embedded within the CAD model. Since it is possible to retrieve the design history and detailed-feature information using CAD API, we can obtain an accurate and reliable assessment result. For our purpose, our assessment algorithm can be divided by two: (1) we select suitable parts by comparing MSG (Model Signature Graph) extracted from a base feature of the required model; (2) detailed-features' similarities are assessed with their own attributes and reference structures. In addition, we also propose a indexing method for managing a model database in the last part of this article.

Development of Similarity-Based Document Clustering System (유사성 계수에 의한 문서 클러스터링 시스템 개발)

  • Woo Hoon-Shik;Yim Dong-Soon
    • Proceedings of the Society of Korea Industrial and System Engineering Conference
    • /
    • 2002.05a
    • /
    • pp.119-124
    • /
    • 2002
  • Clustering of data is of a great interest in many data mining applications. In the field of document clustering, a document is represented as a data in a high dimensional space. Therefore, the document clustering can be accomplished with a general data clustering techniques. In this paper, we introduce a document clustering system based on similarity among documents. The developed system consists of three functions: 1) gatherings documents utilizing a search agent; 2) determining similarity coefficients between any two documents from term frequencies; 3) clustering documents with similarity coefficients. Especially, the document clustering is accomplished by a hybrid algorithm utilizing genetic and K-Means methods.

  • PDF

Design of Spatial Similarity Measure for Moving Object Trajectories in Spatial Network (공간 네트워크에서 이동객체 궤적을 위한 공간 유사도 측정방법의 설계)

  • Bistao, Rabindra;Chang, Jae-Woo
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2006.10c
    • /
    • pp.83-87
    • /
    • 2006
  • Similarity search in moving object trajectories is an active area of research. In this paper, we introduce a new concept of measure that computes spatial distance (similarity) between two trajectories of moving objects on road networks. In addition, we propose an algorithm that generates a sequence of matching edge pairs for two trajectories that ate to be compared and computes spatial distance between them which is non Euclidian in nature. With an example, we explain how our algorithm works to show spatial similarity between trajectories of moving objects in spatial network.

  • PDF

Development of A Web Mining System Based On Document Similarity (문서 유사도 기반의 웹 마이닝 시스템 개발)

  • 이강찬;민재홍;박기식;임동순;우훈식
    • The Journal of Society for e-Business Studies
    • /
    • v.7 no.1
    • /
    • pp.75-86
    • /
    • 2002
  • In this study, we proposed design issues and structure of a web mining system and develop a system for the purpose of knowledge integration under world wide web environments resulted from our developing experiences. The developed system consists of three main functions: 1) gathering documents utilizing a search agent; 2) determining similarity coefficients between any two documents from term frequencies; 3) clustering documents based on similarity coefficients. It is believed that the developed system can be utilized for discovery of knowledge in relatively narrow domains such as news classification, index term generation in knowledge management.

  • PDF

Exploratory Methodology for Acquiring Architectural Plans Based on Spatial Graph Similarity

  • Ham, Sungil;Chang, Seongju;Suh, Dongjun;Narangerel, Amartuvshin
    • Architectural research
    • /
    • v.17 no.2
    • /
    • pp.57-64
    • /
    • 2015
  • In architectural planning, previous cases of similar spatial program provide important data for architectural design. Case-based reasoning (CBR) paradigm in the field of architectural design is closely related to the designing behavior of a planner who makes use of similar architectural designs and spatial programs in the past. In CBR, spatial graph can be constituted with most fundamental data, which can provide a method of searching spatial program by using visual graphs. This study developed a system for CBR that can analyze the similarity through graph comparison and search for buildings. This is an integrated system that is able to compare space similarity of different buildings and analyze their types, in addition to the analysis on a space within a single structure.