• Title/Summary/Keyword: 매칭 효율성

Search Result 227, Processing Time 0.021 seconds

A Study on Preprocessing Method for Effective Semantic-based Similarity Measures using Approximate Matching Algorithm (의미적 유사성의 효과적 탐지를 위한 데이터 전처리 연구)

  • Kang, Hari;Jeong, Doowon;Lee, Sangjin
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.25 no.3
    • /
    • pp.595-602
    • /
    • 2015
  • One of the challenges of the digital forensics is how to handle certain amounts of data efficiently. Although reliable and various approximate matching algorithms have been presented to quickly identify similarities between digital objects, its practical effectiveness to identify the semantic similarity is low because of frequent false positives. To solve this problem, we suggest adding a pre-processing of the approximate matching target dataset to increase matching accuracy while maintaining the reliability of the approximate matching algorithm. To verify the effectiveness, we experimented with two datasets of eml and hwp using sdhash in order to identify the semantic similarity.

Efficient Time-Series Subsequence Matching using Duality in Constructing Windows (윈도우를 구성하는 방법의 이원성을 이용한 효율적인 시계열 서부시퀀스 매칭)

  • Mun, Yang-Se;No, Ung-Gi;Hwang, Gyu-Yeong
    • Journal of KIISE:Databases
    • /
    • v.28 no.1
    • /
    • pp.15-30
    • /
    • 2001
  • 서브시퀀스 매칭은 질의 시퀀스와 유사한 서브시퀀스를 가지는 데이터 시퀀스와 해당 서브시퀀스의 위치를 찾는 문제이다. 본 논문에서는 윈도우를 구성하는 방법의 이원성을 이용한 새로운 서부시퀀스 매칭 방법인 Dual-Match는 윈도우를 구성하는 방법에 있어서 Faloutsos 등이 사용한 방법(간단히 FRM 이라한다)의 이원적 접근법이다. 즉, FRM에서는 데이터 시퀀스를 슬라이딩 윈도우로 나누고 질의 시퀀스를 디스조인트 윈도우로 나누는 방법을 사용한 반면, Dual-Match에서는 데이터 시퀀스를 디스조이트 윈도우로 나누고 질의 시퀀스를 슬라이딩 윈도우로 나누는 방법을 사용한다. FRM은 색인에 필요한 저장공간을 줄이기 위하여 개별 점 대신 최소 포함 사각형만을 저장함으로 인하여 많은 착오해답(유사하지 않은 후보 서브시퀀스)을 발생시켰다. Dual-Match는 FRM과 비슷한 크기의 저장공간에 개별 점을 직접 저장함으로써 이 문제를 해결한다. 실험결과, Dual-Match는 많은 경우에 있어서 FRM에 비하여 후보 개수를 크게 줄이고 성능을 향상시켰다. 특히, 선택률이 낮은 경우($10^{-4}$이하)에는 후보 개수를 최대 8800배 까지 줄이고, 페이지 액세스 횟수를 최대 26.9배까지 줄였으며, 성능을 최대 430배까지 향상시켰다. 또한, 동일한 크기의 색인을 생성하는데 있어서 Dual-Match는 FRM보다 4.10~25.6배 빠르게 색인을 구성하였다. 이는 색인 구성시에 CPU 오버헤드의 많은 부분을 차지하는 저차원 변환의 횟수를 FRM에 비해 크게 줄이기 때문이다. 이 같은 결과로 볼 때, Dual-Match는 대용량 데이터베이스에 대한 서부시퀀스 매칭의 성능을 크게 향상시킬 수 있는 획기적인 연구 결과라 믿는다.

  • PDF

FiST: XML Document Filtering by Sequencing Twig Patterns (가지형 패턴의 시퀀스화를 이용한 XML 문서 필터링)

  • Kwon Joon-Ho;Rao Praveen;Moon Bong-Ki;Lee Suk-Ho
    • Journal of KIISE:Databases
    • /
    • v.33 no.4
    • /
    • pp.423-436
    • /
    • 2006
  • In recent years, publish-subscribe (pub-sub) systems based on XML document filtering have received much attention. In a typical pub-sub system, subscribing users specify their interest in profiles expressed in the XPath language, and each new content is matched against the user profiles so that the content is delivered only to the interested subscribers. As the number of subscribed users and their profiles can grow very large, the scalability of the system is critical to the success of pub-sub services. In this paper, we propose a novel scalable filtering system called FiST(Filtering by Sequencing Twigs) that transforms twig patterns expressed in XPath and XML documents into sequences using Prufer's method. As a consequence, instead of matching linear paths of twig patterns individually and merging the matches during post-processing, FiST performs holistic matching of twig patterns with incoming documents. FiST organizes the sequences into a dynamic hash based index for efficient filtering. We demonstrate that our holistic matching approach yields lower filtering cost and good scalability under various situations.

A study on Heterogeneous Policy Effects Based on Propensity Score Quartiles (성향점수 분위에 따른 이질적 정책효과 분석: 소재·부품·장비 R&D지원사업을 중심으로)

  • Gyeonghee Shin;Heesang Lee
    • Journal of Technology Innovation
    • /
    • v.32 no.2
    • /
    • pp.1-21
    • /
    • 2024
  • This study aims to estimate policy effects that appear heterogeneously within the beneficiary group by matching the beneficiary and non-beneficiary groups based on propensity score quartiles and analyzing the effect of policy benefits on sales growth. To achieve this, 239 SMEs that participated in R&D support program for the manufacturing of materials, components, and equipment in 2020 were selected as the beneficiary group. The propensity scores of these companies were calculated and divided into eight quartiles for matching between the non-beneficiary and beneficiary groups. Subsequently, double difference analysis was used to calculate the sales growth rate attributable to policy support. The study found that the largest policy effect was observed in the lowest quartile group, and companies with high patent application rates and 3-year sales growth rates experienced significant policy effects. These findings suggest that propensity score quartile-based analysis can be effectively utilized to refine the criteria for selecting beneficiary companies and the scope of industrial policy support.

Correspondence Matching of Stereo Images by Sampling of Planar Region in the Scene Based on RANSAC (RANSAC에 기초한 화면내 평면 영역 샘플링에 의한 스테레오 화상의 대응 매칭)

  • Jung, Nam-Chae
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.12 no.4
    • /
    • pp.242-249
    • /
    • 2011
  • In this paper, the correspondence matching method of stereo images was proposed by means of sampling projective transformation matrix in planar region of scene. Though this study is based on RANSAC, it does not use uniform distribution by random sampling in RANSAC, but use multi non-uniform computed from difference in positions of feature point of image or templates matching. The existing matching method sampled that the correspondence is presumed to correct by use of the condition which the correct correspondence is almost satisfying, and applied RANSAC by matching the correspondence into one to one, but by sampling in stages in multi probability distribution computed for image in the proposed method, the correct correspondence of high probability can be sampled among multi correspondence candidates effectively. In the result, we could obtain many correct correspondence and verify effectiveness of the proposed method in the simulation and experiment of real images.

Matching Method of Digital Map and POI for Geospatial Web Platform (공간정보 플랫폼 구축을 위한 전자지도와 POI 정보의 매칭 방법)

  • Kim, Jung-Ok;Huh, Yong;Lee, Won-Hee;Yu, Ki-Yun
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.17 no.4
    • /
    • pp.23-29
    • /
    • 2009
  • Recent growth of the geospatial information on the Web has made it possible to easily access a wide variety of geospatial information. An integration of different geospatial objects consists of the following three steps; extracting geospatial objects from the maps, converting the coordinate system and discovering pairs of objects that represent the same real-world entity in the two maps. This paper deals mainly with the third step to correspond conjugate objects and four matching types and criteria is presented. The techniques designed and developed can be utilized to efficiently integrate distributed heterogeneous spatial databases such as the digital maps and POIs from other data sources. To achieve the goal, we presented four types and criteria for the matching schema. The main contributions of this paper are as follows. A complete process of integrating data from maps on the Web is presented. Then, we show how attributes of the objects can be used in the integration process.

  • PDF

New Illumination compensation algorithm improving a multi-view video coding performance by advancing its temporal and inter-view correlation (다시점 비디오의 시공간적 중복도를 높여 부호화 성능을 향상시키는 새로운 조명 불일치 보상 기법)

  • Lee, Dong-Seok;Yoo, Ji-Sang
    • Journal of Broadcast Engineering
    • /
    • v.15 no.6
    • /
    • pp.768-782
    • /
    • 2010
  • Because of the different shooting position between multi-view cameras and the imperfect camera calibration, Illumination mismatches of multi-view video can happen. This variation can bring about the performance decrease of multi-view video coding(MVC) algorithm. A histogram matching algorithm can be applied to recompensate these inconsistencies in a prefiltering step. Once all camera frames of a multi-view sequence are adjusted to a predefined reference through the histogram matching, the coding efficiency of MVC is improved. However the histogram distribution can be different not only between neighboring views but also between sequential views on account of movements of camera angle and some objects, especially human. Therefore the histogram matching algorithm which references all frames in chose view is not appropriate for compensating the illumination differences of these sequence. Thus we propose new algorithms both the image classification algorithm which is applied two criteria to improve the correlation between inter-view frames and the histogram matching which references and matches with a group of pictures(GOP) as a unit to advance the correlation between successive frames. Experimental results show that the compression ratio for the proposed algorithm is improved comparing with the conventional algorithms.

PrimeFilter: An Efficient XML Data Filtering based on Prime Number Indexing (PrimeFilter: 소수 인덱싱 기법에 기반한 효율적 XML 데이타 필터링)

  • Kim, Jae-Hoon;Kim, Sang-Wook;Park, Seog
    • Journal of KIISE:Databases
    • /
    • v.35 no.5
    • /
    • pp.421-431
    • /
    • 2008
  • Recently XML is becoming a de facto standard for online data exchange between heterogeneous systems and also the research of streaming XML data filtering comes into the spotlight. Since streaming XML data filtering technique needs rapid matching of queries with XML data, it is required that the query processing should be efficiently performed. Until now, most of researches focused only on partial sharing of path expressions or efficient predicate processing and they were work for time and space efficiency. However, if containment relationship between queries is previously calculated and the lowest level query is matched with XML data, we can easily get a result that high level queries can match with the XML data without any other processing. That is, using this containment technique can be another optimal solution for streaming XML data filtering. In this paper, we suggest an efficient XML data filtering based on prime number indexing and containment relationship between queries. Through some experimental results, we present that our suggested method has a better performance than the existing method. All experiments have shown that our method has a more than two times better performance even though each experiment has its own distinct test purpose.

Efficient Randomized Parallel Algorithms for the Matching Problem (매칭 문제를 위한 효율적인 랜덤 병렬 알고리즘)

  • U, Seong-Ho;Yang, Seong-Bong
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.26 no.10
    • /
    • pp.1258-1263
    • /
    • 1999
  • 본 논문에서는 CRCW(Concurrent Read Concurrent Write)와 CREW(Concurrent Read Exclusive Write) PRAM(Parallel Random Access Machine) 모델에서 무방향성 그래프 G=(V, E)의 극대 매칭을 구하기 위해 간결한 랜덤 병렬 알고리즘을 제안한다. CRCW PRAM 모델에서 m개의 선을 가진 그래프에 대해, 제안된 매칭 알고리즘은 m개의 프로세서 상에서 {{{{ OMICRON (log m)의 기대 수행 시간을 가진다. 또한 CRCW 알고리즘을 CREW PRAM 모델에서 구현한 CREW 알고리즘은 OMICRON (log^2 m)의 기대 수행 시간을 가지지만,OMICRON (m/logm) 개의 프로세서만을 가지고 수행될 수 있다.Abstract This paper presents simple randomized parallel algorithms for finding a maximal matching in an undirected graph G=(V, E) for the CRCW and CREW PRAM models. The algorithm for the CRCW model has {{{{ OMICRON (log m) expected running time using m processors, where m is the number of edges in G We also show that the CRCW algorithm can be implemented on a CREW PRAM. The CREW algorithm runs in {{{{ OMICRON (log^2 m) expected time, but it requires only OMICRON (m / log m) processors.

Design of Networked based Control Systems using Pade Approximation and Reduction of Higher-order Transfer Function (Pade 근사법과 전달함수 축소기법을 이용한 네트워크 기반 제어 시스템 설계)

  • Shin, Dong-Gi;Cho, Hyun-Cheol;Han, Hyun-Tae;Lee, Jin-Woo;Lee, Kwon-Soon
    • Proceedings of the KIEE Conference
    • /
    • 2008.04a
    • /
    • pp.39-40
    • /
    • 2008
  • 본 논문은 네트워크 기반 제어시스템(NCS: Networked Control Systems)모델매칭 제어기법을 적용한 효율적인 제어알고리즘을 제안한다. 비선형의 특징을 가지는 제어기 및 관측기의 시간지연을 Pade 근사법으로 선형화하여 선형시스템 이론을 적용한 모델매칭 제어기를 설계하였다. 또한, 제어기의 차수를 줄이기 위하여 저차시스템으로 근사화하여 그 타당성을 검증하였다. 제안한 제어알고리즘의 타당성 및 신뢰성을 검증하기 위하여 컴퓨터 시뮬레이션을 실시하였으며 기존의 PID 제어기법과 비교분석하였다.

  • PDF