DOI QR코드

DOI QR Code

Graph Compression by Identifying Recurring Subgraphs

  • Ahmed, Muhammad Ejaz (Dept. of Creative IT Engineering, POSTECH) ;
  • Lee, JeongHoon (Dept. of Creative IT Engineering, POSTECH) ;
  • Na, Inhyuk (Dept. of Creative IT Engineering, POSTECH) ;
  • Son, Sam (Dept. of Creative IT Engineering, POSTECH) ;
  • Han, Wook-Shin (Dept. of Creative IT Engineering/Dept. of Computer Science and Engineering, POSTECH)
  • Published : 2017.04.27

Abstract

Current graph mining algorithms suffers from performance issues when querying patterns are in increasingly massive network graphs. However, from our observation most data graphs inherently contains recurring semantic subgraphs/substructures. Most graph mining algorithms treat them as independent subgraphs and perform computations on them redundantly, which result in performance degradation when processing massive graphs. In this paper, we propose an algorithm which exploits these inherent recurring subgraphs/substructures to reduce graph sizes so that redundant computations performed by the traditional graph mining algorithms are reduced. Experimental results show that our graph compression approach achieve up to 69% reduction in graph sizes over the real datasets. Moreover, required time to construct the compressed graphs is also reasonably reduced.

Keywords