• Title/Summary/Keyword: Graph Databases

A Multilevel Workflow Graph Partitioning Scheme for Efficient Placement of Workflow Tasks (워크플로우 작업의 효율적인 배치를 위한 다단계 워크플로우 그래프 분할 기법)

  • 최경훈;손진현;김명호
    • Journal of KIISE:Databases
    • /
    • v.30 no.3
    • /
    • pp.310-319
    • /
    • 2003
  • Workflow is defined as the automation of a business process and consists of interrelated workflow tasks. Because many modern business processes may involve activities that are geographically distributed across different departments or organizations, workflow is inherently distributed. In distributed workflow systems, each workflow task performs its assigned role by using information resources placed at particular hosts, and then passes workflow execution control to the next tasks in the workflow definition. Hence, it is very important to allocate workflow tasks to hosts appropriately for high-performance workflow processing. In this paper, we propose a multilevel workflow graph partitioning scheme for efficient placement of workflow tasks. This method can improve the performance of workflow processing by minimizing the remote communication costs incurred during workflow execution.
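    As a rough illustration of the objective this abstract describes, the sketch below scores a task placement by summing the weights of workflow edges that cross hosts. It is not the paper's multilevel partitioning scheme; the function name, the edge weights, and the host labels are all assumptions made for the example.

```python
# Hypothetical names throughout; this is not the paper's partitioning algorithm.
def remote_communication_cost(edges, placement):
    """Sum the weights of control-flow edges whose endpoints sit on different hosts.

    edges: iterable of (task_u, task_v, weight) tuples
    placement: dict mapping each task to a host
    """
    return sum(w for u, v, w in edges if placement[u] != placement[v])


edges = [("t1", "t2", 5), ("t2", "t3", 2), ("t3", "t4", 7)]
placement_a = {"t1": "h1", "t2": "h1", "t3": "h2", "t4": "h2"}  # cuts only t2 -> t3
placement_b = {"t1": "h1", "t2": "h2", "t3": "h1", "t4": "h2"}  # cuts every edge

cost_a = remote_communication_cost(edges, placement_a)
cost_b = remote_communication_cost(edges, placement_b)
```

    A partitioner in this setting would search for a placement like `placement_a`, where heavily communicating tasks share a host.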

A Protein Sequence Prediction Method by Mining Sequence Data (서열 데이타마이닝을 통한 단백질 서열 예측기법)

  • Cho, Sun-I;Lee, Do-Heon;Cho, Kwang-Hwi;Won, Yong-Gwan;Kim, Byoung-Ki
    • The KIPS Transactions:PartD
    • /
    • v.10D no.2
    • /
    • pp.261-266
    • /
    • 2003
  • A protein, which is a linear polymer of amino acids, is one of the most important biomolecules, composing biological structures and regulating biochemical reactions. Since the characteristics and functions of proteins are determined in principle by their amino acid sequences, protein sequence determination is the starting point of protein function study. This paper proposes a protein sequence prediction method based on data mining techniques, which can overcome the limitations of previous biochemical sequencing methods. After applying multiple proteases to acquire overlapping protein fragments, we identify candidate fragment sequences by comparing fragment mass values with peptide databases. We propose a method that constructs a multipartite graph and searches for maximal paths to determine the protein sequence by assembling the proper candidate sequences. In addition, experimental results based on the SWISS-PROT database that show the validity of the proposed method are presented.

Salient Object Detection via Multiple Random Walks

  • Zhai, Jiyou;Zhou, Jingbo;Ren, Yongfeng;Wang, Zhijian
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.4
    • /
    • pp.1712-1731
    • /
    • 2016
  • In this paper, we propose a novel saliency detection framework via multiple random walks (MRW), which simulates multiple agents on a graph simultaneously. In the MRW system, two agents, which represent the seeds of background and foreground, traverse the graph according to a transition matrix and interact with each other to reach a state of equilibrium. The proposed algorithm is divided into three steps. First, an initial segmentation is performed to partition an input image into homogeneous regions (i.e., superpixels) for saliency computation. Based on these regions, we construct a graph in which the nodes correspond to the superpixels in the image and the edges between neighboring nodes represent the similarities of the corresponding superpixels. Second, to generate the seeds of background, we first filter out the one of the four image boundaries that is least likely to belong to the background. The superpixels on each of the three remaining sides of the image are labeled as the seeds of background. To generate the seeds of foreground, we use the center prior, which holds that foreground objects tend to appear near the image center. In the last step, the seeds of foreground and background are treated as two different agents in multiple random walkers to complete the process of salient object detection. Experimental results on three benchmark databases demonstrate that the proposed method performs well against state-of-the-art methods in terms of accuracy and robustness.

Automatic Construction of SHACL Schemas for RDF Knowledge Graphs Generated by R2RML Mappings

  • Choi, Ji-Woong
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.8
    • /
    • pp.9-21
    • /
    • 2020
  • With the proliferation of RDF knowledge graphs (KGs), a need arose for a standardized schema representation of the graph model to support effective data interchange and interoperability. This need led the W3C to develop the SHACL specification for describing and validating the structure of RDF graphs. Relational databases (RDBs) are one of the major sources for acquiring structured knowledge. The standard for automatic generation of RDF KGs from RDBs is R2RML, which was also developed by the W3C. Since R2RML is designed to generate only RDF data graphs from RDBs, additional manual tasks are required to create the schemas for the graphs. In this paper, we propose an approach to automatically generate SHACL schemas for RDF KGs populated by R2RML mappings. The key to our approach is that the SHACL schemas are built only from R2RML documents. We describe an implementation of our approach. Then, we show the validity of our approach with the R2RML test cases designed by the W3C.

Implementation of WebGIS for Integration of GIS Spatial Analysis and Social Network Analysis (GIS 공간분석과 소셜 네트워크 분석의 통합을 위한 WebGIS 구현)

  • Choi, Hyo-Seok;Yom, Jae-Hong
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.32 no.2
    • /
    • pp.95-107
    • /
    • 2014
  • In general, topographical phenomena are represented graphically by data in the spatial domain, while attributes of the non-spatial domain are expressed by alphanumeric text. GIS functions for analyzing attributes in the non-spatial domain remain quite simple, such as search methods and simple statistical analysis. Recently, graph modeling and network analysis have become commonly used for understanding various social events and phenomena. In this study, we applied network analysis functions to the non-spatial domain data of GIS to enhance the overall spatial analysis. For this purpose, a novel design was presented to integrate the spatial database and the graph database, and this design was then implemented in a WebGIS system for better decision making. The developed WebGIS, with its underlying synchronized databases, was tested in a simulated application concerning the selection of water supply households during an epidemic of foot-and-mouth disease. The results of this test indicate that the developed WebGIS can contribute to improved decisions by taking into account social proximity factors as well as geospatial factors.

Effect of Carrot Intake in the Prevention of Gastric Cancer: A Meta-Analysis

  • Fallahzadeh, Hossein;Jalali, Ali;Momayyezi, Mahdieh;Bazm, Soheila
    • Journal of Gastric Cancer
    • /
    • v.15 no.4
    • /
    • pp.256-261
    • /
    • 2015
  • Purpose: Gastric cancer is the third leading cause of cancer-related mortality, with the incidence and mortality being higher in men than in women. Various studies have shown that eating carrots may play a major role in the prevention of gastric cancer. We conducted a meta-analysis to determine the relationship between carrot consumption and gastric cancer. Materials and Methods: We searched multiple databases including PubMed, Cochrane Library, Scopus, ScienceDirect, and Persian databases such as Scientific Information Database (SID) and IranMedx. The following search terms were used: stomach or gastric, neoplasm or cancer, carcinoma or tumor, and carrot. Statistical analyses were performed using Comprehensive Meta Analysis/2.0 software. Results: We retrieved 81 articles by searching the databases. After considering the inclusion and exclusion criteria, 5 articles were included in this study. The odds ratio (OR) obtained by the fixed effects model showed that the consumption of carrots was associated with a 26% reduction in the risk of gastric cancer (OR=0.74; 95% confidence interval=0.68~0.81; P<0.0001). According to the funnel graph, there was no indication of publication bias in this study. Conclusions: The findings of this study showed an inverse relationship between the consumption of carrots and the risk of gastric cancer.
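    The 26% figure in this abstract follows directly from the reported odds ratio. Strictly speaking it is a reduction in odds, which approximates a reduction in risk only when the outcome is rare (an assumption not stated in the abstract):

    \[
    (1 - \mathrm{OR}) \times 100\% = (1 - 0.74) \times 100\% = 26\%
    \]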

Construction of web-based material DB and comparison of material properties using 3D graph (웹기반 재료 DB 구축 및 3D 그래프를 사용한 물성비교)

  • Chun D.M.;Ahn S.H.
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 2005.06a
    • /
    • pp.724-727
    • /
    • 2005
  • Material selection is one of the important activities in design and manufacturing. The material selected at the conceptual design stage affects the material properties of the designed part as well as the manufacturability and cost of the final product. Unfortunately, there are not many accessible material databases that can be used for design. In this research, a web-based material database was constructed. To help designers compare different materials, two-dimensional and three-dimensional graphs were provided. Using these graphical tools, multi-dimensional comparison became available in a more intuitive manner. To provide information on the environmental safety of materials, the database included National Fire Protection Association publication Standard No.704. The web-based tool is available at http://fab.snu.ac.kr/matdb.

DTD-dependent object database schema design methods for efficiently managing Bio-XML (Bio-XML 관리를 위한 DTD 의존적 객체 데이터베이스 스키마 설계기법)

  • 김태경;이경희;조완섭
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2003.11a
    • /
    • pp.285-289
    • /
    • 2003
  • In this paper, we present DTD-dependent object database schema design methods to efficiently store XML data and process path expressions. The similarity between the DTD graph model and the object database model, together with object database features such as object references and set-valued attributes, makes object databases well suited to storing XML documents. We propose two kinds of schema design methods. We then compare and analyze the space and time complexity of the methods.

A Query Language for Quantitative Analysis on Graph Databases (그래프 데이터베이스의 양적 분석을 위한 질의 언어)

  • Park, Sung-Chan;Lee, Sang-Goo
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2011.06a
    • /
    • pp.77-80
    • /
    • 2011
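  • Graphs are one of the major topics in computer science, and they are attracting even more attention as the World Wide Web and social networks grow in importance. Research on query models for graph databases has likewise been treated as an important topic. However, these studies have mainly dealt with queries based on pattern matching. Applying graph data to tasks such as recommendation and search requires quantitative analysis of the link structure within a graph, as in PageRank. In addition, various quantitative analysis measures such as SimRank and Random Walk with Restart have been proposed. Accordingly, this study presents a query language that supports flexible quantitative analysis of graphs based on random walks. We also demonstrate the usefulness and extensibility of the query model by showing how existing quantitative analysis measures are expressed in it.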
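    Random Walk with Restart, one of the measures this abstract names, can be sketched with a few lines of power iteration. This is a generic textbook formulation, not the query language the paper proposes; the function name, the adjacency-dict representation, the restart probability of 0.15, and the handling of dangling nodes are all assumptions made for the example.

```python
# Generic Random Walk with Restart by power iteration (illustrative only).
def random_walk_with_restart(adj, seed, restart=0.15, iters=100):
    """Return a proximity score for every node relative to `seed`.

    adj: dict mapping each node to a list of its out-neighbours
         (every neighbour must also appear as a key)
    """
    score = {n: (1.0 if n == seed else 0.0) for n in adj}
    for _ in range(iters):
        nxt = {n: 0.0 for n in adj}
        for n, out in adj.items():
            walk_mass = (1.0 - restart) * score[n]
            if not out:
                # Assumption: a walker stuck on a dangling node jumps to the seed.
                nxt[seed] += walk_mass
                continue
            for m in out:
                nxt[m] += walk_mass / len(out)
        # Scores sum to 1, so the restart step returns `restart` mass to the seed.
        nxt[seed] += restart
        score = nxt
    return score


graph = {"a": ["b"], "b": ["a", "c"], "c": ["a"]}
scores = random_walk_with_restart(graph, seed="a")
```

    Nodes with higher scores are closer to the seed in the random-walk sense, which is the kind of quantity a recommendation or search application would rank by.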

The MAPN Modeling for the distributed Data Allocation based on Multiple Aspects (다중 측면 기반의 분산 데이터 할당을 위한 MAPN 모델링)

  • Park, Seong-Jin
    • The Transactions of the Korea Information Processing Society
    • /
    • v.7 no.3
    • /
    • pp.745-755
    • /
    • 2000
  • In designing distributed databases, the DAP (Data Allocation Problem) is one of the key design issues. However, because most previous research on the DAP has considered only the cost aspect, it cannot improve performance and availability, and it is not suitable for systems that require high availability or real-time processing. Therefore, we need a more formal data allocation model that considers multiple aspects. In this paper, we propose the MAPN (Multiple Aspects Petri Net) modeling method for distributed transaction modeling. The MAPN model, an extension of the classical Petri net, is proposed for formal modeling that considers multiple aspects (cost, performance, and availability) concurrently. We demonstrate that a valid DAP evaluation model, which considers not only cost but also performance and availability concurrently, can be composed by using the MAPN structure and the MAPN graph.
