header

Impact Analysis of Data Placement Strategies on Query Efforts in Distributed RDF Stores

38 Pages Posted: 17 Jan 2020 Publication Status: Accepted

See all articles by Daniel Janke

Daniel Janke

University of Koblenz-Landau

Steffen Staab

University of Koblenz-Landau - Institute for Web Science and Technologies (WeST); University of Southampton - Faculty of Engineering, Science and Mathematics

Matthias Thimm

University of Koblenz-Landau

Abstract

In the last years, scalable RDF stores in the cloud have been developed, where graph data is distributed over compute and storage nodes for scaling efforts of query processing and memory needs. One main challenge in these RDF stores is the data placement strategy that can be formalized in terms of graph covers. These graph covers determine whether (a) the triples distribution is well-balanced over all storage nodes (storage balance) (b) different query results may be computed on several compute nodes in parallel (vertical parallelization) and (c) individual query results can be produced only from triples assigned to few — ideally one — storage node (horizontal containment). We analyse the impact of three most commonly used graph cover strategies in these terms and found out that balancing query workload reduces the query execution time more than reducing data transfer over network. To this end, we present our novel benchmark and open source evaluation platform Koral.

Keywords: Distributed RDF stores, graph partitioning, benchmark

Suggested Citation

Janke, Daniel and Staab, Steffen and Thimm, Matthias, Impact Analysis of Data Placement Strategies on Query Efforts in Distributed RDF Stores (2018). Available at SSRN: https://ssrn.com/abstract=3180338 or http://dx.doi.org/10.2139/ssrn.3180338

Daniel Janke (Contact Author)

University of Koblenz-Landau ( email )

56070 Koblenz-Metternich
Germany

Steffen Staab

University of Koblenz-Landau - Institute for Web Science and Technologies (WeST) ( email )

Universitaetsstrasse 1, Gebäude B
Campus Koblenz
Koblenz, 56070
Germany

University of Southampton - Faculty of Engineering, Science and Mathematics ( email )

United Kingdom

Matthias Thimm

University of Koblenz-Landau ( email )

56070 Koblenz-Metternich
Germany

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
50
Abstract Views
963
PlumX Metrics