header

Everything You Always Wanted to Know About Blank Nodes

43 Pages Posted: 2 Jul 2018 First Look: Accepted

See all articles by Aidan Hogan

Aidan Hogan

National University of Ireland, Galway (NUIG) - Digital Enterprise Research Institute (DERI)

Marcelo Arenas

Pontifical Catholic University of Chile - Department of Computer Science

Alejandro Mallea

Pontifical Catholic University of Chile - Department of Computer Science

Axel Polleres

National University of Ireland, Galway (NUIG) - Digital Enterprise Research Institute (DERI)

Abstract

In this paper we thoroughly cover the issue of blank nodes, which have been defined in RDF as ‘existential variables’. We first introduce the theoretical precedent for existential blank nodes from first order logic and incomplete information in database theory. We then cover the different (and sometimes incompatible) treatment of blank nodes across the W3C stack of RDF-related standards. We present an empirical survey of the blank nodes present in a large sample of RDF data published on the Web (the BTC–2012 dataset), where we find that 25.7% of unique RDF terms are blank nodes, that 44.9% of documents and 66.2% of domains featured use of at least one blank node, and that aside from one Linked Data domain whose RDF data contains many “blank node cycles”, the vast majority of blank nodes form tree structures that are efficient to compute simple entailment over. With respect to the RDF-merge of the full data, we show that 6.1% of blank-nodes are redundant under simple entailment. The vast majority of non-lean cases are isomorphisms resulting from multiple blank nodes with no discriminating information being given within an RDF document or documents being duplicated in multiple Web locations. Although simple entailment is NP-complete and leanness-checking is coNP-complete, in computing this latter result, we demonstrate that in practice, real-world RDF graphs are sufficiently “rich” in ground information for problematic cases to be avoided by non-naive algorithms.

Keywords: blank nodes, rdf, simple entailment, leanness, skolemisation, semantic web, linked data

Suggested Citation

Hogan, Aidan and Arenas, Marcelo and Mallea, Alejandro and Polleres, Axel, Everything You Always Wanted to Know About Blank Nodes (2014). Journal of Web Semantics First Look. Available at SSRN: https://ssrn.com/abstract=3199109 or http://dx.doi.org/10.2139/ssrn.3199109

Aidan Hogan (Contact Author)

National University of Ireland, Galway (NUIG) - Digital Enterprise Research Institute (DERI) ( email )

University Road
Galway, Co. Kildare
Ireland

Marcelo Arenas

Pontifical Catholic University of Chile - Department of Computer Science ( email )

Casilla 306
Correo 22
Santiago
Chile

Alejandro Mallea

Pontifical Catholic University of Chile - Department of Computer Science ( email )

Casilla 306
Correo 22
Santiago
Chile

Axel Polleres

National University of Ireland, Galway (NUIG) - Digital Enterprise Research Institute (DERI) ( email )

University Road
Galway, Co. Kildare
Ireland

Register to save articles to
your library

Register

Paper statistics

Abstract Views
211
Downloads
9