header

Building Event-Centric Knowledge Graphs from News

23 Pages Posted: 24 Jun 2018 First Look: Accepted

See all articles by Marco Rospocher

Marco Rospocher

Fondazione Bruno Kessler

Marieke van Erp

KNAW Humanities Cluster - DHLab

Piek Vossen

VU University Amsterdam

Antiske Fokkens

VU University Amsterdam

Itziar Aldabe

Universidad del País Vasco (UPV/EHU)

German Rigau

Universidad del País Vasco (UPV/EHU)

Aitor Soroa

Universidad del País Vasco (UPV/EHU)

Thomas Ploeger

SynerScope B.V.

Tessel Bogaard

SynerScope B.V.

Abstract

Knowledge graphs have gained increasing popularity in the past couple of years, thanks to their adoption in everyday search engines. Typically, they consist of fairly static and encyclopedic facts about persons and organizations–e.g. a celebrity’s birth date, occupation and family members–obtained from large repositories such as Freebase or Wikipedia.

In this paper, we present a method and tools to automatically build knowledge graphs from news articles. As news articles describe changes in the world through the events they report, we present an approach to create Event-Centric Knowledge Graphs (ECKGs) using state-of-the-art natural language processing and semantic web techniques. Such ECKGs capture long-term developments and histories on hundreds of thousands of entities and are complementary to the static encyclopedic information in traditional knowledge graphs.

We describe our event-centric representation schema, the challenges in extracting event information from news, our open source pipeline, and the knowledge graphs we have extracted from four different news corpora: general news (Wikinews), the FIFA world cup, the Global Automotive Industry, and Airbus A380 airplanes. Furthermore, we present an assessment on the accuracy of the pipeline in extracting the triples of the knowledge graphs. Moreover, through an event-centered browser and visualization tool we show how approaching information from news in an event-centric manner can increase the user’s understanding of the domain, facilitates the reconstruction of news story lines, and enable to perform exploratory investigation of news hidden facts.

Keywords: Event-centric knowledge, Natural language processing, Event extraction, Information integration, Big data, Real world data

Suggested Citation

Rospocher, Marco and van Erp, Marieke and Vossen, Piek and Fokkens, Antiske and Aldabe, Itziar and Rigau, German and Soroa, Aitor and Ploeger, Thomas and Bogaard, Tessel, Building Event-Centric Knowledge Graphs from News (2016). Journal of Web Semantics First Look 37_0_8. Available at SSRN: https://ssrn.com/abstract=3199233 or http://dx.doi.org/10.2139/ssrn.3199233

Marco Rospocher

Fondazione Bruno Kessler

Via Sommarive 18
Povo
Trento, 38123
Italy

Marieke Van Erp (Contact Author)

KNAW Humanities Cluster - DHLab ( email )

Oudezijds Achterburgwal 185
Amsterdam, NH Noord-Holland 1012 DK
Netherlands

HOME PAGE: http://https://huc.knaw.nl

Piek Vossen

VU University Amsterdam

De Boelelaan 1105
Amsterdam, ND North Holland 1081 HV
Netherlands

Antiske Fokkens

VU University Amsterdam

De Boelelaan 1105
Amsterdam, ND North Holland 1081 HV
Netherlands

Itziar Aldabe

Universidad del País Vasco (UPV/EHU)

Barrio Sarriena s/n
Leioa, Bizkaia 48940
Spain

German Rigau

Universidad del País Vasco (UPV/EHU)

Barrio Sarriena s/n
Leioa, Bizkaia 48940
Spain

Aitor Soroa

Universidad del País Vasco (UPV/EHU)

Barrio Sarriena s/n
Leioa, Bizkaia 48940
Spain

Thomas Ploeger

SynerScope B.V.

Helvoirt
Netherlands

Tessel Bogaard

SynerScope B.V.

Helvoirt
Netherlands

Register to save articles to
your library

Register

Paper statistics

Abstract Views
375
Downloads
93