Event ground truth data

The enclosed data was extracted from the Wikipedia Current Events
portal https://en.wikipedia.org/wiki/Portal:Current_events for
use in the publication:

Andreas Spitz and Michael Gertz
Exploring Entity-centric Networks in Entangled News Streams
WWW Companion, 2018

For further details, visit
https://dbs.ifi.uni-heidelberg.de/resources/newsstream/

-------------------
-------------------

The data consists of two files:
The events file (entangledNews_groundTruth_events.json) contains 97
individual news events that could be linked to articles in the
collection. The items file (entangledNews_groundTruth_items.json)
contains all individual evaluation items that can be constructed from
these events (for details, please refer to the original publication).

Relevant fields in the data:
content:           full text of the event description
category:          category of the event 
date:              date on which the event occurred
source_links:      array containing the URLs of the event's sources
source_article_id: internal integer ID of the article (with the prefix "p",
                   these correspond to document IDs in the network data)
entities:          Wikidata identifiers of all entities in the event description
neClasses:         named entity classes of the entities (loc, org, or per)
terms:             non-entity terms in the event description
context:           context vector of the event description
verbs:             verbs that occur in the event description
verb_context:      context embedding of the verbs in the event description
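
As a minimal sketch of how these fields might be read outside of
MongoDB, the following Python snippet loads the events file, derives
the network document ID from source_article_id, and counts events per
category. It assumes that the file holds a single top-level JSON array
(as the --jsonArray import flag suggests) and that source_article_id
is stored without the "p" prefix. The function names are hypothetical
and not part of the original resources.

```python
import json
from collections import Counter


def load_events(path):
    """Read the events file, assumed to hold one top-level JSON array."""
    with open(path, encoding="utf-8") as handle:
        events = json.load(handle)
    if not isinstance(events, list):
        raise ValueError("expected a top-level JSON array of events")
    return events


def network_document_id(event):
    """Prefix the article ID with "p" to obtain the network document ID.

    Assumption: source_article_id is stored without the prefix.
    """
    return "p" + str(event["source_article_id"])


def events_per_category(events):
    """Count how many events fall into each category."""
    return Counter(event["category"] for event in events)
```

For example, load_events("entangledNews_groundTruth_events.json")
should return a list of 97 event records if the file matches the
description above.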

The items file contains additional attributes for the evaluation
process that correspond to the target entity (the entity to predict)
and all remaining entities.

-------------------
-------------------

Importing JSON objects into MongoDB

To import the JSON-formatted ground truth data into a MongoDB
installation, you can use the mongoimport command-line tool that
comes with MongoDB.

Simply replace <server address>, <port>, and <database name> according
to your local installation setup and run the following two commands:

mongoimport --host <server address>:<port> --db <database name> --collection ground_truth_events --file entangledNews_groundTruth_events.json --jsonArray

mongoimport --host <server address>:<port> --db <database name> --collection ground_truth_items --file entangledNews_groundTruth_items.json --jsonArray
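
If the import is to be scripted, the two commands above could be
assembled in Python as sketched below. The sketch uses only the
mongoimport flags shown above. The helper names and the example values
for host, port, and database are hypothetical and must be adapted to
the local setup.

```python
import subprocess

# Collection name and input file for each of the two imports above.
IMPORTS = [
    ("ground_truth_events", "entangledNews_groundTruth_events.json"),
    ("ground_truth_items", "entangledNews_groundTruth_items.json"),
]


def build_mongoimport_command(host, port, database, collection, json_file):
    """Return the mongoimport argument list for one JSON array file."""
    return [
        "mongoimport",
        "--host", f"{host}:{port}",
        "--db", database,
        "--collection", collection,
        "--file", json_file,
        "--jsonArray",
    ]


def build_all_commands(host, port, database):
    """Return the argument lists for both ground truth files."""
    return [
        build_mongoimport_command(host, port, database, collection, json_file)
        for collection, json_file in IMPORTS
    ]


def run_imports(host, port, database):
    """Run both imports; requires mongoimport to be on the PATH."""
    for command in build_all_commands(host, port, database):
        subprocess.run(command, check=True)
```

Calling run_imports("localhost", 27017, "newsstream") would then run
both imports in sequence and stop at the first one that fails.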
