Wikipedia Glossaries
(Data Version: November 2017)

The data set contains glossary entries extracted from Wikipedia for terms from the
fields of astronomy, biology, chemistry and geology. The data is formatted as a
tab-separated txt file with three columns:

<wikidata ID> \t <entity label> \t <glossary description>

<wikidata ID>:          Wikidata identifier of the entity
<entity label>:         English Wikidata label of the entity
<glossary description>: Wikipedia description of the entity in the glossary

The data was originally used as evaluation set for the extraction of descriptive
sentences for entities and entity relations. If you use the data set for research,
please cite this as the source:

A. Spitz, G. Feher, and M. Gertz.
Extracting Descriptions of Location Relations from Implicit Textual Networks.
11th Workshop on Geographic Information Retrieval (GIR), pages 1:1–1:9, 2017
