This repository contains reusable code to generate reusable benchmark datasets for data integration tasks.
Currently it contains code to:
- Generate DBpedia-TKG: A temporal Knowledge Graph by extraction multiple revisions of Wikipedia pages with the DBpedia Extraction Framework (DIEF) and annotating triples with their lifespan diffing triple revision versions.
Documentation was moved to dbpedia/dbpedia-temporal.