YAGO 4.5 is the latest version of the YAGO knowledge base. It is based on Wikidata — the largest public general-purpose knowledge base. YAGO refines the data as follows:
- All entity identifiers and property identifiers are human-readable.
- The top-level classes come from schema.org — a standard repertoire of classes and properties maintained by Google and others. The lower level classes are a careful selection of the Wikidata taxonomy.
- The properties come from schema.org.
- YAGO 4.5 contains semantic constraints in the form of SHACL. These constraints keep the data clean, and allow for logical reasoning on YAGO.
YAGO is thus a simplified, cleaned, and “reasonable” version of Wikidata. It contains 49 million entities and 109 million facts.
If you use YAGO 4.5 for scientific purposes, please cite our paper for YAGO 4:
Thomas Pellissier Tanon, Gerhard Weikum, Fabian M. Suchanek: “YAGO 4: A Reason-able Knowledge Base” Resource paper at the Extended Semantic Web Conference (ESWC), 2020