DATASET
DATASET_GPIulanWIKIDATA_FULL.csv brings together actors in provenance entities from the Getty Provenance Index (GPI) with corresponding Wikidata identifiers for artists, collectors, dealers, galleries, and auction houses (not museums).
VIEW DATASET HERE
The file contains 61,419 records, each representing a person or organization appearing in the GPI.
Every record retains six key GPI fields and is enriched, where available, with open-data identifiers and descriptions from Wikidata.
DOWNLOAD DATASET CSV
Merge details
Primary key:
ULANurl(Getty ULAN link)Join type: Left join — all GPI rows are preserved, even if no Wikidata match exists
Wikidata coverage: 11,146 matched entities out of 24,881 unique ULANs (≈45%)
Columns included: All original GPI columns plus Wikidata fields such as
item(Wikidata QID)itemLabel(name in Wikidata)itemDescriptionExternal identifiers:
VIAF,GND,ISNI,RKD,Proveana, and others
Use
This dataset enables cross-referencing between the Getty Provenance Index and Wikidata, facilitating linked-data research on art-market actors, networks, and provenance patterns.
It is particularly useful for identifying entities appearing in both GPI and open knowledge graphs, enriching provenance chains with additional biographical and institutional context.
It is also useful for identifying GAPS in the data (for example, missing ULAN codes in either GPI or WIKIDATA) and for targeting useful actions to improve data coherence and completeness.
📘 Columns from the Getty Provenance Index (GPI)
URI Linkedart json – Persistent URI for the Linked Art JSON record
name – Preferred name of the person or organization
ULANurl – Getty ULAN identifier in URL form (merge key)
starId – Internal GPI identifier
birthYear – Year of birth (where applicable)
biography GPI – Textual biographical note from GPI
🟦 Columns from Wikidata
item – Full Wikidata entity URI (e.g.,
http://www.wikidata.org/entity/Q5582)itemLabel – English label (name of the entity)
itemDescription – Short descriptive phrase from Wikidata
ulan – ULAN numeric identifier used in Wikidata (e.g.,
500044458)VIAF – Virtual International Authority File ID (
P214)GND – German National Library identifier (
P227)Lexikon – Künstlerlexikon der Schweiz identifier (
P9585)Proveana – Proveana database ID (
P9434)RKD – RKDartists ID (
P650)ArtHist – arthist.net identifier (
P10015or equivalent, if present)BritishM – British Museum person or org ID (
P1711or similar)ISNI – International Standard Name Identifier (
P213)LoC – Library of Congress ID (
P244)BNF – Bibliothèque nationale de France ID (
P268)YadVashem – Yad Vashem Holocaust database ID (
P6890)SNAC – Social Networks and Archival Context ID (
P3430)Joconde – French museum catalogue ID (
P347)BiografischPortaal – Biografisch Portaal van Nederland ID (
P651)ULANurl_wikidata – Getty ULAN URL as represented within Wikidata
Merge details
Primary key:
ULANurl(Getty ULAN link)Join type: Left join — all GPI rows are preserved, even if no Wikidata match exists
Wikidata coverage: 11,146 matched entities out of 24,881 unique ULANs (≈45%)