Open Datasets
Unrestricted access to the entire Etruscan digital corpus. Machine-readable formats updated via nightly builds.
Corpus JSON Extract
Deep-nested JSON of the 6,633-inscription unified corpus including morphological parses, geolocation, and linked-open-data URIs.
Tabular Metadata CSV
Flat tabular structure ideal for R or Pandas. Excludes nested morphological trees but retains all classification, findspot, and raw text data.
Semantic RDF/XML
Linked Open Data conformant export using the CIDOC-CRM ontology. Ready for integration with Pleiades and Trismegistos SPARQL endpoints.
Data Sovereignty
OpenEtruscan is committed to Open Science. We believe that cultural heritage data should be freely accessible to everyone - from university researchers to the general public.
All texts, translations, and morphological analyses are published under a Creative Commons Zero (CC0) license, effectively placing them in the public domain. You are free to copy, modify, and distribute this data without requesting permission.
Automation API
For continuous integration pipelines, we expose an endpoint to trigger direct headless downloads of the latest nightly dumps.
GET /api/v1/dumps/latest.json.gz
Headers:
Accept-Encoding: gzip
// Streams the raw compiled corpus directly