Skip to main content
MenuEtruscan
Open Access

Open Datasets

Unrestricted access to the entire Etruscan digital corpus. Machine-readable formats updated via nightly builds.

Nightly Build

Corpus JSON Extract

Deep-nested JSON of the 6,633-inscription unified corpus including morphological parses, geolocation, and linked-open-data URIs.

Size14.2 MB
SHA-256(see /api/dumps/corpus.json.gz.sha256 for live checksum)
Download JSON
Stable v2.0

Tabular Metadata CSV

Flat tabular structure ideal for R or Pandas. Excludes nested morphological trees but retains all classification, findspot, and raw text data.

Size3.8 MB
SHA-256f7e8d9c0b1a2...pending
Download CSV
Nightly Build

Semantic RDF/XML

Linked Open Data conformant export using the CIDOC-CRM ontology. Ready for integration with Pleiades and Trismegistos SPARQL endpoints.

Size22.5 MB
SHA-2563d4e5f6a7b8c...pending
Download RDF

Data Sovereignty

OpenEtruscan is committed to Open Science. We believe that cultural heritage data should be freely accessible to everyone - from university researchers to the general public.

All texts, translations, and morphological analyses are published under a Creative Commons Zero (CC0) license, effectively placing them in the public domain. You are free to copy, modify, and distribute this data without requesting permission.

Automation API

For continuous integration pipelines, we expose an endpoint to trigger direct headless downloads of the latest nightly dumps.

GET /api/v1/dumps/latest.json.gz
Headers:
  Accept-Encoding: gzip
// Streams the raw compiled corpus directly