Skip to main content
Computational Toolkit

Script Normalizer

A real-time translation engine mapping epigraphic Unicode (U+10300) to standardized scholarly transliteration (CIE/Canonical).

Quick Examples:
mi πŒ€πŒ‹πŒ‰πŒ’πŒ–
πŒ€πŒ…πŒ‰πŒ‹ πŒ”πŒ–πŒˆ
larΞΈia velus
Input Text
Auto-Detect
Output

How it Works

Etruscan can be written using ancient Old Italic symbols (πŒ€, 𐌁, πŒ‚) or modern Latin characters used by scholars (a, b, c).

This tool automatically detects which script you are typing and instantly normalizes it to the other format. This ensures that a researcher typing 'th' and another typing '𐌈' can both query the same exact word in the database.

Developer API

The Normalization engine is exposed as a WASM-compiled lexer via the API, returning all valid script transformations instantly.

POST /api/v1/toolkit/normalize
{
  "input": "πŒ€πŒ…πŒ‰πŒ‹"
}
// Returns { "canonical": "avil", "phonetic": "a.wil" }