MONDOGRAPH · THE WORLD'S PROPER-NAME KNOWLEDGE GRAPH
MondoGraph is Mondonomo's defensible moat. A decade of academic and industrial research distilled into a single graph: 126 billion attestations, every script in active use, every country, with formal etymology and the connections between names — romanizations, transliterations, soundalikes, cognates, variants — modelled as first-class edges.
WHY A KNOWLEDGE GRAPH
Every general-purpose model fails on the long tail of proper names. They mis-pronounce, mis-translate, mis-gender, and confuse two unrelated people with similar spellings. MondoGraph is the substrate that lets a tiny specialized model do all of that correctly — and a hundred times cheaper than GPT-4.
53M distinct given-name forms, 49M surname forms. Rare regional names with three bearers are in MondoGraph and not in your foundation model's training corpus.
Every name connects to its romanizations, transliterations, IPA, soundalikes, etymological cognates, and bearer demographics. The graph is the model.
B2C surfaces (mondonomo.ai, thai.mondonomo.ai, echoes) feed user contributions back into MondoGraph. A flywheel that's hard to copy.
SCHEMA
Most "name lists" are flat tables. MondoGraph models a name as a node connected to scripts, languages, countries, IPA realizations, soundalike clusters, etymological roots, variants, parsed parts, gender distributions, and known bearers. The connections are what make the downstream models possible.
WHAT'S INSIDE — TOKEN INVENTORY
126B total attestations across 165M unique strings
GIVEN + SURNAME — the substrate of every PNEUMA-DD model
Of 2,410 language codes
xx9.1%bo3.8%xx bucket (9.1%) covers tokens
where source language could not be determined. Both surfaced rather than hidden.
ACCESS
Hit MondoGraph through PNEUMA-DD and MondoPhon endpoints. Sub-50ms typical latency. Same keys work across all six demo endpoints. Free tier covers research and prototyping.
Filtered slices for academic use (Apache 2.0). Full graph available under commercial license. Snapshot updates quarterly.
Mondonomo collaborates on onomastic research and dataset extensions. Past partners: University of Zagreb, Chulalongkorn University. Reach out for new languages.