MODEL 03 · NAMED-ENTITY TYPING

Name Classifier

Powered by PNEUMA-DD (production) · hybrid PNEUMA in development

Decide whether a string is a personal name, a company, a place, a brand or product, or none of the above. The model returns calibrated probabilities — so you can route the long tail of CRM and KYC noise to the right pipeline before downstream parsing kicks in. The hybrid CNN-Transformer follow-up is published as Onomas-CNN X: 92.1% accuracy at 2,813 names/sec on a single CPU.

Model card Benchmark report

Try it — what kind of name is this?

Paste a string from a form field, a CRM record, or anywhere else. The classifier returns probabilities across the five-way label and the chosen top class.

EXAMPLES:
Enter a string and press Classify.

Model card

Architecture
XLM-RoBERTa-base fine-tuned · 270M params
Labels
person · organization · place · brand · noise
Accuracy
96.1% (held-out, balanced)
F1 by class
person 0.974 · org 0.951 · place 0.943 · brand 0.918 · noise 0.987
Training data
Nomograph DB + OpenStreetMap + ROR + Wikidata · ~14M labelled strings
Languages
Trained on 102, evaluated on 38
Calibration
Temperature-scaled; ECE 0.018

API

curl -X POST https://api.mondonomo.ai/v1/classify \ -H "Authorization: Bearer $TOKEN" \ -d '{"text": "Maria Teresa García"}' { "top": "person", "confidence": 0.992, "distribution": { "person": 0.992, "organization": 0.004, "place": 0.002, "brand": 0.001, "noise": 0.001 } }
Use it before parsing. Forms collect noise. Sales teams type "see attached" in the Name field. Running Classifier first cuts the work the downstream Name Parser and KYC pipeline have to do — and lets you ask the user to re-enter rather than mangling a non-name into a structured record.

RELATED

Use Classifier as the gate.