Benchmarks
Industry3 benchmarks
Accounting & Audit
Numerically exact work over tax and financial documents — reconciliation, controls, and the arithmetic discipline audits demand.
The Accounting & Auditscore is the mean of a model’s normalized 0–100 scores (direction-aware, so lower-is-better metrics are inverted) across the 3 benchmarks below — the same figure the leaderboard’s industry view ranks by.
Leaders
Nemotron 3 Ultra leads this industry with a score of 86.8.
Benchmarks in this score
Each model’s scores on these are normalized and averaged to produce the industry score above.
Tax & Accounting%
TaxEval
Vals AI TaxEval v2 — 1,500+ expert-written tax questions, scored on overall accuracy. Independent, in-house-run.
Finance%
Corporate Finance
Vals AI CorpFin v2 — expert-built questions over long-context corporate credit agreements; an independent, in-house-run finance benchmark.
Knowledge%
MMLU-Pro
A harder, cleaned-up successor to MMLU spanning 57+ subjects with 10-way multiple choice and reasoning-heavy items.
