Language ModelsProprietary

GPT-5.4

OpenAI's GPT-5.4 — the tier between GPT-5.2 and GPT-5.5; reasoning + tool use.

Crosshair Index

60.5

#15 of 32 · Language Models

Token pricing

USD per 1M tokens · cache read = cached input (hit), cache write = caching surcharge · official list pricing (June 2026).

Professional-domain strengths, composed from the benchmarks relevant to each field. Highlight an axis to see the benchmarks behind it.

skill

Numerically exact work over tax and financial documents — reconciliation, controls, and the arithmetic discipline audits demand.

Benchmark	Score	Source	Status
GPQA Diamond Reasoning	—	not evaluated
Humanity's Last Exam Frontier	—	not evaluated
SWE-bench Verified Agentic Coding	—	not evaluated
LiveCodeBench Coding	—	not evaluated
MMLU-Pro Knowledge	—	not evaluated
LMArena Elo Human Preference	1,467	LMArena (arena.ai)3rd-party	unverified
AA Intelligence Index Composite	57	Artificial Analysis3rd-party	unverified
Corporate Finance Finance	65.3%	Vals AI — CorpFin v23rd-party	unverified
LegalBench Law	86%	Vals AI — LegalBench3rd-party	unverified
TaxEval Tax & Accounting	74%	Vals AI — TaxEval v23rd-party	unverified
Medical Coding Medicine	41.3%	Vals AI — MedCode3rd-party	unverified