Crosshair
Leaderboard
Language ModelsProprietary

GPT-5.4

OpenAI's GPT-5.4 — the tier between GPT-5.2 and GPT-5.5; reasoning + tool use.

textimagecodeOfficial site
Crosshair Index
70.4
#11 of 32 · Language Models
Provider
OpenAI
Released
2026-03-05
Parameters
Undisclosed
Context
400K tokens

Token pricing

Input
$2.5 /1M
Output
$15 /1M
Cache read
$0.25 /1M

USD per 1M tokens · cache read = cached input (hit), cache write = caching surcharge · official list pricing (June 2026).

Industry skill web

Professional-domain strengths, composed from the benchmarks relevant to each field. Highlight an axis to see the benchmarks behind it.

SoftwareIB / FinanceLawMedicineResearchConsultingAccounting

Corporate Law

86
skill

Legal reasoning — issue spotting, rule application, and contract analysis — plus the broad knowledge a generalist counsel needs.

Scorecard

BenchmarkScoreSourceStatus
GPQA Diamond
Reasoning
not evaluated
Humanity's Last Exam
Frontier
not evaluated
SWE-bench Verified
Agentic Coding
not evaluated
LiveCodeBench
Coding
not evaluated
MMLU-Pro
Knowledge
not evaluated
LMArena Elo
Human Preference
1,467LMArena (arena.ai)3rd-partyunverified
AA Intelligence Index
Composite
57Artificial Analysis3rd-partyunverified
Corporate Finance
Finance
65.3%Vals AI — CorpFin v23rd-partyunverified
LegalBench
Law
86%Vals AI — LegalBench3rd-partyunverified
TaxEval
Tax & Accounting
74%Vals AI — TaxEval v23rd-partyunverified
Medical Coding
Medicine
41.3%Vals AI — MedCode3rd-partyunverified