Language Models · Proprietary
GPT-5.4
OpenAI's GPT-5.4: the tier between GPT-5.2 and GPT-5.5, with reasoning and tool use.
Crosshair Index: 70.4 (#11 of 32 · Language Models)
- Provider: OpenAI
- Released: 2026-03-05
- Parameters: Undisclosed
- Context: 400K tokens
Token pricing
- Input: $2.50 /1M
- Output: $15.00 /1M
- Cache read: $0.25 /1M
USD per 1M tokens · cache read = cached input (hit), cache write = caching surcharge · official list pricing (June 2026).
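As a rough illustration of how these list prices translate into a per-request cost, here is a minimal Python sketch. The function name `request_cost_usd` and its parameters are hypothetical, and it assumes that cached input tokens are billed at the cache-read rate instead of the regular input rate. No cache-write price is listed above, so any caching surcharge is left out.

```python
# List prices from the card above, in USD per 1M tokens.
PRICE_PER_M = {"input": 2.50, "output": 15.00, "cache_read": 0.25}


def request_cost_usd(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the cost of one request in USD.

    Assumption: `cached_tokens` is the portion of `input_tokens` served from
    cache, billed at the cache-read rate instead of the input rate.
    Cache-write surcharges are not modeled because no price is listed.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    uncached_tokens = input_tokens - cached_tokens
    total_per_million = (
        uncached_tokens * PRICE_PER_M["input"]
        + cached_tokens * PRICE_PER_M["cache_read"]
        + output_tokens * PRICE_PER_M["output"]
    )
    return total_per_million / 1_000_000


# Example: 100,000 input tokens (60,000 of them cache hits) and 2,000 output tokens.
example_cost = request_cost_usd(100_000, 2_000, cached_tokens=60_000)
print(f"${example_cost:.3f}")  # prints $0.145
```

In this example the 40,000 uncached input tokens cost $0.100, the 60,000 cached tokens cost $0.015, and the 2,000 output tokens cost $0.030.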
Industry skill web
Professional-domain strengths, composed from the benchmarks relevant to each field.
Corporate Law: 86 (skill)
Legal reasoning (issue spotting, rule application, and contract analysis) plus the broad knowledge a generalist counsel needs.
Scorecard
| Benchmark | Category | Score | Source | Status |
|---|---|---|---|---|
| GPQA Diamond | Reasoning | n/a | n/a | not evaluated |
| Humanity's Last Exam | Frontier | n/a | n/a | not evaluated |
| SWE-bench Verified | Agentic Coding | n/a | n/a | not evaluated |
| LiveCodeBench | Coding | n/a | n/a | not evaluated |
| MMLU-Pro | Knowledge | n/a | n/a | not evaluated |
| LMArena Elo | Human Preference | 1,467 | LMArena (arena.ai), 3rd-party | unverified |
| AA Intelligence Index | Composite | 57 | Artificial Analysis, 3rd-party | unverified |
| Corporate Finance | Finance | 65.3% | Vals AI (CorpFin v2), 3rd-party | unverified |
| LegalBench | Law | 86% | Vals AI (LegalBench), 3rd-party | unverified |
| TaxEval | Tax & Accounting | 74% | Vals AI (TaxEval v2), 3rd-party | unverified |
| Medical Coding | Medicine | 41.3% | Vals AI (MedCode), 3rd-party | unverified |
