Language Models · Proprietary

Gemini 2.5 Pro

Prior-generation Gemini flagship; widely used, slated for deprecation.

text · image · audio · video · code

Crosshair Index

44.6

#25 of 32 · Language Models

Token pricing

USD per 1M tokens · cache read = cached input (hit), cache write = caching surcharge · official list pricing (June 2026).
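The pricing units above can be turned into a per-request cost with simple arithmetic. The sketch below is illustrative only: the `Rates` class, the `request_cost` function, and every rate value are hypothetical placeholders, not this model's list prices. It also assumes that cached input tokens are billed at the cache-read rate instead of the normal input rate, and that the cache-write surcharge is added on top of the input rate for tokens written to the cache; check the vendor's pricing page for the actual billing rules.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class Rates:
    """Prices in USD per 1M tokens. Values are supplied by the caller."""
    input: float        # uncached input tokens
    output: float       # generated tokens
    cache_read: float   # cached input tokens (cache hit)
    cache_write: float  # surcharge for tokens written to the cache


def request_cost(rates: Rates, fresh_input: int, cached_input: int,
                 cache_written: int, output: int) -> float:
    """Return the cost of one request in USD.

    Assumption: cache_written tokens are already counted in fresh_input,
    so only the surcharge is added for them here.
    """
    total = (
        fresh_input * rates.input
        + cached_input * rates.cache_read
        + cache_written * rates.cache_write
        + output * rates.output
    )
    return total / TOKENS_PER_MILLION


# Placeholder rates for illustration only; NOT official pricing.
example_rates = Rates(input=1.00, output=10.00, cache_read=0.10, cache_write=0.50)

cost = request_cost(
    example_rates,
    fresh_input=20_000,
    cached_input=80_000,
    cache_written=0,
    output=2_000,
)
print(f"{cost:.4f} USD")
```

With these placeholder rates, most of the cost comes from the 20,000 uncached input tokens and the 2,000 output tokens, while the 80,000 cached tokens contribute comparatively little. That is the practical effect of a cache-read discount.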

Professional-domain strengths, composed from the benchmarks relevant to each field.

Clinical knowledge and diagnostic reasoning, including the medical coding accuracy and science depth that real practice demands.

Benchmark	Category	Score	Source	Source type	Status
GPQA Diamond	Reasoning	86.4%	Google DeepMind, Gemini 2.5 Pro model card	vendor	unverified
Humanity's Last Exam	Frontier	21.6%	Google DeepMind, Gemini 2.5 Pro model card	vendor	unverified
SWE-bench Verified	Agentic Coding	59.6%	Google DeepMind, Gemini 2.5 Pro model card	vendor	unverified
LiveCodeBench	Coding	69%	Google DeepMind, Gemini 2.5 Pro model card	vendor	unverified
MMLU-Pro	Knowledge	-	-	-	not evaluated
LMArena Elo	Human Preference	1,446	LMArena (arena.ai)	3rd-party	unverified
AA Intelligence Index	Composite	35	Artificial Analysis	3rd-party	unverified
Corporate Finance	Finance	60.8%	Vals AI, CorpFin v2	3rd-party	unverified
LegalBench	Law	-	-	-	not evaluated
TaxEval	Tax & Accounting	-	-	-	not evaluated
Medical Coding	Medicine	50.6%	Vals AI, MedCode	3rd-party	unverified