Language ModelsProprietary

Gemini 3 Pro

Google DeepMind's frontier multimodal, long-context model.

textimageaudiovideocodeOfficial site

Crosshair Index

70.3

#9 of 32 · Language Models

Token pricing

USD per 1M tokens · cache read = cached input (hit), cache write = caching surcharge · official list pricing (June 2026).

Professional-domain strengths, composed from the benchmarks relevant to each field. Highlight an axis to see the benchmarks behind it.

skill

Clinical knowledge and diagnostic reasoning, including the medical coding accuracy and science depth that real practice demands.

Benchmark	Score	Source	Status
GPQA Diamond Reasoning	91.9%	Google — Gemini 3 Provendor	unverified
Humanity's Last Exam Frontier	37.5%	Google — Gemini 3 Provendor	unverified
SWE-bench Verified Agentic Coding	76.2%	Google — Gemini 3 Provendor	unverified
LiveCodeBench Coding	—	not evaluated
MMLU-Pro Knowledge	—	not evaluated
LMArena Elo Human Preference	1,486	LMArena (arena.ai)3rd-party	unverified
AA Intelligence Index Composite	48	Artificial Analysis3rd-party	unverified
Corporate Finance Finance	63.7%	Vals AI — CorpFin v23rd-party	unverified
LegalBench Law	87%	Vals AI — LegalBench3rd-party	unverified
TaxEval Tax & Accounting	72.6%	Vals AI — TaxEval v23rd-party	unverified
Medical Coding Medicine	52.2%	Vals AI — MedCode3rd-party	unverified