Language Models · Open weights

DeepSeek V4-Pro

Open-weights mixture-of-experts (MoE) flagship (1.6T total / 49B active parameters) with built-in reasoning modes, released under the MIT license.

Crosshair Index

72.0

#7 of 32 · Language Models

Token pricing

USD per 1M tokens · cache read = cached input (hit), cache write = caching surcharge · official list pricing (June 2026).
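As a rough illustration of how per-1M-token rates turn into a bill, here is a minimal sketch. The function name `request_cost_usd` and every rate in the usage line are placeholders chosen for the example, not DeepSeek's list prices. It also assumes the cache-write surcharge is billed per token written to the cache, which may not match the vendor's actual billing.

```python
def request_cost_usd(
    fresh_input_tokens: int,
    cached_input_tokens: int,
    output_tokens: int,
    input_rate: float,
    cache_read_rate: float,
    output_rate: float,
    cache_write_tokens: int = 0,
    cache_write_rate: float = 0.0,
) -> float:
    """Estimate cost in USD from token counts and USD-per-1M-token rates.

    Assumption: cached input (cache hits) is billed at cache_read_rate instead
    of input_rate, and the cache-write surcharge applies per token written.
    """
    tokens_per_unit = 1_000_000  # rates are quoted per 1M tokens
    total = (
        fresh_input_tokens * input_rate
        + cached_input_tokens * cache_read_rate
        + cache_write_tokens * cache_write_rate
        + output_tokens * output_rate
    )
    return total / tokens_per_unit


# Placeholder rates for illustration only (NOT actual V4-Pro pricing).
example_cost = request_cost_usd(
    fresh_input_tokens=800_000,
    cached_input_tokens=200_000,
    output_tokens=100_000,
    input_rate=1.00,
    cache_read_rate=0.10,
    output_rate=2.00,
)
print(f"Estimated cost: ${example_cost:.2f}")
```

Keeping cache hits as a separate argument makes it easy to see how much a higher cache-hit ratio lowers the input side of the bill.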

Professional-domain strengths, composed from the benchmarks relevant to each field.

Financial analysis over filings and credit agreements — valuation math, document QA, and the quantitative reasoning behind deals.

Benchmark	Category	Score	Source	Source type	Status
GPQA Diamond	Reasoning	90.1%	DeepSeek: V4-Pro model card	vendor	unverified
Humanity's Last Exam	Frontier	37.7%	DeepSeek: V4-Pro model card	vendor	unverified
SWE-bench Verified	Agentic Coding	80.6%	DeepSeek: V4-Pro model card	vendor	unverified
LiveCodeBench	Coding	93.5% (best)	DeepSeek: V4-Pro model card	vendor	unverified
MMLU-Pro	Knowledge	87.5% (best)	DeepSeek: V4-Pro model card	vendor	unverified
LMArena Elo	Human Preference	1,457	LMArena (arena.ai)	3rd-party	unverified
AA Intelligence Index	Composite	52	Artificial Analysis	3rd-party	unverified
Corporate Finance	Finance	61.4%	Vals AI: CorpFin v2	3rd-party	unverified
LegalBench	Law	80.3%	Vals AI: LegalBench	3rd-party	unverified
TaxEval	Tax & Accounting	72.1%	Vals AI: TaxEval v2	3rd-party	unverified
Medical Coding	Medicine	40.5%	Vals AI: MedCode	3rd-party	unverified
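
The page says each professional-domain axis is composed from the benchmarks relevant to that field, but it does not state which benchmarks feed which axis or how they are weighted. The sketch below assumes an unweighted mean and a hypothetical axis-to-benchmark mapping (`AXIS_BENCHMARKS`), purely to show the shape of such a composite. Only the scores are taken from the table above.

```python
from statistics import mean

# Scores (percent) copied from the benchmark table above.
SCORES = {
    "Corporate Finance": 61.4,
    "TaxEval": 72.1,
    "LegalBench": 80.3,
    "Medical Coding": 40.5,
}

# Hypothetical mapping: the page does not say which benchmarks back each axis.
AXIS_BENCHMARKS = {
    "finance": ["Corporate Finance", "TaxEval"],
    "law": ["LegalBench"],
    "medicine": ["Medical Coding"],
}


def axis_score(axis: str) -> float:
    """Unweighted mean of the benchmark scores assigned to an axis (assumed method)."""
    return mean(SCORES[name] for name in AXIS_BENCHMARKS[axis])


for axis in AXIS_BENCHMARKS:
    print(f"{axis}: {axis_score(axis):.2f}")
```

A weighted mean or a normalization step would slot into `axis_score` without changing the rest of the sketch.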