Language ModelsProprietary

Qwen3.7 Max

Alibaba's proprietary Qwen3.7 Max flagship; agentic-focused.

Crosshair Index

74.9

#5 of 32 · Language Models

Token pricing

USD per 1M tokens · cache read = cached input (hit), cache write = caching surcharge · official list pricing (June 2026).

Professional-domain strengths, composed from the benchmarks relevant to each field. Highlight an axis to see the benchmarks behind it.

skill

Shipping working code against real repositories: bug fixes, feature patches, and competitive programming under tests.

Benchmark	Score	Source	Status
GPQA Diamond Reasoning	92.4%	Qwen — Qwen3.7 Maxvendor	unverified
Humanity's Last Exam Frontier	41.4%	Qwen — Qwen3.7 Maxvendor	unverified
SWE-bench Verified Agentic Coding	80.4%	Qwen — Qwen3.7 Maxvendor	unverified
LiveCodeBench Coding	91.6%	Qwen — Qwen3.7 Maxvendor	unverified
MMLU-Pro Knowledge	—	not evaluated
LMArena Elo Human Preference	1,474	LMArena (arena.ai)3rd-party	unverified
AA Intelligence Index Composite	57	Artificial Analysis3rd-party	unverified
Corporate Finance Finance	63.7%	Vals AI — CorpFin v23rd-party	unverified
LegalBench Law	84.9%	Vals AI — LegalBench3rd-party	unverified
TaxEval Tax & Accounting	75.3%	Vals AI — TaxEval v23rd-party	unverified
Medical Coding Medicine	38.8%	Vals AI — MedCode3rd-party	unverified