Crosshair

Models

Every system on the board, language and world models alike. Select one for its full scorecard and sources.

Language Models

· 32

Claude Opus 4.8

Anthropic

Proprietary

Anthropic's frontier model and the current intelligence leader; excels at agentic coding and long-horizon reasoning.

textimagecode1M ctx

Claude Sonnet 4.6

Anthropic

Proprietary

Balanced speed/quality workhorse of the Claude 4 family.

textimagecode1M ctx

Claude Haiku 4.5

Anthropic

Proprietary

Fast, low-cost Claude tier with near-frontier coding.

textimagecode200K ctx

GPT-5.5

OpenAI

Proprietary

OpenAI's flagship multimodal reasoning model (figures shown for the high reasoning-effort config).

textimageaudiocode400K ctx

GPT-5.2

OpenAI

Proprietary

Prior OpenAI flagship; still available but scheduled for API retirement in Aug 2026.

textimagecode400K ctx

Gemini 3.1 Pro

Google DeepMind

Proprietary

Google's most capable Gemini (preview); tops several reasoning benchmarks.

textimageaudiovideo1.048576M ctx

Gemini 3 Pro

Google DeepMind

Proprietary

Google DeepMind's frontier multimodal, long-context model.

textimageaudiovideo1M ctx

Gemini 3 Flash

Google DeepMind

Proprietary

Fast, low-cost tier of the Gemini 3 line (preview).

textimageaudiovideo1M ctx

DeepSeek V4-Pro

DeepSeek

Open weights

Open-weights MoE flagship (1.6T total / 49B active) with built-in reasoning modes (MIT).

textcode1M ctx

DeepSeek V4-Flash

DeepSeek

Open weights

Efficient open-weights V4 tier (284B total / 13B active), MIT-licensed.

textcode1M ctx

Kimi K2.6

Moonshot AI

Open weights

Moonshot's trillion-param MoE (32B active); strong agentic coding (Modified MIT).

textcode262.144K ctx

GLM-5.1

Z.ai (Zhipu)

Open weights

Zhipu/Z.ai open-weights flagship (754B total / 40B active), MIT.

textcode200K ctx

Qwen3.6-27B

Alibaba Qwen

Open weights

Alibaba's dense open-weights Qwen3.6 (Apache-2.0); multimodal.

textimagecode262.144K ctx

Llama 4 Maverick

Meta AI

Open weights

Meta's open-weights MoE (~17B active); the current open Llama flagship.

textimagecode1M ctx

Mistral Large 3

Mistral AI

Open weights

Mistral's open-weights MoE flagship (41B active, Apache-2.0); a non-reasoning model.

textcode256K ctx

Grok 4.3

xAI

Proprietary

xAI's current flagship; ranks high on the Artificial Analysis Index (xAI publishes few classic benchmarks).

textimagecode1M ctx

Claude Opus 4.7

Anthropic

Proprietary

Anthropic's prior Opus flagship (superseded by 4.8); still strong on agentic coding.

textimagecode1M ctx

Claude Opus 4.6

Anthropic

Proprietary

Earlier Opus 4.6 frontier model; still generally available.

textimagecode1M ctx

GPT-5.4

OpenAI

Proprietary

OpenAI's GPT-5.4 — the tier between GPT-5.2 and GPT-5.5; reasoning + tool use.

textimagecode400K ctx

Gemini 3.5 Flash

Google DeepMind

Proprietary

Google's current fast/default Gemini (GA); strong agentic performance.

textimageaudiovideo1.048576M ctx

Gemini 2.5 Pro

Google DeepMind

Proprietary

Prior-generation Gemini flagship; widely used, slated for deprecation.

textimageaudiovideo1.048576M ctx

Grok 4.20

xAI

Proprietary

xAI Grok 4.20 (reasoning); multi-agent 'council' architecture, very long context.

textimagecode2M ctx

DeepSeek V3.2

DeepSeek

Open weights

Open-weights MoE (671B total / 37B active), MIT; predecessor to V4.

textcode128K ctx

Kimi K2 Thinking

Moonshot AI

Open weights

Moonshot's Nov-2025 long-horizon reasoning MoE (1T total / 32B active), Modified MIT.

textcode262.144K ctx

Qwen3.7 Max

Alibaba Qwen

Proprietary

Alibaba's proprietary Qwen3.7 Max flagship; agentic-focused.

textcode1M ctx

Qwen3.6-35B-A3B

Alibaba Qwen

Open weights

Open-weights hybrid-MoE Qwen3.6 (35B total / 3B active), Apache-2.0.

textimagevideocode262.144K ctx

Muse Spark

Meta AI

Proprietary

Meta Superintelligence Labs' first proprietary frontier model (free via Meta AI).

textimagecode262.144K ctx

Llama 4 Scout

Meta AI

Open weights

Open-weights MoE (~17B active / 109B total) with very long context.

textimagecode10M ctx

Nova 2 Pro

Amazon

Proprietary

Amazon's Nova 2 Pro flagship on Bedrock; hybrid reasoning, multimodal.

textimageaudiocode1M ctx

Nemotron 3 Ultra

NVIDIA

Open weights

NVIDIA's open-weights hybrid Mamba-MoE (550B total / 55B active), OpenMDW license.

textcode1M ctx

MiniMax M3

MiniMax

Proprietary

MiniMax's M3 flagship; tops the Artificial Analysis Intelligence Index (open weights pending).

textimagevideocode1M ctx

Doubao Seed 2.0 Pro

ByteDance

Proprietary

ByteDance's Doubao Seed 2.0 Pro; multimodal flagship served via Volcano Engine.

textimagevideocode262.144K ctx

World Models

emerging· 5