Crosshair IntelligenceCrosshair

Models

Every system on the board, language and world models alike. Select one for its full scorecard and sources.

Language Models

· 32

Claude Opus 4.8

Anthropic

Anthropic's frontier model and the current intelligence leader; excels at agentic coding and long-horizon reasoning.

textimagecode1M ctx

Claude Sonnet 4.6

Anthropic

Balanced speed/quality workhorse of the Claude 4 family.

textimagecode1M ctx

Claude Haiku 4.5

Anthropic

Fast, low-cost Claude tier with near-frontier coding.

textimagecode200K ctx

GPT-5.5

OpenAI

OpenAI's flagship multimodal reasoning model (figures shown for the high reasoning-effort config).

textimageaudiocode400K ctx

GPT-5.2

OpenAI

Prior OpenAI flagship; still available but scheduled for API retirement in Aug 2026.

textimagecode400K ctx

Gemini 3.1 Pro

Google DeepMind

Google's most capable Gemini (preview); tops several reasoning benchmarks.

textimageaudiovideo1.048576M ctx

Gemini 3 Pro

Google DeepMind

Google DeepMind's frontier multimodal, long-context model.

textimageaudiovideo1M ctx

Gemini 3 Flash

Google DeepMind

Fast, low-cost tier of the Gemini 3 line (preview).

textimageaudiovideo1M ctx

DeepSeek V4-Pro

DeepSeek

Open-weights MoE flagship (1.6T total / 49B active) with built-in reasoning modes (MIT).

DeepSeek V4-Flash

DeepSeek

Efficient open-weights V4 tier (284B total / 13B active), MIT-licensed.

Kimi K2.6

Moonshot AI

Moonshot's trillion-param MoE (32B active); strong agentic coding (Modified MIT).

textcode262.144K ctx

GLM-5.1

Z.ai (Zhipu)

Zhipu/Z.ai open-weights flagship (754B total / 40B active), MIT.

textcode200K ctx

Qwen3.6-27B

Alibaba Qwen

Alibaba's dense open-weights Qwen3.6 (Apache-2.0); multimodal.

textimagecode262.144K ctx

Llama 4 Maverick

Meta AI

Meta's open-weights MoE (~17B active); the current open Llama flagship.

textimagecode1M ctx

Mistral Large 3

Mistral AI

Mistral's open-weights MoE flagship (41B active, Apache-2.0); a non-reasoning model.

textcode256K ctx

Grok 4.3

xAI

xAI's current flagship; ranks high on the Artificial Analysis Index (xAI publishes few classic benchmarks).

textimagecode1M ctx

Claude Opus 4.7

Anthropic

Anthropic's prior Opus flagship (superseded by 4.8); still strong on agentic coding.

textimagecode1M ctx

Claude Opus 4.6

Anthropic

Earlier Opus 4.6 frontier model; still generally available.

textimagecode1M ctx

GPT-5.4

OpenAI

OpenAI's GPT-5.4 — the tier between GPT-5.2 and GPT-5.5; reasoning + tool use.

textimagecode400K ctx

Gemini 3.5 Flash

Google DeepMind

Google's current fast/default Gemini (GA); strong agentic performance.

textimageaudiovideo1.048576M ctx

Gemini 2.5 Pro

Google DeepMind

Prior-generation Gemini flagship; widely used, slated for deprecation.

textimageaudiovideo1.048576M ctx

Grok 4.20

xAI

xAI Grok 4.20 (reasoning); multi-agent 'council' architecture, very long context.

textimagecode2M ctx

DeepSeek V3.2

DeepSeek

Open-weights MoE (671B total / 37B active), MIT; predecessor to V4.

textcode128K ctx

Kimi K2 Thinking

Moonshot AI

Moonshot's Nov-2025 long-horizon reasoning MoE (1T total / 32B active), Modified MIT.

textcode262.144K ctx

Qwen3.7 Max

Alibaba Qwen

Alibaba's proprietary Qwen3.7 Max flagship; agentic-focused.

Qwen3.6-35B-A3B

Alibaba Qwen

Open-weights hybrid-MoE Qwen3.6 (35B total / 3B active), Apache-2.0.

textimagevideocode262.144K ctx

Muse Spark

Meta AI

Meta Superintelligence Labs' first proprietary frontier model (free via Meta AI).

textimagecode262.144K ctx

Llama 4 Scout

Meta AI

Open-weights MoE (~17B active / 109B total) with very long context.

textimagecode10M ctx

Nova 2 Pro

Amazon

Amazon's Nova 2 Pro flagship on Bedrock; hybrid reasoning, multimodal.

textimageaudiocode1M ctx

Nemotron 3 Ultra

NVIDIA

NVIDIA's open-weights hybrid Mamba-MoE (550B total / 55B active), OpenMDW license.

MiniMax M3

MiniMax

MiniMax's M3 flagship; tops the Artificial Analysis Intelligence Index (open weights pending).

textimagevideocode1M ctx

Doubao Seed 2.0 Pro

ByteDance

ByteDance's Doubao Seed 2.0 Pro; multimodal flagship served via Volcano Engine.

textimagevideocode262.144K ctx

World Models

emerging· 10

V-JEPA 2

Meta AI

Self-supervised video joint-embedding predictive architecture; learns world dynamics for motion understanding, action anticipation, and zero-shot robot planning.

videoactionembodied

Genie 3

Google DeepMind

Foundation world model that generates interactive, controllable environments in real time. Shown qualitatively — DeepMind publishes no standardized cross-model benchmark numbers.

Cosmos Predict 2.5

NVIDIA

NVIDIA's open-weights World Foundation Model for physical AI (future-frame prediction for robotics and AV); the 2B variant is independently scored on PAI-Bench-G.

Marble

World Labs

World Labs' spatially-grounded generative model — persistent, editable 3D scenes from text, images, or video. Marketed on persistence/editability; no standardized benchmarks published.

Sora

OpenAI

OpenAI's text/image-to-video model — the system that popularized 'video as world simulator'. The original Sora was independently scored on Physics-IQ, where generative physical understanding proved severely limited.

Sora 2

OpenAI

OpenAI's flagship video+audio model (the 'GPT-3.5 moment for video'); a large jump in physical plausibility over the original Sora. Consumer app retired Apr 2026, API through Sep 2026.

Veo 3

Google DeepMind

Google DeepMind's text/image-to-video model with native audio and real-world physics; among the top performers on the PAI-Bench physical-AI generation benchmark.

Wan 2.2

Alibaba Qwen

Alibaba Tongyi Lab's open-weights MoE video generator (Apache-2.0); the I2V-A14B variant leads the open models on PAI-Bench-G.

HunyuanVideo

Tencent

Tencent's 13B open-weights video foundation model; the image-to-video variant is benchmarked on PAI-Bench-G.

Ray 3

Luma AI

Luma's reasoning-driven video model with 3D-aware generation and native HDR. Demos only — no standardized cross-model benchmark results published.