Crosshair
Benchmarks
Codinghigher is better

LiveCodeBench

Contamination-resistant competitive-programming problems collected over time to avoid training-set overlap.

Benchmark source
Domain
Coding
Metric
%
Orientation
Higher is better
Results
13

Ranking

#ModelScoreSourceStatus
1DeepSeek V4-Pro
DeepSeek
93.5%DeepSeek — V4-Pro model cardvendorunverified
2Qwen3.7 Max
Alibaba Qwen
91.6%Qwen — Qwen3.7 Maxvendorunverified
3DeepSeek V4-Flash
DeepSeek
91.6%DeepSeek — V4-Flash model cardvendorunverified
4Kimi K2.6
Moonshot AI
89.6%Moonshot — Kimi K2.6 model cardvendorunverified
5Nemotron 3 Ultra
NVIDIA
89%NVIDIA — Nemotron 3 Ultra model cardvendorunverified
6Qwen3.6-27B
Alibaba Qwen
83.9%Alibaba — Qwen3.6-27B model cardvendorunverified
7DeepSeek V3.2
DeepSeek
83.3%DeepSeek — V3.2 technical reportvendorunverified
8Kimi K2 Thinking
Moonshot AI
83.1%Moonshot — Kimi K2 Thinking model cardvendorunverified
9Qwen3.6-35B-A3B
Alibaba Qwen
80.4%Alibaba — Qwen3.6-35B-A3B model cardvendorunverified
10Nova 2 Pro
Amazon
74.6%Amazon — Nova 2 technical reportvendorunverified
11Gemini 2.5 Pro
Google DeepMind
69%Google DeepMind — Gemini 2.5 Pro model cardvendorunverified
12Llama 4 Maverick
Meta AI
43.4%Meta — Llama 4vendorunverified
13Llama 4 Scout
Meta AI
32.8%Meta — Llama 4vendorunverified