Leaderboard
Language ModelsOpen weights
Crosshair Index
70.1
#12 of 32 · Language Models
- Provider
- Alibaba Qwen
- Released
- 2026-04-22
- Parameters
- 27B
- Context
- 262.144K tokens
Token pricing
- Input
- $0.25 /1M
- Output
- $1.49 /1M
USD per 1M tokens · cache read = cached input (hit), cache write = caching surcharge · official list pricing (June 2026).
Industry skill web
Professional-domain strengths, composed from the benchmarks relevant to each field. Highlight an axis to see the benchmarks behind it.
Medicine
87
skill
Clinical knowledge and diagnostic reasoning, including the medical coding accuracy and science depth that real practice demands.
Scorecard
| Benchmark | Score | Source | Status |
|---|---|---|---|
| GPQA Diamond Reasoning | 87.8% | Alibaba — Qwen3.6-27B model cardvendor | unverified |
| Humanity's Last Exam Frontier | 24% | Alibaba — Qwen3.6-27B model cardvendor | unverified |
| SWE-bench Verified Agentic Coding | 77.2% | Alibaba — Qwen3.6-27B model cardvendor | unverified |
| LiveCodeBench Coding | 83.9% | Alibaba — Qwen3.6-27B model cardvendor | unverified |
| MMLU-Pro Knowledge | 86.2% | Alibaba — Qwen3.6-27B model cardvendor | unverified |
| LMArena Elo Human Preference | — | not evaluated | |
| AA Intelligence Index Composite | 46 | Artificial Analysis3rd-party | unverified |
| Corporate Finance Finance | 62.3% | Vals AI — CorpFin v23rd-party | unverified |
| LegalBench Law | — | not evaluated | |
| TaxEval Tax & Accounting | 71.3% | Vals AI — TaxEval v23rd-party | unverified |
| Medical Coding Medicine | — | not evaluated |
