Model Category
Coding
Mistral Small 4
Efficient instruct
Reasoning 82.0
Coding 86.0
256K tokens
DeepSeek-V4-Flash
Very low-cost long-context API workflows where DeepSeek compatibility and economics are attractive.
Reasoning 84.0
Coding 85.0
1M tokens
Claude Opus 4.8
High-autonomy agentic coding
Reasoning 96.0
Coding 96.0
1M tokens
Devstral 2
Code agents
Reasoning 82.0
Coding 92.0
256K tokens
Gemini 3.1 Pro Preview
Large-context reasoning
Reasoning 94.0
Coding 95.0
1M tokens
Mistral Medium 3.5
Developers who want a capable European API option for coding and agent-style product work.
Reasoning 84.0
Coding 88.0
256K tokens
Qwen3-235B-A22B
Teams that want a serious open-weight frontier alternative with strong multilingual and agentic behavior.
Reasoning 90.0
Coding 91.0
128K tokens
DeepSeek V4 Pro
Cost-sensitive advanced reasoning and coding where context size still matters.
Reasoning 89.0
Coding 89.0
1M tokens
Grok 4.3
Teams evaluating xAI for general chat
Reasoning 90.0
Coding 89.0
1M tokens
Claude Sonnet 4.6
Balanced production use where you want strong reasoning
Reasoning 91.0
Coding 92.0
1M tokens