Models Tested: 154
Providers: 19
New This Month: 9 models added
Latest Added: DeepSeek V4 Pro (2026-05)
1–50 of 154
| # | Model | Provider | Config | Score ↓ | Success | | Date |
|---|---|---|---|---|---|---|---|
| 1 | GPT-5.5 | OpenAI | low | 80.3 | 79.7% | 86 | 2026-04 |
| 2 | ↳ GPT-5.5 | | xhigh | 77.6 | 78.4% | 70 | 2026-04 |
| 3 | ↳ GPT-5.5 | | high | 77.4 | 77.3% | 78 | 2026-04 |
| 4 | Claude Opus 4.6 | Anthropic | high · 8,192 tokens | 76.4 | 77.2% | 70 | 2026-04 |
| 5 | Claude Sonnet 4.6 | Anthropic | high · 8,192 tokens | 76.3 | 77.2% | 69 | 2026-04 |
| 6 | Claude Opus 4.6 Fast | Anthropic | — | 76.2 | 77.2% | 68 | 2026-04 |
| 7 | Claude Opus 4.6 | Anthropic | medium · 2,048 tokens | 76.2 | 77.2% | 68 | 2026-04 |
| 8 | ↳ Claude Opus 4.6 | | — | 76.1 | 77.2% | 67 | 2026-02 |
| 9 | GPT-5.5 | OpenAI | — | 76.0 | 75.2% | 83 | 2026-04 |
| 10 | Claude Sonnet 4.6 | Anthropic | medium · 2,048 tokens | 76.0 | 77.0% | 67 | 2026-04 |
| 11 | GLM 5 | Z.AI | — | 75.6 | 77.2% | 61 | 2026-02 |
| 12 | GPT 5.4 | OpenAI | high | 74.8 | 73.5% | 87 | 2026-04 |
| 13 | Kimi K2.6 | Moonshotai | low | 74.2 | 73.6% | 80 | 2026-04 |
| 14 | GPT-5.5 | OpenAI | medium | 73.9 | 72.7% | 85 | 2026-04 |
| 15 | Claude Sonnet 4 | Anthropic | — | 72.6 | 73.7% | 63 | 2025-08 |
| 16 | DeepSeek V4 Pro | DeepSeek | gen | 72.5 | 72.9% | 69 | 2026-05 |
| 17 | Claude Sonnet 4.5 | Anthropic | — | 72.3 | 73.2% | 64 | 2025-10 |
| 18 | Claude Opus 4.7 | Anthropic | medium · 2,048 tokens | 72.2 | 72.7% | 68 | 2026-04 |
| 19 | Claude Sonnet 4.6 | Anthropic | — | 72.2 | 72.7% | 68 | 2026-02 |
| 20 | GPT 5.2 | OpenAI | — | 72.1 | 71.6% | 77 | 2025-12 |
| 21 | Claude Opus 4.5 | Anthropic | — | 71.6 | 72.7% | 62 | 2025-11 |
| 22 | GPT-5.5 | OpenAI | none | 71.5 | 69.9% | 87 | 2026-04 |
| 23 | Claude Opus 4.1 | Anthropic | — | 71.3 | 73.2% | 55 | 2025-08 |
| 24 | Gemma 4 26B A4B | Google | none | 71.0 | 72.2% | 61 | 2026-04 |
| 25 | Gemma 4 31B | Google | medium | 70.9 | 70.5% | 75 | 2026-04 |
| 26 | Claude Opus 4 | Anthropic | — | 70.8 | 72.3% | 58 | 2025-08 |
| 27 | GPT 4.1 | OpenAI | — | 70.8 | 73.1% | 50 | 2025-08 |
| 28 | GPT 5.3 Chat | OpenAI | — | 70.2 | 70.9% | 64 | 2026-03 |
| 29 | Gemma 4 31B | Google | none | 70.0 | 70.0% | 70 | 2026-04 |
| 30 | Horizon Beta | Other | — | 69.8 | 70.7% | 62 | 2025-08 |
| 31 | GPT 5.1 | OpenAI | — | 69.8 | 69.8% | 69 | 2025-11 |
| 32 | DeepSeek V4 Pro | DeepSeek | none | 69.4 | 70.5% | 60 | 2026-05 |
| 33 | GPT 5.1 Codex | OpenAI | — | 69.4 | 68.1% | 81 | 2025-11 |
| 34 | DeepSeek V3.2 Speciale | DeepSeek | — | 69.3 | 69.9% | 64 | 2026-02 |
| 35 | Kimi K2 Thinking | Moonshot AI | | 69.2 | 69.2% | 70 | 2025-12 |
| 36 | Gemma 4 26B A4B | Google | low | 69.1 | 70.1% | 61 | 2026-04 |
| 37 | DeepSeek V4 Flash | DeepSeek | medium | 68.8 | 68.7% | 70 | 2026-05 |
| 38 | GPT 5 | OpenAI | — | 68.8 | 69.8% | 59 | 2025-09 |
| 39 | GPT 4o | OpenAI | — | 68.7 | 68.6% | 70 | 2025-08 |
| 40 | GPT 5.3 Codex | OpenAI | — | 68.6 | 67.1% | 82 | 2026-02 |
| 41 | Claude Sonnet 3.7 | Anthropic | | 68.6 | 68.9% | 66 | 2025-08 |
| 42 | DeepSeek V3.2 Exp | DeepSeek | — | 68.6 | 68.5% | 69 | 2025-12 |
| 43 | Gemma 4 31B | Google | high | 68.5 | 68.3% | 71 | 2026-04 |
| 44 | o3 mini | OpenAI | — | 68.5 | 69.0% | 64 | 2025-08 |
| 45 | DeepSeek V4 Pro | DeepSeek | medium | 68.4 | 69.0% | 64 | 2026-05 |
| 46 | o1 mini | OpenAI | — | 68.3 | 68.8% | 64 | 2025-08 |
| 47 | Claude Sonnet 3.5 | Anthropic | — | 68.0 | 67.0% | 77 | 2025-08 |
| 48 | DeepSeek R1 | DeepSeek | — | 67.9 | 67.5% | 72 | 2025-08 |
| 49 | DeepSeek V4 Pro | DeepSeek | low | 67.8 | 67.5% | 71 | 2026-05 |
| 50 | o4 mini | OpenAI | — | 67.7 | 67.5% | 69 | 2025-08 |