# | Model | Resolved Rate (%) | Pass@5 (%) | Cost per Problem ($) | Tokens per Problem |
|---|---|---|---|---|---|
| 1 | gpt-5.5-2026-04-23-xhigh Model | 62.7%± 0.91% | 70.0% | $2.25 | 2,120,66090.0% cached |
| 2 | Junie Agent | 61.6%± 0.64% | 72.7% | $1.84 | 1,866,49791.6% cached |
| 3 | Codex Agent | 60.4%± 1.37% | 71.8% | $1.75 | 1,898,13192.5% cached |
| 4 | Claude Code Agent | 59.6%± 1.98% | 72.7% | $1.74 | 1,878,24893.6% cached |
| 5 | gpt-5.5-2026-04-23-medium Model | 58.9%± 0.78% | 70.0% | $0.98 | 708,41883.5% cached |
| 6 | Claude Opus 4.8-xhigh Model | 56.5%± 1.20% | 67.3% | $2.02 | 2,479,38795.3% cached |
| 7 | gpt-5.4-2026-03-05-medium Model | 54.9%± 1.02% | 70.9% | $0.60 | 834,45283.5% cached |
| 8 | Claude Opus 4.7-high Model | 53.1%± 1.45% | 66.4% | $1.32 | 1,526,13594.2% cached |
| 9 | Cursor Agent | 53.0%± 0.53% | 64.5% | $0.23 | 1,031,65398.7% cached |
| 10 | Claude Sonnet 4.6 Model | 51.3%± 0.55% | 63.6% | $1.29 | 2,644,57795.6% cached |
| 11 | Gemini 3.1 Pro Preview Model | 51.1%± 1.20% | 66.4% | $0.75 | 1,545,44580.1% cached |
| 12 | GLM-5.1 Model | 50.7%± 0.93% | 65.5% | $0.94 | 2,664,00191.8% cached |
| 13 | Gemini 3.5 Flash Model | 49.5%± 0.98% | 61.8% | $0.77 | 1,848,59375.7% cached |
| 14 | Claude Opus 4.6-high Model | 47.8%± 1.37% | 60.9% | $1.53 | 1,828,64993.6% cached |
| 15 | Kimi K2.6 Model | 46.5%± 1.27% | 64.5% | $0.61 | 2,466,97790.4% cached |
| 16 | MiniMax M3 Model | 45.6%± 1.27% | 67.3% | $1.06 | 6,885,81893.5% cached |
| 17 | GLM-4.7 Model | 38.2%± 0.86% | 59.1% | $0.39 | 2,256,18286.4% cached |
| 18 | Claude Opus 4.1 Model | N/A | N/A | N/A | N/A |
| 19 | Claude Opus 4.5 Model | N/A | N/A | N/A | N/A |
| 20 | Claude Sonnet 3.5 Model | N/A | N/A | N/A | N/A |
| 21 | Claude Sonnet 4 Model | N/A | N/A | N/A | N/A |
| 22 | Claude Sonnet 4.5 Model | N/A | N/A | N/A | N/A |
| 23 | DeepSeek-R1-0528 Model | N/A | N/A | N/A | N/A |
| 24 | DeepSeek-V3 Model | N/A | N/A | N/A | N/A |
| 25 | DeepSeek-V3-0324 Model | N/A | N/A | N/A | N/A |
| 26 | DeepSeek-V3-0324 Model | N/A | N/A | N/A | N/A |
| 27 | DeepSeek-V3.1 Model | N/A | N/A | N/A | N/A |
| 28 | DeepSeek-V3.2 Model | N/A | N/A | N/A | N/A |
| 29 | Devstral-2-123B-Instruct-2512 Model | N/A | N/A | N/A | N/A |
| 30 | Devstral-Small-2-24B-Instruct-2512 Model | N/A | N/A | N/A | N/A |
| 31 | Devstral-Small-2505 Model | N/A | N/A | N/A | N/A |
| 32 | Gemini 3 Flash Preview Model | N/A | N/A | N/A | N/A |
| 33 | Gemini 3 Pro Preview Model | N/A | N/A | N/A | N/A |
| 34 | gemini-2.0-flash Model | N/A | N/A | N/A | N/A |
| 35 | gemini-2.0-flash Model | N/A | N/A | N/A | N/A |
| 36 | gemini-2.5-flash Model | N/A | N/A | N/A | N/A |
| 37 | gemini-2.5-flash-preview-05-20 no-thinking Model | N/A | N/A | N/A | N/A |
| 38 | gemini-2.5-flash-preview-05-20 no-thinking Model | N/A | N/A | N/A | N/A |
| 39 | gemini-2.5-pro Model | N/A | N/A | N/A | N/A |
| 40 | Gemma 4 31B Model | N/A | N/A | N/A | N/A |
| 41 | gemma-3-27b-it Model | N/A | N/A | N/A | N/A |
| 42 | GLM-4.5 Model | N/A | N/A | N/A | N/A |
| 43 | GLM-4.5 Air Model | N/A | N/A | N/A | N/A |
| 44 | GLM-4.6 Model | N/A | N/A | N/A | N/A |
| 45 | GLM-4.7 Flash Model | N/A | N/A | N/A | N/A |
| 46 | GLM-5 Model | N/A | N/A | N/A | N/A |
| 47 | GLM-5.1 Model | N/A | N/A | N/A | N/A |
| 48 | gpt-4.1-2025-04-14 Model | N/A | N/A | N/A | N/A |
| 49 | gpt-4.1-2025-04-14 Model | N/A | N/A | N/A | N/A |
| 50 | gpt-4.1-mini-2025-04-14 Model | N/A | N/A | N/A | N/A |
| 51 | gpt-4.1-mini-2025-04-14 Model | N/A | N/A | N/A | N/A |
| 52 | gpt-4.1-nano-2025-04-14 Model | N/A | N/A | N/A | N/A |
| 53 | gpt-5-2025-08-07-high Model | N/A | N/A | N/A | N/A |
| 54 | gpt-5-2025-08-07-medium Model | N/A | N/A | N/A | N/A |
| 55 | gpt-5-2025-08-07-minimal Model | N/A | N/A | N/A | N/A |
| 56 | gpt-5-codex Model | N/A | N/A | N/A | N/A |
| 57 | gpt-5-mini-2025-08-07-high Model | N/A | N/A | N/A | N/A |
| 58 | gpt-5-mini-2025-08-07-medium Model | N/A | N/A | N/A | N/A |
| 59 | gpt-5.1-codex Model | N/A | N/A | N/A | N/A |
| 60 | gpt-5.1-codex-max Model | N/A | N/A | N/A | N/A |
| 61 | gpt-5.2-2025-12-11-medium Model | N/A | N/A | N/A | N/A |
| 62 | gpt-5.2-2025-12-11-xhigh Model | N/A | N/A | N/A | N/A |
| 63 | gpt-5.2-codex Model | N/A | N/A | N/A | N/A |
| 64 | gpt-5.3-codex Model | N/A | N/A | N/A | N/A |
| 65 | gpt-5.3-codex-xhigh Model | N/A | N/A | N/A | N/A |
| 66 | gpt-oss-120b Model | N/A | N/A | N/A | N/A |
| 67 | gpt-oss-120b-high Model | N/A | N/A | N/A | N/A |
| 68 | gpt-oss-20b Model | N/A | N/A | N/A | N/A |
| 69 | Grok 4 Model | N/A | N/A | N/A | N/A |
| 70 | Grok Code Fast 1 Model | N/A | N/A | N/A | N/A |
| 71 | horizon-alpha Model | N/A | N/A | N/A | N/A |
| 72 | horizon-beta Model | N/A | N/A | N/A | N/A |
| 73 | Kimi K2 Model | N/A | N/A | N/A | N/A |
| 74 | Kimi K2 Instruct 0905 Model | N/A | N/A | N/A | N/A |
| 75 | Kimi K2 Thinking Model | N/A | N/A | N/A | N/A |
| 76 | Kimi K2.5 Model | N/A | N/A | N/A | N/A |
| 77 | Llama-3.3-70B-Instruct Model | N/A | N/A | N/A | N/A |
| 78 | Llama-4-Maverick-17B-128E-Instruct Model | N/A | N/A | N/A | N/A |
| 79 | Llama-4-Scout-17B-16E-Instruct Model | N/A | N/A | N/A | N/A |
| 80 | MiniMax M2 Model | N/A | N/A | N/A | N/A |
| 81 | MiniMax M2.1 Model | N/A | N/A | N/A | N/A |
| 82 | MiniMax M2.5 Model | N/A | N/A | N/A | N/A |
| 83 | MiniMax M2.7 Model | N/A | N/A | N/A | N/A |
| 84 | o3-2025-04-16 Model | N/A | N/A | N/A | N/A |
| 85 | o4-mini-2025-04-16 Model | N/A | N/A | N/A | N/A |
| 86 | Qwen2.5-72B-Instruct Model | N/A | N/A | N/A | N/A |
| 87 | Qwen2.5-Coder-32B-Instruct Model | N/A | N/A | N/A | N/A |
| 88 | Qwen3-235B-A22B Model | N/A | N/A | N/A | N/A |
| 89 | Qwen3-235B-A22B no-thinking Model | N/A | N/A | N/A | N/A |
| 90 | Qwen3-235B-A22B thinking Model | N/A | N/A | N/A | N/A |
| 91 | Qwen3-235B-A22B-Instruct-2507 Model | N/A | N/A | N/A | N/A |
| 92 | Qwen3-235B-A22B-Thinking-2507 Model | N/A | N/A | N/A | N/A |
| 93 | Qwen3-30B-A3B-Instruct-2507 Model | N/A | N/A | N/A | N/A |
| 94 | Qwen3-30B-A3B-Thinking-2507 Model | N/A | N/A | N/A | N/A |
| 95 | Qwen3-32B Model | N/A | N/A | N/A | N/A |
| 96 | Qwen3-32B no-thinking Model | N/A | N/A | N/A | N/A |
| 97 | Qwen3-32B thinking Model | N/A | N/A | N/A | N/A |
| 98 | Qwen3-Coder-30B-A3B-Instruct Model | N/A | N/A | N/A | N/A |
| 99 | Qwen3-Coder-480B-A35B-Instruct Model | N/A | N/A | N/A | N/A |
| 100 | Qwen3-Coder-Next Model | N/A | N/A | N/A | N/A |
| 101 | Qwen3-Next-80B-A3B-Instruct Model | N/A | N/A | N/A | N/A |
| 102 | Qwen3.5-27B Model | N/A | N/A | N/A | N/A |
| 103 | Qwen3.5-35B-A3B Model | N/A | N/A | N/A | N/A |
| 104 | Qwen3.5-397B-A17B Model | N/A | N/A | N/A | N/A |
| 105 | Step-3.5-Flash Model | N/A | N/A | N/A | N/A |
Inspect button.