---
title: "Best LLM for Coding"
description: "Compare all proprietary and open source models across programming benchmarks including SWE-Bench, LiveCodeBench, Aider Polyglot, BFCL tool use, and more."
canonical_url: "https://www.vellum.ai/best-llm-for-coding"
md_url: "https://www.vellum.ai/md/best-llm-for-coding"
type: "leaderboard"
---

# Best LLM for Coding

Compare all proprietary and open source models across programming benchmarks including SWE-Bench, LiveCodeBench, Aider Polyglot, BFCL tool use, and more.

## Top Models for Coding

### Best in Live CodeBench

| Rank | Model | Score |
| --- | --- | --- |
| 1 | DeepSeek V4 Pro | 93.5% |
| 2 | DeepSeek V4 Flash | 91.6% |
| 3 | Kimi K2 Thinking | 83.1% |
| 4 | Gemini 3 Pro | 79.7% |
| 5 | Grok 3 [Beta] | 79.4% |

### Best in Agentic Coding (SWE Bench)

| Rank | Model | Score |
| --- | --- | --- |
| 1 | Claude Mythos 5 | 95.5% |
| 2 | Claude Fable 5 | 95% |
| 3 | Claude Opus 4.8 | 88.6% |
| 4 | Claude Opus 4.7 | 87.6% |
| 5 | Claude Sonnet 5 | 85.2% |

### Best in Tool Use (BFCL)

| Rank | Model | Score |
| --- | --- | --- |
| 1 | GPT-4.5  | 69.94% |
| 2 | OpenAI o3-mini | 65.12% |
| 3 | Qwen2.5-VL-32B | 62.79% |
| 4 | Gemma 3 27b | 59.11% |
| 5 | DeepSeek V3 0324 | 58.55% |

## All Coding Models



| Model | Provider | Context Window | Input Cost (1M) | Output Cost (1M) | Knowledge Cutoff |

| --- | --- | --- | --- | --- | --- |

| Claude Opus 4.7 | Anthropic | 128,000 | $5 | $25 | Apr 2026 |

| Claude Opus 4.6 | Anthropic | 128,000 | $5 | $25 | May 2025 |

| Claude Sonnet 4.6 | Anthropic | 64,000 | $3 | $15 | Aug 2025 |

| GPT-5.3 Codex | OpenAI | 128,000 | $1.75 | $14 | Aug 2025 |

| DeepSeek V3 0324 | DeepSeek | 8,000 | $0.27 | $1.1 | Dec 2024 |

| Qwen2.5-VL-32B | Qwen | 8,000 | - | - | Dec 2024 |

| OpenAI o1-mini | OpenAI | 8,000 | $3 | $12 | Dec 2024 |

| OpenAI o3-mini | OpenAI | 8,000 | $1.1 | $4.4 | Dec 2024 |

| DeepSeek-R1 | DeepSeek | 8,000 | $0.55 | $2.19 | Dec 2024 |

| Claude 3.7 Sonnet [R] | Anthropic | 64,000 | $3 | $15 | Nov 2024 |

| GPT-4.5  | OpenAI | 16,384 | $75 | $150 | Nov 2024 |

| Claude 3.7 Sonnet | Anthropic | 128,000 | $3 | $15 | Nov 2024 |

| Gemini 2.5 Pro | Google | 65,000 | $1.25 | $10 | Nov 2024 |

| Grok 3 [Beta] | xAI | / | - | - | Nov 2024 |

| Gemma 3 27b | Google | 8192 | $0.07 | $0.07 | Nov 2024 |

| Llama 4 Maverick | Meta | 8,000 | $0.2 | $0.6 | November 2024 |

| Llama 4 Scout | Meta | 8,000 | $0.11 | $0.34 | November 2024 |

| Llama 4 Behemoth | Meta | - | - | - | November 2024 |

| GPT-4.1 | OpenAI | 16,000 | $2 | $8 | December 2024 |

| GPT-4.1 mini | OpenAI | 16,000 | $0.4 | $1.6 | December 2024 |

| GPT-4.1 nano | OpenAI | 32,000 | $0.1 | $0.4 | December 2024 |

| Claude 4 Sonnet | Anthropic | 64,000 | $3 | $15 | Mar 2025 |

| Claude 4 Opus | Anthropic | 32,000 | $15 | $75 | Mar 2025 |

| GPT oss 120b | OpenAI | 131,072 | $0.15 | $0.6 | April 2025 |

| GPT oss 20b | OpenAI | 131,072 | $0.08 | $0.35 | April 2025 |

| Claude Opus 4.1 | Anthropic | 32,000 | $15 | $75 | April 2025 |

| GPT-5 | OpenAI | 128,000 | $1.25 | $10 | April 2025 |

| GPT 5.1 | OpenAI | 128,000 | $1.25 | $10 | April 2025 |

| Kimi K2 Thinking | Kimi | 16,400 | $0.6 | $2.5 | April 2025 |

| Gemini 3 Pro | Google | 650000 | $2 | $12 | April 2025 |

| Claude Sonnet 4.5 | Anthropic | 160000 | $3 | $15 | April 2025 |

| Claude Opus 4.5 | Anthropic | 64,000 | $5 | $25 | April 2025 |

| GPT 5.2 | OpenAI | 16,000 | $1.5 | $14 | Aug 2025 |

| Claude Fable 5 | Anthropic | 128,000 | $10 | $50 | Jan 2026 |

| Claude Mythos 5 | Anthropic | 128,000 | $10 | $50 | Jan 2026 |

| Claude Opus 4.8 | Anthropic | 128,000 | $5 | $25 | Jan 2026 |

| Claude Sonnet 5 | Anthropic | 128,000 | $3 | $15 | Jan 2026 |

| DeepSeek V4 Flash | DeepSeek | 384000 | $0.14 | $0.28 | Jan 2026 |

| DeepSeek V4 Pro | DeepSeek | 384000 | $0.435 | $0.87 | Jan 2026 |

| Gemini 3.1 Pro | Google | 65,536 | $2 | $12 | Jan 2026 |

| Gemini 3.5 Flash | Google | 65,536 | $1.5 | $9 | Jan 2026 |

| GLM 5.2 | Z-AI | 128,000 | $0.95 | $3 | Mar 2026 |

| GPT-5.5 | OpenAI | 128,000 | $5 | $30 | Apr 2026 |

| GPT-5.5 Pro | OpenAI | 128,000 | $30 | $180 | Apr 2026 |

| MiniMax M3 | MiniMax | 512,000 | $0.6 | $2.4 | Mar 2026 |
