Updated May 2026 · 8 models tracked

Compare AI Models Side by Side

Unbiased comparisons of Claude, GPT-4o, Gemini, and more. Find the best AI model for your use case — with real benchmarks, up-to-date pricing, and honest pros & cons.

Benchmark Comparisons

MMLU, HumanEval, MATH and more — see how every model stacks up on standardized tests.

Transparent Pricing

Up-to-date API pricing per million tokens so you can estimate costs before committing.

Capability Matrix

Context window, multimodal support, max output — every spec in one place.

Use-Case Guides

Not sure which AI to pick? Our guides match you to the right model for coding, writing, research, and more.

All AI Models

| Model | Provider | Tier | Context (tokens) | Input / 1M tokens | Output / 1M tokens | MMLU | Multimodal |
| Claude Sonnet 4.6 | Anthropic | frontier | 200K | $3.00 | $15.00 | 88.7 | Yes |
| GPT-4o | OpenAI | frontier | 128K | $2.50 | $10.00 | 88.7 | Yes |
| Llama 3.1 405B | Meta | frontier | 128K | $3.00 | $3.00 | 88.6 | No |
| Grok 2 | xAI | frontier | 131K | $2.00 | $10.00 | 87.5 | No |
| Gemini 1.5 Pro | Google | frontier | 2.0M | $1.25 | $5.00 | 85.9 | Yes |
| GPT-4o mini | OpenAI | budget | 128K | $0.150 | $0.600 | 82 | Yes |
| Gemini 1.5 Flash | Google | budget | 1.0M | $0.075 | $0.300 | 78.9 | Yes |
| Claude Haiku 4.5 | Anthropic | budget | 200K | $0.800 | $4.00 | 73.8 | Yes |
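
The per-million-token prices listed above can be turned into a rough cost estimate before committing to a model. The sketch below is only an illustration: the `PRICES` dictionary and the `estimate_cost` helper are hypothetical names, the prices are copied from the table for four of the models, and the workload (10,000 requests of 2,000 input and 500 output tokens each) is an assumed example, not a measurement.

```python
# Prices in USD per 1M tokens as (input, output), copied from the table above.
# PRICES and estimate_cost are illustrative names, not part of any provider API.
PRICES = {
    "Claude Sonnet 4.6": (3.00, 15.00),
    "GPT-4o": (2.50, 10.00),
    "Gemini 1.5 Pro": (1.25, 5.00),
    "GPT-4o mini": (0.150, 0.600),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    input_price, output_price = PRICES[model]
    # Prices are quoted per one million tokens, so divide the total by 1,000,000.
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Assumed workload: 10,000 requests, each 2,000 input and 500 output tokens.
    requests = 10_000
    total_input = requests * 2_000
    total_output = requests * 500
    for model in PRICES:
        cost = estimate_cost(model, total_input, total_output)
        print(f"{model}: ${cost:.2f}")
```

Under these assumptions the same workload comes to about $135 on Claude Sonnet 4.6, $100 on GPT-4o, $50 on Gemini 1.5 Pro, and $6 on GPT-4o mini. Output tokens usually cost several times more than input tokens, so the input-to-output ratio of a workload can matter as much as the headline input price.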

Top Frontier Models

Anthropic

Claude Sonnet 4.6

frontier

Anthropic's latest and most capable model, excelling at complex reasoning, coding, and nuanced instruction following.

Context: 200K
Input / 1M: $3.00
coding, analysis, writing
OpenAI

GPT-4o

frontier

OpenAI's flagship omnimodal model with strong performance across text, vision, and audio tasks.

Context: 128K
Input / 1M: $2.50
general, coding, vision
Google

Gemini 1.5 Pro

frontier

Google's most capable model with an industry-leading 2M token context window and strong multimodal capabilities.

Context: 2.0M
Input / 1M: $1.25
long documents, video analysis, research