WWhichAITry via AI/ML API

Updated Feb 2026 · 20 models tracked

Pick the right AI model
in 60 seconds.

Independent, opinionated rankings of every major LLM. Compare price, context, and real-world fit for coding, writing, agents, and research — without wading through a dozen vendor sites.

Find my model →Browse all comparisons Try via AI/ML API

Best for coding →Best for writing →Best for agents →Best for reasoning →Best for cheap bulk →Best for vision →

Head-to-head

Most-compared this month

See all 190 →

Claude Opus 4.7 vs GPT-5

Anthropic ↔ OpenAI

Claude Sonnet 4.6 vs GPT-5 Mini

Anthropic ↔ OpenAI

Gemini 3 Pro vs Claude Opus 4.7

Google ↔ Anthropic

Gemini 3 Flash vs GPT-5 Mini

Google ↔ OpenAI

DeepSeek V3 vs Qwen 3 Coder

DeepSeek ↔ Alibaba

Claude Haiku 4.5 vs Gemini 3 Flash

Anthropic ↔ Google

Llama 4 405B vs GPT-5

Meta ↔ OpenAI

o4 vs Claude Opus 4.7

OpenAI ↔ Anthropic

Our picks

Top models for coding

Full ranking →

Claude Opus 4.7

Anthropic's flagship. Deep reasoning, long-horizon agents.

DeepSeek V3

Open-weights reasoning at bargain pricing.

Claude Sonnet 4.6

Sonnet strikes back. The price-performance sweet spot.

Cheapest that still ship

Top models for bulk workloads

Full ranking →

Claude Haiku 4.5

Fast, cheap, surprisingly smart.

Gemini 3 Flash

Cheap, fast, 2M-token context.

Gemini Nano 2

On-device first. Free inference, private by default.

Hard problems only

Top models for reasoning

Full ranking →

Claude Opus 4.7

Anthropic's flagship. Deep reasoning, long-horizon agents.

GPT-5

OpenAI's flagship. Omni-modal, omni-purpose.

o4

Reasoning model. Thinks before it speaks.

Everything

All tracked models

Claude Opus 4.7

Anthropic's flagship. Deep reasoning, long-horizon agents.

Claude Sonnet 4.6

Sonnet strikes back. The price-performance sweet spot.

Claude Haiku 4.5

Fast, cheap, surprisingly smart.

GPT-5

OpenAI's flagship. Omni-modal, omni-purpose.

GPT-5 Mini

GPT-5 shrunk for scale.

GPT-4.1

Long-context workhorse still on the menu.

o4

Reasoning model. Thinks before it speaks.

Gemini 3 Pro

Google's flagship. Massive context, native multimodality.

Gemini 3 Flash

Cheap, fast, 2M-token context.

Grok 4

Real-time-aware, unfiltered, strong reasoner.

Llama 4 405B

Open-weights heavyweight. Self-host or run anywhere.

Llama 4 70B

Mid-size open model. Excellent price-performance when self-hosted.

DeepSeek V3

Open-weights reasoning at bargain pricing.

Mistral Large 3

European flagship with strong multilingual chops.

Mistral Small 3

Small, open-weights, surprisingly sharp.

Qwen 3 Max

Alibaba's flagship. Top-tier multilingual and coding.

Qwen 3 Coder

Open-weights coding specialist.

Command R+

RAG-tuned enterprise workhorse.

GPT-OSS 20B

OpenAI's open-weights return.

Gemini Nano 2

On-device first. Free inference, private by default.

By use case

Pick a guide

Best AI models for coding

Which LLM ships the most working code per dollar in 2026.

Best AI models for writing

Long-form, marketing, and editorial — ranked by style not benchmarks.

Best AI models for chat

Conversational assistants that feel fast and natural.

Best AI models for agents

Tool use, planning, long-horizon reliability.

Best AI models for summarization

Distill big inputs down, cheaply and faithfully.

Best AI models for translation

Multilingual quality that holds up across languages.

Best AI models for reasoning

Math, logic, proofs, hard planning.

Best AI models for research

Long documents, citations, grounded answers.

Best AI models for vision and multimodal

Image and video understanding, diagram reading, OCR.

Cheapest AI models for bulk workloads

High-throughput workloads where cost dominates.

Beyond text

Other AI you'll want

Best AI voice generators

ElevenLabs, PlayHT, Murf, and open-weights alternatives — ranked for production use.

Best AI calendar assistants

Reclaim, Motion, Clockwise, Sunsama — honest ranking of AI schedulers for ICs and teams.