OpenAI ships GPT-5.4: three variants and a 1.05M-token context

What is new

GPT-5.4 is OpenAI’s most capable and most efficient model to date. It comes in three variants.

The context window is 1,050,000 tokens. For dev workflows, this means the model can see the full repo context in a single call without chunking or RAG, as long as the repository fits within that window.
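
To check the "full repo in a single call" claim against a specific codebase, a rough token estimate is usually enough. The sketch below is a minimal one. It assumes about 4 characters per token (a common heuristic, not the model's actual tokenizer) and a hypothetical 50,000-token reserve for instructions and the reply. The function names and the file-suffix filter are illustrative.

```python
from pathlib import Path

CONTEXT_WINDOW_TOKENS = 1_050_000  # figure quoted in this post
CHARS_PER_TOKEN = 4                # assumption: rough heuristic, real tokenizers vary
RESERVED_TOKENS = 50_000           # assumption: headroom for instructions and the reply


def estimate_tokens(text: str) -> int:
    """Rough token estimate: character count divided by CHARS_PER_TOKEN, rounded up."""
    return -(-len(text) // CHARS_PER_TOKEN)


def estimate_repo_tokens(root, suffixes=(".py", ".md", ".toml")) -> int:
    """Sum the rough token estimates of all files under root with a matching suffix."""
    total = 0
    for path in Path(root).rglob("*"):
        if path.is_file() and path.suffix in suffixes:
            text = path.read_text(encoding="utf-8", errors="ignore")
            total += estimate_tokens(text)
    return total


def fits_in_context(token_count: int,
                    window: int = CONTEXT_WINDOW_TOKENS,
                    reserved: int = RESERVED_TOKENS) -> bool:
    """True if the estimate fits in the window after subtracting the reserve."""
    return token_count <= window - reserved


if __name__ == "__main__":
    tokens = estimate_repo_tokens(".")
    print(f"estimated tokens: {tokens}, fits: {fits_in_context(tokens)}")
```

If the estimate lands well under the limit, sending the whole repository is an option. If it is close to the limit, count with a real tokenizer before relying on it.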

GPT-5.4 hit a record 83% on OpenAI’s internal GDPval test and set new bests on computer-use benchmarks (OSWorld-Verified, WebArena Verified).

For clients who were on the fence about deploying an AI agent on complex tasks, this is the inflection point:

Reasoning: the Thinking variant unlocks task-decomposition workflows that previously failed at the planning step
Long context: removes much of the RAG engineering work for an estimated 80% of use cases
Three pricing tiers: start on Standard and switch production-critical paths to Pro
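
The tier advice above can be written down as a small routing rule. This is a sketch under stated assumptions: the model identifiers are placeholders rather than confirmed API names, the Task fields are hypothetical, and sending planning-heavy work to the Thinking variant is one reading of the points above, not a documented recommendation.

```python
from dataclasses import dataclass

# Placeholder identifiers: assumptions for illustration, not confirmed API model names.
MODEL_BY_TIER = {
    "standard": "gpt-5.4-standard-placeholder",
    "thinking": "gpt-5.4-thinking-placeholder",
    "pro": "gpt-5.4-pro-placeholder",
}


@dataclass(frozen=True)
class Task:
    """Hypothetical task description used only for routing."""
    name: str
    production_critical: bool = False
    needs_planning: bool = False


def pick_tier(task: Task) -> str:
    """Default to Standard; escalate only when the task calls for it."""
    if task.production_critical:
        return "pro"
    if task.needs_planning:
        return "thinking"
    return "standard"


def pick_model(task: Task) -> str:
    """Map the chosen tier to its (placeholder) model identifier."""
    return MODEL_BY_TIER[pick_tier(task)]


if __name__ == "__main__":
    for task in (
        Task("summarize changelog"),
        Task("plan multi-step migration", needs_planning=True),
        Task("invoice reconciliation", production_critical=True),
    ):
        print(task.name, "->", pick_model(task))
```

Keeping the rule in one function means a path can be moved between tiers by changing a flag on the task instead of editing every call site.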

I expect the competition (Claude, Gemini) to ship upgrades within a month, with prices and context windows going up and entry barriers coming down.