AI models

OpenAI ships GPT-5.5 Instant as the new ChatGPT default


Read · 3 min

Detail view of a server rack inside a data centre, illustrating the AI infrastructure powering ChatGPT.
GPT-5.5 Instant became the default ChatGPT model on 5 May 2026, with halved hallucination rates in regulated domains.Photo: Pexels

OpenAI rolled out GPT-5.5 Instant as the default ChatGPT model on 5 May 2026, replacing GPT-5.3 Instant for free, Plus and Team users. The company said the upgrade roughly halves hallucination rates in legal, medical and financial queries while preserving sub-300-millisecond first-token latency.

Key facts

  • Release date: 5 May 2026 (Pacific Time), staged over 48 hours.
  • Replaces: GPT-5.3 Instant, in service since 14 January 2026.
  • Hallucination cut: 47 percent reduction on the company's HALU-Eval-Pro benchmark.
  • Latency: median first-token 273 ms; full 500-token answer 4.1 s on the OpenAI Evals harness.
  • Pricing: USD 3.00 per million input tokens, USD 12.00 per million output tokens via the API — unchanged from GPT-5.3 Instant.

The model is the first major default refresh since OpenAI's January 2026 reset of the consumer ChatGPT experience. Tech Startups reported that GPT-5.5 Instant ships with a tighter retrieval layer that lets the model attach paragraph-level citations from authoritative sources for medicine, law and finance prompts. OpenAI president Greg Brockman called the change "a step toward citation-by-default for high-stakes verticals".

What is actually different

GPT-5.5 Instant introduces three things consumers will notice. First, a refusal pathway flagged "high-risk verification" prompts the user when a question requires a licensed professional, with explicit pointers to local equivalents (the Conseil de l'Ordre for French legal questions, for instance). Second, the model now refuses to invent case-law citations: a long-running pain point in US legal-tech deployments documented since 2023. Third, financial-advice prompts default to disclosing the model's training cut-off and surface live data only via verified plug-ins.

For Luxembourg users in particular, the model handles French and German with closer-to-native fluency in regulatory contexts. FinTech Futures reported that two CSSF-regulated fund administrators have begun internal pilots of GPT-5.5 Instant for prospectus drafting, citing the model's improved understanding of UCITS and AIFMD terminology.

Why latency matters more than capability now

The flagship reasoning model GPT-5.5 Pro, available since March 2026, is more capable on hard benchmarks. But the Instant tier handles the bulk of consumer traffic — an estimated 92 percent of free-tier requests — and the company has been under pressure to make the default model both faster and safer. The 273-millisecond first-token figure is the lowest OpenAI has published for any frontier model and undercuts Anthropic's Haiku 4.5 (310 ms) and Google's Gemini 3.0 Flash (290 ms).

The competitive picture

OpenAI's release lands days after Anthropic published a blog about a constitutional update to Claude 4.5 and a week after Meta detailed its 2026 capex plans (USD 91 billion, of which USD 76 billion goes to AI infrastructure). The big four hyperscalers — Meta, Amazon, Microsoft and Alphabet — together signal roughly USD 725 billion in 2026 AI capex, a 75 percent year-over-year jump. eWeek reported that the same companies announced more than 33,000 layoffs in April, mostly in middle-management and product roles outside core AI groups.

Regulatory friction

The EU AI Act's omnibus revision, currently stalled in the European Parliament, would treat default consumer AI assistants as "general-purpose AI with systemic risk" once they cross 100 million monthly active EU users. ChatGPT clears that threshold; OpenAI confirmed on 6 May that GPT-5.5 Instant has been notified to the EU AI Office. The Luxembourg regulator ILR continues to coordinate with the AI Office on cross-border transparency reporting.

The Luxembourg angle

The state digital agency, GovTech Luxembourg, runs a sovereign-deployment LLM for ministry workflows that does not include OpenAI models. But the Chamber of Deputies' research service quietly began using ChatGPT Plus seats in 2025 for translation drafting between Luxembourgish, French and German. With GPT-5.5 Instant's tighter handling of legal terminology, that internal use is likely to expand. The University of Luxembourg's SnT lab confirmed it will retest its multilingual benchmark suite against the new default within four weeks.

Bottom line

GPT-5.5 Instant is OpenAI's safer, faster default — not its most capable model. The 47 percent hallucination cut and 273-millisecond first-token latency are the headline numbers. The hard test is whether regulated verticals in Luxembourg and across the EU now trust ChatGPT enough to embed it into client workflows.

What is GPT-5.5 Instant?
GPT-5.5 Instant is OpenAI's new default ChatGPT model, released on 5 May 2026. It replaces GPT-5.3 Instant and focuses on faster latency and reduced hallucinations in legal, medical and financial domains.
How much does GPT-5.5 Instant cost via the API?
The OpenAI API charges USD 3.00 per million input tokens and USD 12.00 per million output tokens for GPT-5.5 Instant — the same pricing as the prior GPT-5.3 Instant.
Is GPT-5.5 Instant compliant with the EU AI Act?
OpenAI notified GPT-5.5 Instant to the EU AI Office on 6 May 2026 under the general-purpose-AI-with-systemic-risk regime that applies to assistants with more than 100 million monthly EU users.

See more on: Openai, Generative Ai, Ai Models, Chatgpt, Gpt 5 5

A look at recent reporting on tech & science from the Étude newsroom.


navigateopenescclose