The 499 status code is the AI search problem no one audits
Mike King's new data shows HTTP 499s and slow origins quietly remove pages from AI citations entirely. Most SEO tools can't see the problem.
Mike King published a piece yesterday that should rearrange how technical SEOs spend their next quarter. He's flagging a status code most practitioners have never logged, never alerted on, and never seen in an audit tool — and arguing it's now one of the biggest determinants of whether your content is eligible to appear in AI search at all.
The status code is 499. Client closed connection. Introduced by NGINX, never made it into the HTTP spec, doesn't show up in Screaming Frog, doesn't appear in Search Console. It logs the moment a client gives up waiting for your server to finish responding.
In RAG-based AI search, that client is increasingly the retrieval agent. And when it gives up, your page doesn't get demoted. It gets removed from the candidate set entirely.
This is a bigger deal than the industry has clocked.
Eligibility is the layer the SEO industry hasn't built tooling for
Traditional SEO operates on a mental model that's roughly: crawl, index, rank. It's the model every tool, every dashboard, and most strategy decks are still organised around. The crawler eventually gets your page, the index eventually stores it, and ranking signals eventually decide where it sits.
That model is generous. It assumes your content will be considered, even if it ranks poorly. It assumes patience on the part of the system. It assumes a single, persistent index doing the work overnight.
AI search doesn't work like that. ChatGPT and Perplexity don't maintain a Google-style index. They fetch in real time, at query time, under tight latency budgets, often through multiple agents making decisions at each stage of an agentic retrieval pipeline. If your page takes too long to respond, the agent moves on. The retrieval step concludes. Synthesis happens without you in the candidate set.
There is no second chance in that pipeline. There is no overnight recrawl that catches you when the server's having a better day. The query was answered. The user moved on. You weren't in the room.
The data King is citing is not subtle
King got Profound to run the numbers across a 700,000-page sample over multiple days in April. Pages with high failure rates for AI crawlers — above 75% timeout rate — received roughly 18 times fewer citation events than reliably-served pages. In many cases, zero citations. Not lower citations. None.
Pages that frequently time out for AI crawlers don't rank lower. They aren't there at all.
His own client data shows a 22% bump in AI search visibility after addressing 499 spikes. Not from new content. Not from new links. From letting the existing content actually finish responding.
That's the kind of intervention that, in traditional SEO terms, would be considered a hygiene fix — not a strategic lever. The reframe King is making is that under RAG, hygiene *is* the strategic lever. Eligibility precedes everything.
Pages that frequently time out for AI crawlers don't rank lower. They aren't there at all.
Why most teams won't see this in their existing tooling
Here's the awkward bit. Most SEO platforms don't surface 499s. Google Search Console certainly doesn't — it's reporting Googlebot's experience, not OpenAI's or Anthropic's. Screaming Frog and Sitebulb crawl from your own infrastructure, not from the actual edges that AI agents are hitting.

The data lives in your CDN logs, your edge proxy logs, your origin access logs. Cloudflare logs 499s. Fastly does. NGINX does, by default, in any halfway-modern config. But almost nobody on the SEO side of the house is reading those logs. They're a DevOps artefact, often not even shared with marketing.
This is the practical consequence: a site can be losing AI citations every day and the SEO team has no visibility into why. The dashboards say everything is fine. The content is published. Schema is valid. Core Web Vitals look reasonable on the field-data sample Google reports. Meanwhile, the AI agents are giving up on the page before TTFB resolves, and citations are quietly evaporating.
If your stack doesn't include log analysis at the edge, segmented by user-agent, you cannot see this problem. And if you can't see it, you can't fix it.
The honest read on what this means for the industry
A few things follow from this if King's data holds up across more samples.
First, the technical SEO discipline just got more important, not less. There's been a lazy assumption running through 2025 and into this year that AI search makes technical work less critical because "the bots are smarter now." The opposite is true. Agentic retrieval has tighter latency budgets than Googlebot ever did. Smarter consumers of HTML, less patient ones.
Second, this further weakens the case for most of the GEO tooling being sold right now. Citation tracking dashboards are useful for measuring outcomes, but they don't tell you *why* you're not getting cited. A flat citation graph could mean your content is bad, your brand is unknown, or your edge is timing out for OpenAI's user-agent on 30% of fetches. The diagnostics live in your logs, not in a vendor dashboard.
Third, the measurement problem just got an additional dimension. We already knew citation tracking was noisy and inconsistent. Now we have to add: even your raw eligibility for citation is a thing you cannot reliably measure without infrastructure most marketing teams don't own.
Fourth — and this is the uncomfortable one — agencies that have been positioning themselves as GEO specialists need to be able to read a CDN log file. If they can't, they're optimising for an outcome whose primary determinant they can't see.
What to actually do about it
Three concrete things, in priority order.
Pull a sample of CDN or edge logs from the last 30 days, segmented by user-agent. Look for OAI-SearchBot, ChatGPT-User, PerplexityBot, ClaudeBot, GoogleOther, Google-Extended. Count 499s, 5xx errors, and slow responses (anything beyond 2–3 seconds TTFB) per user-agent. If any AI user-agent is seeing failure rates above a few percent, that's a fixable problem with potentially significant upside.
Check your origin's behaviour under load. AI crawlers don't politely throttle the way Googlebot does. They tend to fetch in bursts when a query trends. If your origin chokes when concurrent requests spike, you're shedding citations in exactly the moments your content is most relevant.
Confirm your CDN isn't returning challenge pages or rate-limit responses to legitimate AI user-agents. Cloudflare's bot management is aggressive by default and will happily serve a 403 or a JavaScript challenge to OpenAI's fetcher if you haven't whitelisted it. Many sites are blocking the very agents they're trying to be cited by, then wondering why they aren't being cited.
None of this is glamorous. None of it makes a good slide. It is, however, the actual mechanism that determines whether your content has a chance of being part of an AI answer.
The reframe worth holding onto
The single line from King's piece worth keeping is "eligibility is the new ranking." It captures something the industry has been circling but not quite naming. In the crawl-index-rank world, ranking was the bottleneck because eligibility was assumed. Of course Google had your page. The question was just where it sat.
In agentic retrieval, eligibility is the bottleneck because nothing is assumed. Every query is a fresh decision about which pages get fetched, which fetches succeed, which extracted passages make it into the synthesis. Each step has a failure mode. Each failure removes you from the answer.
The pieces that were already arguing the fundamentals matter more than the GEO branding suggested has a new piece of evidence in support. Server reliability, response latency, and edge configuration are now first-order discovery factors. They were always second-order in classical SEO. They're load-bearing now.
The work hasn't disappeared. It's moved closer to the metal.
Ready to improve your visibility in AI search?
If you're an SME in Surrey or London and you want more qualified leads from search — including the growing AI answer layer — let's talk.
Book a discovery call