Cloudflare sets September 15 deadline for AI crawlers to separate search from training use
New default blocks 'mixed-use' crawlers from ad-hosting pages unless site owners opt out, aiming to push AI firms toward paid content access.
1 source · cross-referenced
- Cloudflare will block 'mixed-use' crawlers from ad-hosting pages by default starting September 15, 2026, unless site owners adjust settings.
- The policy targets crawlers that blend search, agent use, and training, requiring separation to avoid blocking.
- Cloudflare’s move aims to push AI companies toward paying publishers for content access.
- Publishers can opt into tools like 'Pay Per Crawl' and 'Pay Per Use' to monetize AI-driven access to their content.
- Cloudflare’s data suggests over 50% of AI crawler traffic involves re-fetching unchanged pages.
Cloudflare announced a September 15, 2026 deadline for AI companies to separate the web crawlers used for traditional search from those used for AI training and agents. Starting on that date, Cloudflare’s default settings will block 'mixed-use' crawlers from any pages that host ads unless the site owner adjusts the settings.
The policy applies to new Cloudflare customers, new sites set up by existing customers, and all existing free customers. Cloudflare co-founder and CEO Matthew Prince framed the move as necessary to address the milestone where non-human bot traffic surpassed human traffic online, a shift that Cloudflare noted was not expected until 2027.
Cloudflare’s announcement highlights tensions between publishers and AI companies over access to content. Publishers often want their content discoverable via search and AI services but seek protections against giving away intellectual property for free. Cloudflare cited concerns that some AI companies access significantly more content than others, referencing what it described as the 'world’s largest search engine' having access to '2x more information' than other AI companies.
The company also introduced or expanded tools to give publishers more control over AI-driven access to their content. Cloudflare’s 'Pay Per Crawl' marketplace, which lets websites charge AI bots for scraping, is evolving into 'Pay Per Use.' This new model aims to let publishers charge AI companies when their content creates value, not just when it is fetched. Cloudflare is initially working with partners Ceramic.ai and You.com to implement this model, with publishers paid when their content appears in Ceramic.ai’s AI search results or when You.com accesses premium content.
Cloudflare’s data suggests that over 50% of crawl traffic from AI crawlers involves re-fetching unchanged pages, which the company argues wastes publishers’ bandwidth and compute resources. The policy and tools are designed to encourage AI companies to adopt clearer, more sustainable crawling practices while providing publishers with commercial opportunities.
- Jul 1, 2026 · TechCrunch — AI
U.S. lifts export restrictions on Anthropic’s Mythos and Fable models after weeks of talks
Trust74 - Jun 30, 2026 · Ars Technica — Technology Lab
U.S. State Department offers $10 million reward for information on Russian cyber group targeting Signal and WhatsApp accounts
Trust75 - Jun 29, 2026 · The Verge — AI
Senators Warren and Scanlon propose bill to restrict sale of health and location data to brokers, including data entered into AI systems
Trust75