The Open Licensing Standard for AI Crawlers – Giving publishers a voice and AI a smarter path forward — beyond scraping vs. paywalls to balanced collaboration https://peekthenpay.org/#how-it-works expect one of these new standards per month for the next months.
How AI internet scraping is evolving, current techniques used :
– Direct HTTP Crawlers (Traditional Crawlers) : GPTBot, ClaudeBot, Meta-ExternalAgent, Google-Extended, Bytespider, Amazonbot, Applebot-Extended, CCBot
– Cloud Browser Infrastructure (Browser-as-a-Service) : Browserbase, Hyperbrowser
– Web Scraping & Data Extraction Platforms : Firecrawl, Apify, Zyte
– Browser-driven web agents : Comet (Perplexity), Dia (The Browser Company)
– Real-Time Fetchers (On-Demand) : ChatGPT-User, OAI-SearchBot, Claude-User, Perplexity-User
Tollbit state of the bots : https://tollbit.com/bots/25q2/

Really simple licencing : The open content licensing standard
for the AI-first Internet https://rslstandard.org/
Vibe Coding Without System Design is a Trap : https://www.focusedchaos.co/p/vibe-coding-without-system-design-is-a-trap