Tag

Content Scraping

AI crawlers, web scraping, and protecting your website content from unauthorised automated data collection.

2 articles

AI crawlers are hammering websites at scale, scraping content to train language models. Some respect robots.txt. Many don't. The bandwidth costs alone can be a problem, and that's before you consider the intellectual property implications of your content appearing in AI outputs.

These articles cover the technical and legal sides of content scraping: identifying AI crawlers in your logs, blocking them effectively, understanding which controls actually work, and following the policy debates that will shape how web scraping is regulated going forward.
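As a rough illustration of the first of those tasks, identifying AI crawlers in your logs, here is a minimal Python sketch that counts requests whose user-agent string contains a known crawler token. The token list is an illustrative, non-exhaustive assumption, the function name `count_ai_crawlers` is hypothetical, and the sample log lines are made up for the example. User agents can be spoofed, so treat the counts as a first pass rather than proof of who is visiting.

```python
from collections import Counter

# Illustrative, non-exhaustive list of user-agent substrings (an assumption:
# check each vendor's documentation for the names currently in use).
AI_CRAWLER_TOKENS = ("GPTBot", "ClaudeBot", "CCBot", "PerplexityBot", "Bytespider")


def count_ai_crawlers(log_lines):
    """Count log lines that contain a known AI crawler token (case-insensitive)."""
    counts = Counter()
    for line in log_lines:
        lowered = line.lower()
        for token in AI_CRAWLER_TOKENS:
            if token.lower() in lowered:
                counts[token] += 1
                break  # attribute each request to at most one crawler
    return counts


# Made-up access log lines in the common "combined" format, for illustration only.
sample = [
    '192.0.2.10 - - [10/Jun/2026:09:00:00 +0000] "GET / HTTP/1.1" 200 512 "-" "Mozilla/5.0 (compatible; GPTBot/1.0)"',
    '192.0.2.11 - - [10/Jun/2026:09:00:05 +0000] "GET /about HTTP/1.1" 200 1024 "-" "Mozilla/5.0 (Windows NT 10.0; Win64; x64)"',
    '198.51.100.7 - - [10/Jun/2026:09:01:00 +0000] "GET /blog HTTP/1.1" 200 2048 "-" "CCBot/2.0"',
    '192.0.2.10 - - [10/Jun/2026:09:02:00 +0000] "GET /blog HTTP/1.1" 200 2048 "-" "Mozilla/5.0 (compatible; GPTBot/1.0)"',
]

for name, hits in count_ai_crawlers(sample).most_common():
    print(name, hits)
```

In practice you would pass an open log file instead of `sample`, since the function accepts any iterable of lines.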

WordPress 10 Jun 2026

Is Your WordPress Host Quietly Blocking AI Bots? (And Cutting You Out of AI Search)

Some managed WordPress hosts now silently block AI crawlers by default, which can cut your site out of AI search. Here is the case against blanket blocking, our own 19-day crawler log showing which AI bots actually read a real site, and how to set it up properly at the infrastructure layer.

14 min read

News 8 Mar 2026

House of Lords Says AI Is Strip-Mining UK Websites. Here's What to Do About It

A 180-page Lords report warns that AI companies are scraping UK website content without permission, payment, or disclosure. The committee wants licensing, not opt-out. Here's what it means for the 5.5 million UK businesses publishing content online, and five steps you can take before the law catches up.

7 min read
