9.8 C
New York
Monday, March 31, 2025

Anthropic’s crawler is ignoring web sites’ anti-AI scraping insurance policies


The ClaudeBot net crawler that Anthropic makes use of to scrape coaching knowledge for AI fashions like Claude has hammered iFixit’s web site virtually 1,000,000 instances in a 24-hour interval, seemingly violating the restore firm’s Phrases of Use within the course of. 

“If any of these requests accessed our phrases of service, they might have instructed you that use of our content material expressly forbidden. However don’t ask me, ask Claude!” stated iFixit CEO Kyle Wiens on X, posting pictures that present Anthropic’s chatbot acknowledging that iFixit’s content material was off limits. “You’re not solely taking our content material with out paying, you’re tying up our devops assets. If you wish to have a dialog about licensing our content material for business use, we’re proper right here.”

iFixit’s Phrases of Use coverage states that “reproducing, copying or distributing” any content material from the web site is “strictly prohibited with out the categorical prior written permission” from the corporate, with particular inclusion of “coaching a machine studying or AI mannequin.” When Anthropic was questioned on this by 404 Media, nevertheless, the AI firm linked again to an FAQ web page that claims its crawler can solely be blocked by way of a robots.txt file extension.

Wiens says iFixit has since added the crawl-delay extension to its robots.txt. We now have requested Wiens and Anthropic for remark and can replace this story if we hear again.

iFixit doesn’t appear to be alone, with Learn the Docs co-founder Eric Holscher and Freelancer.com CEO Matt Barrie saying in Wiens’ thread that their web site had additionally been aggressively scraped by Anthropic’s crawler. This additionally doesn’t appear to be new conduct for ClaudeBot, with a number of months-old Reddit threads reporting a dramatic improve in Anthropic’s net scraping. In April this 12 months, the Linux Mint net discussion board attributed a web site outage to pressure attributable to ClaudeBot’s scraping actions.

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles