
AI Scrapers Attempt to Access Commented-Out Scripts, Revealing Bot Behavior Patterns

By ColinWright

7mo ago · 14 min read · en · Insight

Summary

A developer found abusive bot behavior while following up on anomalies in their server logs: a run of 404 ('Not Found') errors for a JavaScript file that did not exist on the server. The author had accidentally committed and deployed a page containing a commented-out script tag left over from prototyping a new feature, and AI scrapers requested the file that tag referenced. A browser never fetches a resource that appears only inside an HTML comment, so these requests suggest that some bots collecting AI training data ignore HTML comments and try to fetch every URL they find in the page source, whether or not it is part of the page content. That gives site operators a new way to identify badly behaved bots.
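
The log check described above can be sketched in a few lines of Python. This is a minimal illustration, not the author's method: the decoy path `/js/prototype-feature.js` is a hypothetical stand-in for the commented-out script, the function name `find_decoy_hits` is invented for this example, and the regular expression assumes the widely used common/combined access log format, which may not match a given server's configuration.

```python
import re
from collections import Counter

# Hypothetical path: stands in for a script that is referenced only
# inside an HTML comment and does not exist on the server.
DECOY_PATH = "/js/prototype-feature.js"

# Assumed log layout: common/combined access log format.
# The referrer and user-agent fields are optional.
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]*\] '
    r'"(?P<method>[A-Z]+) (?P<path>\S+)[^"]*" (?P<status>\d{3}) \S+'
    r'(?: "[^"]*" "(?P<agent>[^"]*)")?'
)


def find_decoy_hits(lines, decoy_path=DECOY_PATH):
    """Count requests for the decoy path, keyed by (client IP, user agent)."""
    hits = Counter()
    for line in lines:
        match = LOG_LINE.match(line)
        if match is None:
            continue  # skip lines that do not fit the assumed format
        # Ignore any query string so "/x.js?v=2" still counts as "/x.js".
        path = match.group("path").split("?", 1)[0]
        if path == decoy_path:
            hits[(match.group("ip"), match.group("agent") or "")] += 1
    return hits
```

Passing an open log file to `find_decoy_hits` yields a counter of clients that asked for a file no browser would request. Since user-agent strings can be forged, such a count is better treated as one signal among several than as proof of who operates the client.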

Key quotes

AI scrapers request commented scripts
I discovered some abusive bot behaviour during a routine follow-up on anomalies that had shown up in my server's logfiles
There were a bunch of 404 errors ('Not Found') for a specific JavaScript file
I accidentally committed and deployed a commented-out script tag that I'd included in the page while prototyping a new feature
A new avenue for identifying greedy, badly-behaved bots
