
Perplexity AI Accused of Using Stealth Crawlers to Bypass No-Crawl Directives

By rrampage · 10mo ago · 7 min read · en · News

Summary

Perplexity, an AI-powered answer engine, is reportedly using stealth tactics to bypass website no-crawl directives. Evidence suggests the company modifies user agents, changes IPs and ASNs, and ignores robots.txt files to continue crawling despite explicit blocks. This behavior raises concerns about compliance with web standards and website preferences.

Key quotes

Perplexity is repeatedly modifying their user agent and changing their source ASNs to hide their crawling activity.
The company appears to obscure their crawling identity in an attempt to circumvent the website’s preferences.
Perplexity is ignoring — or sometimes failing to even fetch — robots.txt files.
Snippet from the RSS feed
Perplexity is repeatedly modifying their user agent and changing IPs and ASNs to hide their crawling activity, in direct conflict with explicit no-crawl preferences expressed by websites.
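
For context, no-crawl preferences of this kind are usually expressed in a site's robots.txt file, and a compliant crawler checks that file before fetching a page. The sketch below uses Python's standard `urllib.robotparser` to show the check. The robots.txt content, the bot name `ExampleBot`, the `example.com` URLs, and the helper `is_allowed` are hypothetical and chosen only for illustration; this is not taken from the article or from any crawler's actual code.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: the bot name and paths are illustrative assumptions.
ROBOTS_TXT = """\
User-agent: ExampleBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() accepts the file's lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.com/article"
    # The named bot is blocked from the whole site.
    print(is_allowed(ROBOTS_TXT, "ExampleBot", page))
    # A client declaring a different name only falls under the wildcard rules.
    print(is_allowed(ROBOTS_TXT, "SomeOtherAgent", page))
```

Because the rules are matched against the name a client declares for itself, a crawler that presents a different user agent is no longer covered by a rule written for its original name. That is why the reported user agent changes matter: robots.txt relies on crawlers identifying themselves honestly.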
