Meta Accused of Scraping Independent Sites for AI Training Data
By
nogajun
Pulled from the oven just right. Trustworthy, fact-dense, deeply satisfying.
Summary
A report from Dropsite News alleges that Meta is scraping independent websites for AI training data, ignoring robots.txt directives. Meta's communications representative, Andy Stone, has not yet responded to these claims. The article highlights concerns about data privacy and ethical scraping practices.
Key quotes
· 3 pulledA new report from Dropsite News makes the claim that Meta is allegedly scraping a large amount of independent sites for content to train their AI.
What’s worse is that this scraping operation appears to completely disregard robots.txt, a control list used to tell crawlers, search engines, and bots which parts of a site should be accessed, and which parts should be avoided.
Andy Stone, a communications representative for Meta, has not yet responded to these claims.
You might also wanna read

Meta's Manus AI runs ads promoting get-rich-quick schemes using AI website building
Meta's acquired AI company Manus is running advertising campaigns promoting get-rich-quick schemes using AI tools. The ads target people loo

Meta tests AI account on Threads that users cannot block, sparking backlash
Meta is testing a new Threads feature that allows users to tag a Meta AI account to get answers or context about conversations, similar to h

Meta's AI Glasses Reportedly Send Sensitive Footage to Human Reviewers in Kenya
Meta's AI-powered smart glasses are reportedly sending sensitive footage, including intimate moments like bathroom visits and sex, to human
Elsevier joins class action lawsuit against Meta over alleged use of copyrighted content for AI training
Scientific publishing giant Elsevier has joined a class action lawsuit against Meta Platforms, alleging that Meta used Elsevier's copyrighte

Meta Implements Interim Safeguards for AI Chatbot Interactions with Minors
Meta is implementing interim changes to its AI chatbot rules after a Reuters investigation revealed concerning interactions with minors. The

Meta Tracks Employee Computer Activity to Train AI Models
Meta is installing tracking software called Model Capability Initiative (MCI) on US-based employees' computers to record mouse movements, cl
