All Topics

Technology

Art

AI Companies' Copyright Dilemma: Scraping Data vs. Fair Use

bgwalter

9mo ago· 6 min readenOpinion

100/100

Golden Brown

Bagelometer↗

Fresh out the oven, still warm. Top of the tray.

Score100TypeopinionSentimentnegative

Summary

The article criticizes AI companies for scraping vast amounts of online content, including text, photos, and videos, to train their models without regard for copyright laws. The author argues that this practice is not transformative fair use but outright theft, highlighting the ethical and legal issues surrounding AI development.

Key quotes

· 4 pulled

Artificial Intelligence companies have a copyright problem, and they know it.

One of AI’s original sins (there are many) is scraping every corner of the Internet for training data.

Intellectual property rights and copyrights be damned, in true Silicon Valley style, AI companies are moving fast and breaking things.

They need the data, so they take it and claim it’s transformative fair use. It’s not; it’s theft.

Snippet from the RSS feed

Theft is not fair use Artificial Intelligence companies have a copyright problem, and they know it. One of AI’s original sins (there are many) is scraping every corner of the Internet for training …

You might also wanna read

California's AB-412: The AI Copyright Transparency Act Explained

This article is a call-to-action urging support for California's AB-412, the AI Copyright Transparency Act. It argues that generative AI dev

actionnetwork.org·2d ago

AI Companies Target Students with Free Tools Despite Cheating Concerns

The article examines how AI companies like OpenAI, Google, and Perplexity are actively targeting students with free or discounted access to

The Verge·6mo ago

AI's Use of Faces and Voices: The Emerging Legal Frontier

The article discusses the emerging legal and cultural battle over AI's use of people's faces and voices, using the example of the AI-generat

The Verge·7mo ago

Anthropic Accuses Chinese AI Firms of Unauthorized Use of Claude Model for Training

Anthropic has accused three Chinese AI companies—DeepSeek, MiniMax, and Moonshot—of conducting industrial-scale campaigns to misuse its Clau

The Verge·3mo ago

CNN Sues AI Company Perplexity for Copyright Infringement Over Content Scraping

CNN has filed a lawsuit against AI company Perplexity, accusing it of copyright and trademark infringement by scraping over 17,000 CNN stori

Variety·3d ago

Reddit Sues Perplexity AI and Data-Scraping Companies Over Copyright Infringement

Reddit is suing AI company Perplexity and three data-scraping service providers (SerpApi, Oxylabs, and AWMProxy) for allegedly circumventing

The Verge·7mo ago