AI Copyright Cases: Courts Rule on LLM Training Data Memorization and Infringement
By
oldnetguy
Plain bagel done well. Pleasantly substantive.
Summary
The article discusses legal cases involving AI companies and copyright infringement related to training data memorization. It covers a US court ruling that found Anthropic's LLM training could be considered fair use as transformative, but storing pirated works was infringing, leading to a $1.5 billion settlement. Additionally, a German court ruled that OpenAI infringed copyright by memorizing song lyrics, a landmark case brought by music rights organization GEMA. The context notes that LLMs memorize more training data than previously thought.
Key quotes
· 4 pulledA US court last year found that Anthropic's training of LLMs on some copyrighted content could be considered fair use as it was deemed 'transformative.'
But it determined that storing pirated works was 'inherently, irredeemably infringing,' which then led the AI group to pay $1.5 billion to settle the lawsuit.
In Germany, a ruling from November last year found that OpenAI had infringed on copyright because its model had memorized song lyrics.
The case, brought by GEMA, an association representing composers, lyricists, and publishers, was considered a landmark ruling.
You might also wanna read

Anthropic Settles Copyright Lawsuit Over AI Training on Pirated Books
Anthropic has settled a class action lawsuit filed by US authors who accused the AI startup of copyright infringement for allegedly training

Anthropic Agrees to $1.5 Billion Settlement with Authors Over AI Training Data Copyright Lawsuit
Anthropic has reached a landmark settlement agreement to pay at least $1.5 billion to authors in what is believed to be the largest recovery
German court rules OpenAI's ChatGPT violated copyright law by using song lyrics without license fees
A Munich court ruled that OpenAI's ChatGPT infringes German copyright law by using song lyrics in its responses without paying license fees.
Elsevier joins class action lawsuit against Meta over alleged use of copyrighted content for AI training
Scientific publishing giant Elsevier has joined a class action lawsuit against Meta Platforms, alleging that Meta used Elsevier's copyrighte

Five major publishers and author sue Meta over alleged copyright infringement in AI training
Five major book publishers (Macmillan, McGraw Hill, Elsevier, Hachette, Cengage) and author Scott Turow have filed a class action lawsuit ag

Encyclopedia Britannica and Merriam-Webster Sue OpenAI Over Copyright Infringement Claims
Encyclopedia Britannica and Merriam-Webster have filed a lawsuit against OpenAI, alleging that the company used their copyrighted content to
