Anna's Archive offers $200,000 bounty for scalable Google Books extraction method
By
Cider9986
Summary
This is a bounty announcement from Anna's Archive, offering $200,000 for a method to systematically download all scanned books from Google Books at scale. The post explains that Google Books has many scanned books but only exposes them as small snippets through search, and invites researchers to find a scalable method to extract them. It directs readers to read the bounty guidelines before working on the bounty.
Source
Key quotes
· 3 pulledGoogle Books has lots of scanned books, but which are only exposed through search, where it shows up as tiny snippets around the search results.
If you've found a method that you believe will scale up, then please...
Please read https://annas-archive.li/volunteering#bounties carefully before working on a bounty.
You might also wanna read
Journal giant Elsevier unveiled an AI tool that scans millions of paywalled papers. Is it worth it?
Court documents reveal Anthropic destroyed millions of print books to train its Claude AI model
Court documents reveal that AI company Anthropic physically destroyed millions of print books by cutting them from their bindings and scanni
arstechnica.com·13d agoLibrarians Push for Per-Use E-Book Pricing; Commonwealth Prize Uses Interviews Over AI Detectors
This article discusses two main topics: (1) Librarians' proposal for a fairer e-book pricing model, specifically a $1 per-use licensing fee
selfpublishingadvice.org·1d agoZjOjxLzH.gif (498×371)

Anthropic Agrees to $1.5 Billion Settlement with Authors Over AI Training Data Copyright Lawsuit
Anthropic has reached a landmark settlement agreement to pay at least $1.5 billion to authors in what is believed to be the largest recovery

Comments
Sign in to join the conversation.
No comments yet. Be the first.