Court Dismisses Authors’ Copyright Infringement Claims Against OpenAI
TorrentFreak
FEBRUARY 13, 2024
The Books3 dataset was created by AI researcher Shawn Presser in 2020, who scraped the library of ‘pirate’ site Bibliotik. The vision wasn’t wrong; large text archives are great training material for Large Language Models, but many authors disapprove of their works being used in this manner, without permission or compensation.
Let's personalize your content