1
Feature Story
Mark Zuckerberg Gave Meta's Llama Team the OK To Train On Copyright Works, Filing Claims - Slashdot
Jan 09, 2025 · yro.slashdot.orgThe filing aligns with previous reports suggesting that Meta cut corners in data gathering for its AI projects, including hiring contractors in Africa to summarize books and considering purchasing the publisher Simon & Schuster. Meta's executives reportedly believed that negotiating licenses would take too long and considered fair use a viable defense. The new accusations in the filing suggest that Meta may have attempted to hide its alleged copyright infringement by stripping attribution from the LibGen data.
Key takeaways
- Meta is accused of using a dataset of pirated ebooks and articles, known as LibGen, for training its Llama AI models.
- Mark Zuckerberg allegedly approved the use of the LibGen dataset despite internal concerns about its legality.
- Meta employees reportedly acknowledged that LibGen was a pirated dataset and worried about its impact on regulatory negotiations.
- The plaintiffs claim Meta attempted to hide its actions by removing copyright information from the LibGen data.