Filing NeMo: Nvidia's AI framework hit with copyright lawsuit

Nvidia is facing a proposed class action lawsuit alleging that it used copyrighted works to train AI models without the authors' permission. The lawsuit, filed by authors Abdi Nazemian, Brian Keene, and Stewart O'Nan, claims that Nvidia used their books to train large language models in the Megatron library for its NeMo generative AI framework. The models in question were released in September 2022 and were trained on "The Pile" dataset, which includes a collection of books called Books3, containing the works of the three authors.

This is not the first case of an AI company being sued over copyright infringement related to training data. In December 2021, The New York Times sued Microsoft and OpenAI for allegedly using its articles without permission to build AI models. Nvidia, however, maintains that it created NeMo in full compliance with copyright law. The authors are seeking a jury trial and damages for the alleged copyright violations.

Key takeaways

Nvidia is facing a proposed class action lawsuit for allegedly using copyrighted works to train AI models in its Megatron library without the authors' permission.
The lawsuit was filed by three authors who claim their books were used in the training of the Megatron large language models.
The models in question were released in September 2022 and were trained on "The Pile" dataset, which is known to contain a number of unlicensed copyrighted works.
This follows a similar case where The New York Times sued Microsoft and OpenAI for using its articles without permission to build AI models.

Filing NeMo: Nvidia's AI framework hit with copyright lawsuit

Key takeaways

Discussion (0)