Technology Nvidia sued over AI training data as copyright clashes continue

silversurfer

Level 85
Thread author
Verified
Honorary Member
Top Poster
Content Creator
Malware Hunter
Well-known
Aug 17, 2014
10,178
Book authors are suing Nvidia, alleging that the chipmaker's AI platform NeMo—used to power customized chatbots—was trained on a controversial dataset that illegally copied and distributed their books without their consent.

In a proposed class action, novelists Abdi Nazemian (Like a Love Story), Brian Keene (Ghost Walk), and Stewart O’Nan (Last Night at the Lobster) argued that Nvidia should pay damages and destroy all copies of the Books3 dataset used to power NeMo large language models (LLMs).

The Books3 dataset, novelists argued, copied "all of Bibliotek," a shadow library of approximately 196,640 pirated books. Initially shared through the AI community Hugging Face, the Books3 dataset today "is defunct and no longer accessible due to reported copyright infringement," the Hugging Face website says.
 
Nov 1, 2022
28
I'm also intrigued by this situation. It seems like a lot is happening lately, with many major players facing legal challenges over AI-related issues. It's becoming increasingly clear that protecting intellectual property is a significant concern nowadays, especially with the widespread use of AI platforms like NeMo and Chat GPT.

It's no secret that most (serious) companies monitor copyright infringement and that it's getting more and more challenging to detect it. The rise of platforms like Hugging Face, having all models and libraries for free, adds another layer of complexity to the issue. (Although it seems unlikely that end users will have a say.)

There is definitely a need for more effective solutions for preventing the unauthorized use of copyrighted material, and I dare to say the final verdicts in the lawsuits mentioned might play a role in the matter.

I mean, cases like this highlight the importance of legal (well, and ethical) considerations in the development and use of AI technologies, but we are yet to see if they will make a significant difference and shape future development.
 
  • Like
Reactions: vtqhtr413

About us

  • MalwareTips is a community-driven platform providing the latest information and resources on malware and cyber threats. Our team of experienced professionals and passionate volunteers work to keep the internet safe and secure. We provide accurate, up-to-date information and strive to build a strong and supportive community dedicated to cybersecurity.

User Menu

Follow us

Follow us on Facebook or Twitter to know first about the latest cybersecurity incidents and malware threats.

Top