Skip to main content
Netherlands News in English

Main navigation

  • Top stories
  • Health
  • Crime
  • Politics
  • Business
  • Tech
  • Culture
  • Sports
  • Weird
  • 1-1-2
Image
Data mining concept
Data mining concept - Credit: kentoh / DepositPhotos - License: DepositPhotos
Crime
Tech
Innovation
Stichting Brein
AI
artificial intelligence
data set
copyright infringement
copyright
Bastiaan van Ramhorst
machine learning
large language models
Tuesday, 13 August 2024 - 12:50

Share this article:

Dutch foundation takes down dataset illegally used for training AI

Copyright foundation Stichting BREIN has taken a Dutch dataset offline that was intended to train artificial intelligence (AI). It is a first for the Netherlands, the foundation said on Tuesday.

According to BREIN, the dataset was “enormous,” containing illegal copies of tens of thousands of books, millions of lines from news articles from websites like NU.nl, and subtitles of countless films and TV series from illegal sources. It was compressed to be easily used by AI computer models like large language models, the foundation said.

“We searched the dataset for the literal text: ‘Nothing from this publication may be reproduced,’ and this yielded more than 10,000 results. Each of these concerned illegally copied books,” BREIN director Bastiaan van Ramhorst said. “The news articles were also copied from websites with copyright reservations. This clearly shows that copyrights have not been respected. We call that a red-handed act.”

BREIN identified the person who made the dataset. They promised the foundation in writing not to use it anymore and told the foundation to who they provided the dataset. BREIN is investigating which AI models have used the dataset so that the parties can be held accountable.

More like this

Image
Meta
Dutch writers, journalists demand that Meta stop using their work to train its AI
Image
Deepfake
Dutch parliament considering copyright on faces, voices in fight against deepfakes
Image
Trains at Rotterdam Central Station
NS turns to AI to cut train electricity use as Dutch power grid is overloaded
Image
ChatGPT app icon on smartphone screen with pushing finger. Artificial intelligence chatbot service on mobile phone
Dutch parents want complete smartphone ban at school, more communication about AI use
Make NL Times your top Google source

Follow us:

Latest stories

  • De Jong shocks French Open, defeating Khachanov; To take on Zverev in quarterfinal
  • Amsterdam tourism hits record 23.7 million overnight stays despite city tourism cap
  • New bunq promotion lets savers boost their summer holiday pay with higher interest rate
  • Lightning storms ignite multiple house fires, paralyze rail travel across Netherlands
  • New Amsterdam-Paris train from €19 will stop in Haarlem, The Hague, Roosendaal & Gent

Top stories

  • Lightning storms ignite multiple house fires, paralyze rail travel across Netherlands
  • New Amsterdam-Paris train from €19 will stop in Haarlem, The Hague, Roosendaal & Gent
  • Police arrest 35-year-old man after youth soccer leader found dead in Herpen ditch
  • Urgent Code Orange warning issued as heavy storms hit eastern Netherlands
  • Prosecutors target alleged drug profits of former Oranje international Quincy Promes

© 2012-2026, NL Times, All rights reserved.

Footer menu

  • Change Privacy Settings
  • Privacy Policy
  • Contact
  • Partner Content