
Tumblr is selling user data to train AI. Things could get weird.

Feb 28, 2024 - businessinsider.com
Automattic, the company that owns WordPress and Tumblr, is reportedly planning to share data from its sites with OpenAI and Midjourney to help train AI models. The company's sites currently block AI crawlers, but it will offer users an opt-out option when it starts sharing data. The plan has raised concerns because the compiled data reportedly includes posts from deleted or suspended blogs, private posts, and content marked NSFW or "mature".

Other social platforms such as Reddit, Facebook, and Instagram are putting user content to similar use. Reddit licenses its data to Google for $60 million a year, while Facebook and Instagram use data for Meta's own internal AI tools. This has sparked controversy among users who are uncomfortable with their personal content being used to train AI.

Key takeaways:

  • Automattic, the company that owns WordPress and Tumblr, is reportedly making a deal to provide data from its sites to help train AI models from OpenAI and Midjourney.
  • Automattic's sites currently block AI crawlers, but the company plans to start sharing data with AI companies, offering an opt-out for users who do not wish to participate.
  • Internal Automattic employee messages revealed that engineers made mistakes while compiling posts, sweeping in content from deleted or suspended blogs, private posts, and content marked NSFW or "mature".
  • Other social platforms such as Reddit, Facebook, and Instagram are also using user-generated content to train AI, which has sparked controversy among users who are uncomfortable with their personal content being used in this way.