The article also highlights specific instances of data selling. Automattic, the parent company for Tumblr and WordPress, is reportedly preparing to announce deals selling user data to OpenAI and Midjourney. Reddit has already sold access to its posts to Google in a $60 million deal. The article concludes by stating that large AI models are likely being trained on posts across the internet, with public posts from platforms like Facebook and Instagram being used to train AI models.
Key takeaways:
- Internet users' data is often scraped and used to train AI systems, sometimes without the permission of the content creators.
- Companies like Tumblr, WordPress, and Reddit have reportedly been selling user data to AI companies like OpenAI and Midjourney.
- Automattic, the parent company of Tumblr and WordPress, has announced a way for users to opt out of sharing their public content with third parties.
- Reddit has sold access to its posts to Google for $60 million to train its generative AI models.