Sign up to save tools and stay up to date with the latest in AI
bg
bg
1

Bluesky's open API means anyone can scrape your data for AI training | TechCrunch

Nov 27, 2024 - techcrunch.com
Bluesky, a popular social network, is under scrutiny after a machine learning librarian from AI firm Hugging Face used its Firehose API to pull 1 million public posts for research, later pushing the dataset to a public repository. The data was subsequently removed due to ensuing controversy, highlighting the public nature of posts on Bluesky and raising concerns about third-party use of user content for AI training.

In response, Bluesky is exploring ways to allow users to communicate their consent preferences externally, although it will be up to third parties to respect these preferences. The company acknowledges that it cannot enforce this consent outside of its systems and is currently in discussions with engineers and lawyers to address the issue.

Key takeaways:

  • A machine learning librarian at AI firm Hugging Face pulled 1 million public posts from Bluesky for research, causing controversy.
  • The data was later removed by Daniel van Strien due to the controversy.
  • Bluesky is considering ways to allow users to communicate their consent preferences externally, but it will be up to third parties to respect these preferences.
  • Despite its rising popularity, Bluesky is subject to the same levels of scrutiny as other major social platforms.
View Full Article

Comments (0)

Be the first to comment!