Bluesky may not train AI systems on user content the way some other social networks do, but that doesn’t stop third parties from doing so.
## AI Training with Public Posts
According to a report by 404 Media, Daniel van Strien of AI firm Hugging Face used Bluesky's Firehose API to extract 1 million public posts for machine learning research. The dataset was later uploaded to a public repository, sparking controversy and raising concerns about data privacy.
## Consent Preferences and Enforcement
Bluesky said it is exploring ways to let users communicate their consent preferences externally. However, the company clarified that it cannot enforce these preferences outside its own systems, leaving it up to external developers to respect user settings.
## Rising Scrutiny for Bluesky
As Bluesky gains popularity and enters the global spotlight, it will face the kind of scrutiny that other major social platforms already receive. Users should be aware that anything posted publicly on Bluesky is public information, accessible to anyone.
