Daniel van Strien, a machine learning librarian at Hugging Face, took a million Bluesky posts and turned them into a dataset expressly for training AI models:“This dataset could be used for “training and testing language models on social media content, analyzing social media …