this post was submitted on 15 Dec 2025
8 points (78.6% liked)
Artificial Intelligence
1800 readers
1 users here now
Welcome to the AI Community!
Let's explore AI passionately, foster innovation, and learn together. Follow these guidelines for a vibrant and respectful community:
- Be kind and respectful.
- Share high-quality contributions.
- Stay on-topic.
- Enhance accessibility.
- Verify information.
- Encourage meaningful discussions.
You can access the AI Wiki at the following link: AI Wiki
Let's create a thriving AI community together!
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I always kind of assumed that the misconception was splitting the difference between two unconnected ideas.
First being this as you've said. Which, yes, solved problem. MS proudly doesn't use your Office365 cloud files to train AIs. They use it and an ML algorithm to make synthetic data. Then that gets used to train AIs.
The second idea being that as AI slop comes to fill every corner of social media, it will make it into training data, unconnected from source and destination. For example, chatbot armies fill Reddit and FB with slop, and OpenAI gobbles it up, wrongly assumed to be "real human engagement," reinforcing a certain style and type of content as the baseline. Even with synthetic data, something had to build the synthetic data. Though, that's less of a collapse and more of a stalling out in one narrow range of performance.