this post was submitted on 22 Apr 2025
1485 points (98.9% liked)
Memes
49953 readers
683 users here now
Rules:
- Be civil and nice.
- Try not to excessively repost, as a rule of thumb, wait at least 2 months to do it if you have to.
founded 6 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Thank you for testing that out.
My experience with AI is that it's at a point where it can comprehend something like this very easily, and won't be tricked.
I suspect that this can, however, pollute a model if it's included as training data, especially if done regularly, as OP is suggesting.
If it was done with enough regularity to eb a problem, one could just put an LLM model like this in-between to preprocess the data.
That doesn't work, you can't train models on another model's output without degrading the quality. At least not currently.
I don't think he was suggesting training on another model's output, just using ai to filter the training data before it is used.