Anon doesn't like reddit

🍹Early to RISA 🧉@sh.itjust.works · 8 months ago

NostraDavid@programming.dev · 8 months ago

Make sure to have some LLM generate the comment for you, since training LLMs on synthetic data may fuck them up over time: "AI models fed AI-generated data quickly spew nonsense"

ClamDrinker@lemmy.world · edited 8 months ago

I hate to ruin this for you, but if you post nonsense, it will get downvoted by humans and excluded from any data set (or included as an example of what to avoid). If it’s not nonsensical enough to be downvoted, it still won’t do well vote-wise, and will not realistically poison any data. And if it’s upvoted… it just might be good data. That is why Reddit’s data is valuable to Google: it basically has a built-in system for identifying ‘bad’ data.