Well, it’s ML land. No one cares! Just drop your paper and move on!
If OpenAI had to pay up for every license violation in their history, they would probably fold. And they’re above average.
IMO you’ll get better datasets by using GPT to filter real data for quality than by generating purely synthetic data (which in theory would compound LLM flaws).
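To make the filtering idea concrete, here is a rough Python sketch. Everything in it is an assumption for illustration: `filter_by_quality` and `toy_score` are made-up names, and `toy_score` is a trivial stand-in for whatever GPT-based judge you would actually call. The point is just that the kept samples are real text, so nothing model-generated gets fed back into training.

```python
from typing import Callable, Iterable, List


def filter_by_quality(
    samples: Iterable[str],
    score_fn: Callable[[str], float],
    threshold: float = 0.5,
) -> List[str]:
    """Keep only the real samples whose quality score meets the threshold."""
    return [s for s in samples if score_fn(s) >= threshold]


def toy_score(text: str) -> float:
    """Hypothetical stand-in for an LLM judge.

    Returns the fraction of characters that are letters or whitespace.
    In practice you would swap this for a call to a model that rates quality.
    """
    if not text:
        return 0.0
    clean = sum(ch.isalpha() or ch.isspace() for ch in text)
    return clean / len(text)


samples = [
    "The model converged after three epochs.",
    "$$$ ### !!! 12345",
    "",
]

# Only samples scoring at or above the threshold survive the filter.
kept = filter_by_quality(samples, toy_score, threshold=0.8)
print(kept)
```

The scorer is passed in as a plain function, so you can swap the toy heuristic for a real model call without touching the filter itself.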
But it’s a dumb law. It’s hard to even tell what’s in a model’s dataset for sure, let alone where it came from.