• Monkey_1505B
    1 year ago

    You’ll get better datasets IMO using GPT to filter real datasets for quality rather than generating purely synthetic ones (which in theory would compound LLM flaws).

    But it’s a dumb law. It’s hard to even tell what’s in a model’s dataset for sure, let alone where it came from.