• ambient_temp_xenoB
    link
    fedilink
    English
    arrow-up
    1
    ·
    10 months ago

    The first thing I looked for was the number of training tokens. I think yi34 got a lot of benefit from 3 trillion, so this model having 3 trillion bodes well.