There has been a lot of movement around and below the 13b parameter bracket in the last few months but it’s wild to think the best 70b models are still llama2 based. Why is that?

We have 13b models like 8bit bartowski/Orca-2-13b-exl2 approaching or even surpassing the best 70b models now

  • candre23B
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    It’s adorable that you think any 13b model is anywhere close to a 70b llama2 model.