• WolframRavenwolfB
    link
    fedilink
    English
    arrow-up
    1
    ·
    10 months ago

    I took a short break from my 70B tests (still working on that!) and tried TheBloke/dolphin-2_2-yi-34b-GGUF Q4_0. It instantly claimed 4th place on my list.

    A 34B taking 4th place among the 13 best 70Bs! A 34B model that beats 9 70Bs (including dolphin-2.2-70B, Samantha-1.11-70B, StellarBright, Airoboros-L2-70B-3.1.2 and many others). A 34B with 16K native context!

    Yeah, I’m just a little excited. I see a lot of potential with the Yi series of models and proper finetunes like Eric’s.

    Haven’t done the RP tests yet, so back to testing. Will report back once I’m done with the current batch (70Bs take so damn long, and 120B even more so).

    • iChristB
      link
      fedilink
      English
      arrow-up
      1
      ·
      10 months ago

      Wow i gotta try it thanks for the hype! Does the GPTQ/AWQ versions differ from GGUF in terms of context? It listed that the context is only 4096