Goliath-120B - quants and future plans

AlpinDale · 2 years ago

Goliath-120B - quants and future plans

noeda · 2 years ago

Not sure if you misread, but it’s actually high, i.e. it’s better than Xwin and Euryale it’s made out of (in this particular quick test).

It beat all the 70B models I tested there, although the gap is not super high.

AlpinDale · 2 years ago

Yes well it should perform much higher than that. Turboderp ran MMLU at 3.25bpw and it was performing worse than other 70B models. I assume quantization further degrades the spelling consistency.