I want to download the Goliath model, but I can only afford Q3_K_M. It's described as having high quality loss. How much quality loss is there really?
I've heard that the larger the model, the less it suffers intellectually when it is quantized. I usually use 70B Q5_K_M. Can I expect 120B Q3_K_M to be significantly better than 70B Q5_K_M, so that the time spent downloading is worth it?
I imagine it’s pretty solid.
I’ve tested around with the q4_K_M and the q8 on my Mac Studio, and the q4 is pretty darn good. There’s some difference: the q4 does seem to get confused sometimes when I talk to it, whereas the q8 seems unshakeable in its quality. But honestly, the q4 still feels better than almost any other model I’ve ever used.
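If it helps with the "is the download worth it" part, here's a back-of-the-envelope sketch for comparing file sizes. It only does the arithmetic of parameters times bits per weight. The function name and the bits-per-weight numbers are my own assumptions (rough ballpark figures for these quant types, not exact values), and I'm using the nominal 120B and 70B parameter counts from this thread, so check the actual file listing before downloading.

```python
def approx_gguf_size_gb(params_billions: float, bits_per_weight: float) -> float:
    """Rough file size in GB (10^9 bytes): parameters * bits per weight / 8."""
    return params_billions * bits_per_weight / 8


# ASSUMPTION: approximate average bits per weight for each quant type.
# Real files differ by architecture and by which tensors get which quant.
ASSUMED_BPW = {
    "Q3_K_M": 3.9,
    "Q4_K_M": 4.85,
    "Q5_K_M": 5.7,
    "Q8_0": 8.5,
}

if __name__ == "__main__":
    # Nominal parameter counts as named in the thread, not exact counts.
    for label, params_b, quant in [
        ("120B", 120, "Q3_K_M"),
        ("70B", 70, "Q5_K_M"),
    ]:
        size = approx_gguf_size_gb(params_b, ASSUMED_BPW[quant])
        print(f"{label} {quant}: roughly {size:.0f} GB")
```

This says nothing about quality, only about how much you'd be downloading and fitting in memory, so treat it as a sizing aid rather than an answer to which model is better.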