Venus-120b: A merge of three different models in the style of Goliath-120b

nsfw_throwitaway69 · 1 year ago

Venus-120b: A merge of three different models in the style of Goliath-120b

a_beautiful_rhind · 1 year ago

I have 2x3090 for exl2. I have tess and goliath and both fit with ~3400 context so somehow your quant is slightly bigger.

nsfw_throwitaway69 · 1 year ago

Venus-120b is actually a bit bigger than Goliath-120b. Venus has 140 layers and Goliath has 136 layers, so that would explain it.

a_beautiful_rhind · 1 year ago

Makes sense… it’s doing pretty well. Like the replies. Set the limit to 3400 in tabby, no oom yet but using 98%/98%. I assume this means I can bump up the other models past 3400 too if I’m using tabby and autosplit.