For roleplay purposes, Goliath-120b is absolutely thrilling me

tenmileswide · 1 year ago

For roleplay purposes, Goliath-120b is absolutely thrilling me

DominicanGreg · 1 year ago

So does this fit in 48gb vram or nah?

Aaaaaaaaaeeeee · 1 year ago

Yes, the 3bpw model fits, with 4k context.
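A rough back-of-the-envelope check on that, as a minimal sketch: it assumes a nominal 120e9 parameters (taken from the "120b" in the name, not a measured count) and only counts the quantized weights, so KV cache, activations, and framework overhead are ignored. `weight_gib` and `N_PARAMS` are made-up names for illustration.

```python
def weight_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights alone, in GiB."""
    return n_params * bits_per_weight / 8 / 2**30


# Assumption: nominal parameter count read off the model name.
N_PARAMS = 120e9

if __name__ == "__main__":
    for bpw in (2.0, 3.0, 4.0):
        # Weights only; context (KV cache) needs extra room on top of this.
        print(f"{bpw} bpw: ~{weight_gib(N_PARAMS, bpw):.1f} GiB")
```

Under those assumptions, 3bpw leaves only a few GiB of a 48 GB budget for context, which would be consistent with topping out around 4k, and 4bpw would not fit at all.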

a_beautiful_rhind · 1 year ago

Hopefully someone makes a GGUF bigger than Q2. I’ve got 1/2 P40s and 1/2 3090s, so I can’t use EXL for a model this big.

Aaaaaaaaaeeeee · 1 year ago

Does the magic die at 3bpw?

ArtifartX · 1 year ago

What service do you use for GPU rental and inference for it?

tenmileswide · 1 year ago

Ah, sorry I missed this one - http://www.runpod.io

SlavaSobov · 1 year ago

Goliath-120b - License to Thrill.

Monkey_1505 · 1 year ago

Unfortunately, this is beyond the edge of what can reasonably be run on consumer hardware, so it’s unlikely to be easily available to most people. Hell, a 70b already really requires two graphics cards or a high-end Mac mini. If it can’t run on that kind of gear, it’s probably not going to be on AI Horde or any API either. That means you have to use RunPod or something, and most people are not going to do that.

ttkciar · 1 year ago

Nah, if you’re willing to tolerate CPU inference, this is achievable for downright cheap.

literal_garbage_man · 1 year ago

Will try this out on RunPod. Thanks for the heads up.

BalorNG · 1 year ago

Can we have some non-cherry-picked examples of writing?

It doesn’t have to be highly NSFW or whatever, but a comparison of Goliath’s writing against output from its constituent models, at the same settings and with the same (well-crafted) prompts, would be very interesting to see. Preferably at least 3 examples per model, given the inherent randomness of model output…

If you say this is a “night and day” difference, it should be apparent… I’m not sceptical per se, but “writing quality” is highly subjective, and the model’s style may simply mesh better with your personal preferences?

multiverse_fan · 1 year ago

Cool, sounds like a good model to download and store for the future, when I can get access to better hardware.