Venus-120b: A merge of three different models in the style of Goliath-120b

nsfw_throwitaway69 · 1 year ago

Venus-120b: A merge of three different models in the style of Goliath-120b

noeda · 1 year ago

I will set this to run overnight on Hellaswag 0-shot like I did here on Goliath when it was new: https://old.reddit.com/r/LocalLLaMA/comments/17rsmox/goliath120b_quants_and_future_plans/k8mjanh/

Thanks for the model! I started investigating some approaches to combine models and see if it can be better than its individual parts. Just today I finished code to use a genetic algorithm to pick out parts and frankenstein 7B models together (trying to prove that there is merit to this approach using smalelr models…but we’ll see).

I’ll report back on the Hellaswag results on this model.

nsfw_throwitaway69 · 1 year ago

Thanks! I’m eager to see the results :)