I see that their distilled model is much worse than StableLM 3E1T, so the finetuning improved it a lot. Unfortunately, they didn't release the datasets (would that still be considered open source?). Also, I'm pretty sure my StableLM finetunes score better on the Open LLM benchmarks; they just don't allow StableLM models to be submitted.
Are your finetunes full fine-tunes or LoRA/QLoRA?