Good and fast ~1B model to run on the web?

palpapeen · 2 years ago

Good and fast ~1B model to run on the web?

LyPreto · 2 years ago

DeepSeek-Coder has a model around 1B (1.3B, I believe) that’s outperforming 13B models on coding benchmarks. I’ll check back once I find a link.

Edit: found it https://evalplus.github.io/leaderboard.html

palpapeen · 2 years ago

Thanks! But I’m not looking for one that does coding. I want one that’s good at reasoning and detecting fallacies. Phi-1.5 seems like a better fit for that.

LyPreto · 2 years ago

I would still give it a try. It’s misleading to think these coding models are only good at coding; being good at coding has actually been shown to improve a model’s scores across multiple benchmarks.