So Mistral-7b is a pretty impressive 7B param model … but why is it so capable? Do we have any insights into its dataset? Was it trained very far beyond the scaling limit? Any attempts at open reproductions or merges to scale up # of params?
I second this. Mistral-7B gave me good results, and after fine-tuning its results were even better.
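For anyone curious what that fine-tuning might look like, here is a minimal LoRA sketch, not the commenter's actual setup: it assumes Hugging Face transformers, peft, and datasets are installed, and the dataset and hyperparameters are placeholders.

```python
# Minimal LoRA fine-tuning sketch for Mistral-7B.
# Dataset and hyperparameters are placeholders, not the setup from this thread.
import torch
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_name = "mistralai/Mistral-7B-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token  # Mistral's tokenizer has no pad token by default

model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)

# Attach LoRA adapters to the attention projections; only the small
# adapter matrices are trained, the 7B base weights stay frozen.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

# Placeholder dataset: any corpus with a "text" column works here.
dataset = load_dataset("wikitext", "wikitext-2-raw-v1", split="train")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)

tokenized = dataset.map(tokenize, batched=True, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="mistral-7b-lora",
        per_device_train_batch_size=1,
        gradient_accumulation_steps=8,
        learning_rate=2e-4,
        num_train_epochs=1,
        bf16=True,
        logging_steps=50,
    ),
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
model.save_pretrained("mistral-7b-lora")  # saves only the adapter weights
```

The appeal of LoRA here is practical: because only the adapter matrices are updated, a 7B model can be fine-tuned on a single consumer GPU, which is presumably why so many fine-tunes of Mistral-7B appeared so quickly.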
Can you expand upon that? Do you mean in terms of its ability to write at a college level without major grammatical errors?