Hello mates!
While working with local modals mostly on software development, I was wondering what would be the best model to work in my native language (Portuguese). Until now I never needed that kind of quality (language specific trained model), but I was thinking if I would need, what would be the model to have my hands on (and even further fine tune it). I know a llama based fine tuned model (Cabrita) but its a little bit restrictive in terms of usage, does anyone know any other llama or mistral model trained or fine tuned in portuguese?
Mistral and Llama2 work with many languages even if are marked as English.
Here is a quote from a benchmark on the German language, I think you will get a similar conclusion if you will do it for Portuguese.
“Kinda ironic that the English models worked better with the German data and exam than the ones finetuned in German. Looks like language doesn’t matter as much as general intelligence and a more intelligent model can cope with different languages more easily. German-specific models need better tuning to compete in general and excel in German.”
https://www.reddit.com/r/LocalLLaMA/comments/178nf6i/mistral_llm_comparisontest_instruct_openorca/