Hello mates!

While working with local models, mostly for software development, I started wondering which model would work best in my native language (Portuguese). Until now I have never needed that kind of quality (a model trained specifically for one language), but if I did, which model should I get my hands on (and possibly fine-tune further)? I know of one Llama-based fine-tuned model (Cabrita), but it's a little restrictive in terms of usage. Does anyone know of any other Llama or Mistral model trained or fine-tuned in Portuguese?

  • vasileerB
    1 year ago

    Mistral and Llama2 work with many languages even if they are marked as English.

    Here is a quote from a benchmark on the German language; I think you would reach a similar conclusion if you ran it for Portuguese.

    “Kinda ironic that the English models worked better with the German data and exam than the ones finetuned in German. Looks like language doesn’t matter as much as general intelligence and a more intelligent model can cope with different languages more easily. German-specific models need better tuning to compete in general and excel in German.”

    https://www.reddit.com/r/LocalLLaMA/comments/178nf6i/mistral_llm_comparisontest_instruct_openorca/