🚀 Launching SauerkrautLM-7b-HerO: A New Era in German Language Modeling!

AffectionateCan2342 · 10 months ago

Yes, we hope so too ;-) At least our first tests in real-world operation have shown quite good results. Keep in mind, though, that however promising the benchmark results sound, it is still a 7b model that was pre-trained in English.

Although the model can respond very well in German thanks to our fine-tuning with German data, there can still be slight grammatical errors here and there, especially if the inference parameters are set too high. This is currently difficult to avoid, especially with smaller models. But we are already working on a solution.
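To make the point about inference parameters a bit more concrete, here is a minimal sketch in plain Python. It assumes the parameter in question is the sampling temperature, which this thread does not actually specify, and the three scores are made-up values rather than real model output. It only illustrates the general effect: a higher temperature moves probability away from the top candidate and toward less likely tokens, which is one plausible way small slips could creep in.

```python
import math

def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities, scaled by a sampling temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical scores for three candidate next tokens (not real model output).
logits = [4.0, 2.0, 1.0]

low = softmax_with_temperature(logits, 0.5)   # conservative setting
high = softmax_with_temperature(logits, 1.5)  # more random setting

print("temperature 0.5:", [round(p, 3) for p in low])
print("temperature 1.5:", [round(p, 3) for p in high])
```

With the lower temperature almost all of the probability sits on the best-scoring token; with the higher one the weaker candidates get a noticeably larger share.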

There is always a fine line between preserving the intelligence of the original English-language model and teaching the model just enough that it can “speak” German well.