Translate to and from 400+ languages locally with MADLAD-400

jbochi · 2 years ago

Translate to and from 400+ languages locally with MADLAD-400

danigoncalves · 2 years ago

What would be the equivalent open-source models that are free for commercial use?

remixer_dec · 2 years ago

Thanks a lot for converting and quantizing these. I have a couple of questions.

How does it compare to ALMA (13B)?

Is it capable of translating more than 1 sentence at a time?

Is there a way to specify source language or does it always detect it on its own?

jbochi · 2 years ago

Thanks!

- I’m not familiar with ALMA, but it seems to be similar to MADLAD-400. Both are smaller than NLLB-54B, but competitive with it. Because ALMA is an LLM and not a seq2seq model with cross-attention, I’d guess it’s faster.
- You can translate up to 128 tokens at a time.
- You can only specify the target language, not the source language.
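
For longer inputs, one workaround is to split the text into sentence groups that stay under the 128-token budget and translate each group separately. Here is a minimal sketch of that idea. The names `chunk_text` and `rough_token_count` are hypothetical, and counting whitespace-separated words is only a rough stand-in for the model's real tokenizer, so in practice you would pass a real token counter as `count_tokens`.

```python
import re
from typing import Callable, List


def rough_token_count(text: str) -> int:
    # Assumption: whitespace-separated words approximate tokens.
    # A real tokenizer will usually produce more tokens than this.
    return len(text.split())


def chunk_text(
    text: str,
    max_tokens: int = 128,
    count_tokens: Callable[[str], int] = rough_token_count,
) -> List[str]:
    """Greedily pack whole sentences into chunks of at most max_tokens."""
    # Split after sentence-ending punctuation followed by whitespace.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    chunks: List[str] = []
    current = ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and count_tokens(candidate) > max_tokens:
            # Adding this sentence would exceed the budget: close the chunk.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    sample = "One two three. Four five six. Seven eight."
    for chunk in chunk_text(sample, max_tokens=6):
        print(chunk)
```

Note that a single sentence longer than the budget is kept as its own oversized chunk rather than being cut mid-sentence, so it would still need separate handling.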

Puzzleheaded_Mall546 · 2 years ago

I don’t think it’s working.

jbochi · 2 years ago

Sorry, but what is not working?

Puzzleheaded_Mall546 · 2 years ago

I enter incomplete text to see how it will translate it, and the result is a continuation of my text, not a translation.

jbochi · 2 years ago

How are you running it? Did you prepend a “<2xx>” token for the target language? For example, “<2fr> hello” will translate “hello” to French. If you are using this space, you can select the target language in the dropdown.
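
If you are building the prompt yourself, the prefix format described above can be sketched as a tiny helper. The function name `build_prompt` is hypothetical, and it only formats the string. It does not check that the language code is one the model actually supports.

```python
def build_prompt(target_lang: str, text: str) -> str:
    """Prepend the "<2xx>" target-language token to the text to translate."""
    code = target_lang.strip()
    if not code:
        raise ValueError("target_lang must be a non-empty language code")
    # Format from the thread: "<2fr> hello" asks for a French translation.
    return f"<2{code}> {text.strip()}"


if __name__ == "__main__":
    print(build_prompt("fr", "hello"))
```

Without the prefix, the model has no target language to aim for, which may explain output that looks like a continuation instead of a translation.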

Puzzleheaded_Mall546 · 2 years ago

I am using the code of the space.

jbochi · 2 years ago

Got it. Can you please share the full prompt?

phoneixAdi · 2 years ago

Nice thank you!! Tried in space. Works well for me. Noob question. Can I run this with llama.cpp? Since it’s gguf. Can I download this and run it locally?

vasileer · 2 years ago

I tested the 3B model for Romanian, Russian, French, and German translations of “The sun rises in the East and sets in the West.” and it works 100%: it gets 10/10 from ChatGPT.

a_beautiful_rhind · 2 years ago

If anything needed some minimalist app, this would be it.