Translate to and from 400+ languages locally with MADLAD-400

jbochi · 1 year ago

Translate to and from 400+ languages locally with MADLAD-400

remixer_dec · 1 year ago

Thanks a lot for converting and quantizing these. I have a couple of questions.

How does it compare to ALMA? (13B)

Is it capable of translating more than 1 sentence at a time?

Is there a way to specify source language or does it always detect it on its own?

jbochi · 1 year ago

Thanks!

- I’m not familiar with ALMA, but it seems to be similar to MADLAD-400. Both are smaller than NLLB-54B, but competitive with it. Because ALMA is a LLM and not a seq2seq model with cross-encoding, I’d guess it’s faster.
- You can translate up to 128 tokens at the time.
- You can only specify the target language, not the source language.