Google released T5X checkpoints for MADLAD-400 a couple of months ago, but nobody could figure out how to run them. Turns out the vocabulary was wrong, but they uploaded the correct one last week.

I’ve converted the models to the safetensors format, and I created this space if you want to try the smaller model.

I also published quantized GGUF weights you can use with candle. It decodes at ~15tokens/s on a M2 Mac.

It seems that NLLB is the most popular machine translation model right now, but the license only allows non commercial usage. MADLAD-400 is CC BY 4.0.

    • jbochi
      cake
      OPB
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      Sorry, but what is not working?

      • Puzzleheaded_Mall546B
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 year ago

        I write text that is incomplete to see how it will translate it and the results is a coninuation of my text not the translation.

        • jbochi
          cake
          OPB
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 year ago

          How are you running it? Did you prepended a “<2xx>” token for the target language? For example, “<2fr> hello” will translate “hello” to French. If you are using this space, you can select the target language in the dropdown.