jbochi@alien.topB to

LocalLLaMA · English · 2 years ago

Translate to and from 400+ languages locally with MADLAD-400

Google released T5X checkpoints for MADLAD-400 a couple of months ago, but nobody could figure out how to run them. Turns out the vocabulary was wrong, but they uploaded the correct one last week.

I’ve converted the models to the safetensors format, and I created this space if you want to try the smaller model.

I also published quantized GGUF weights you can use with candle. It decodes at ~15 tokens/s on an M2 Mac.
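For anyone scripting this, here is a minimal sketch of building candle invocations, one per input. It assumes the flags and the model id from the `t5` example command quoted in the comments below, and the `<2xx>` target-language prefix used there (`<2de>` for German). The helper names `madlad_prompt` and `candle_t5_command` are made up for illustration, and whether the GGUF weights need a different example or extra flags is not covered here.

```python
import shlex


def madlad_prompt(target_lang: str, text: str) -> str:
    """Prefix the text with the target-language tag, e.g. '<2de> ...'."""
    return f"<2{target_lang}> {text}"


def candle_t5_command(prompt, model_id="jbochi/madlad400-3b-mt", features=None):
    """Build the argv list for one run of candle's t5 example.

    Only flags that appear in the command quoted in this thread are used.
    `features` is optional (e.g. "cuda"); leave it out on machines without CUDA.
    """
    cmd = ["cargo", "run", "--example", "t5", "--release"]
    if features:
        cmd += ["--features", features]
    cmd += ["--", "--model-id", model_id, "--prompt", prompt, "--temperature", "0"]
    return cmd


if __name__ == "__main__":
    # Hypothetical inputs: (target language code, source text).
    inputs = [("de", "How are you, my friend?"), ("pt", "Good morning.")]
    for lang, text in inputs:
        # Print a copy-pasteable shell command; nothing is executed here.
        print(shlex.join(candle_t5_command(madlad_prompt(lang, text), features="cuda")))
```

This only prints the commands. Each one is a separate process that reloads the model, so it is a workaround for several inputs rather than true batched inference.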

It seems that NLLB is the most popular machine translation model right now, but its license only allows non-commercial use. MADLAD-400 is CC BY 4.0.

yugaljain1999@alien.topB
2 years ago
@jbochi, is it possible to run the cargo example for batch inputs?

cargo run --example t5 --release --features cuda -- \
  --model-id "jbochi/madlad400-3b-mt" \
  --prompt "<2de> How are you, my friend?" \
  --temperature 0

Thanks
- fractal83@alien.topB
  2 years ago
  Yes, I would also be interested to know whether this is possible.