opus-mt-ca-fr ONNX int8 (mobile, offline)

Mobile-ready conversion of Helsinki-NLP/opus-mt-ca-fr (Catalan to French).

ONNX

  • encoder_model_quantized.onnx
  • decoder_model_quantized.onnx
  • decoder_with_past_model_quantized.onnx (KV cache: each decoding step feeds only the newest token and reuses the cached keys/values)
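
A minimal sketch of how the three files above are typically combined in a greedy decoding loop: the encoder runs once, the plain decoder produces the first step and the initial cache, and the with-past decoder handles every later step. The callables `encode`, `decode_first`, and `decode_with_past` are hypothetical wrappers (for example around one inference session per file) and are not shipped in this repo; `start_id` and `eos_id` are assumed to come from the model's config.

```python
from typing import Any, Callable, List, Sequence, Tuple

Logits = Sequence[float]  # next-token scores over the vocabulary
Cache = Any               # opaque KV cache handed back by the decoder


def argmax(scores: Logits) -> int:
    """Index of the highest score (greedy choice)."""
    return max(range(len(scores)), key=scores.__getitem__)


def greedy_decode(
    encode: Callable[[Sequence[int]], Any],
    decode_first: Callable[[Any, List[int]], Tuple[Logits, Cache]],
    decode_with_past: Callable[[Any, int, Cache], Tuple[Logits, Cache]],
    source_ids: Sequence[int],
    start_id: int,
    eos_id: int,
    max_new_tokens: int = 64,
) -> List[int]:
    # Encoder runs once per sentence (encoder_model_quantized.onnx).
    encoder_state = encode(source_ids)
    # First step has no cache yet (decoder_model_quantized.onnx).
    logits, cache = decode_first(encoder_state, [start_id])
    output: List[int] = []
    for _ in range(max_new_tokens):
        next_id = argmax(logits)
        if next_id == eos_id:
            break
        output.append(next_id)
        # Later steps pass only the newest token plus the cache
        # (decoder_with_past_model_quantized.onnx).
        logits, cache = decode_with_past(encoder_state, next_id, cache)
    return output
```

Passing the three stages in as callables keeps the loop independent of the runtime, so the same logic can be exercised with stub functions before wiring in the real sessions.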

Quantization: int8 dynamic, ARM64 target, per_channel enabled.
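
To illustrate what per-channel means here, the following is a toy sketch of symmetric int8 weight quantization with one scale per row (channel). It is an assumption-laden illustration in plain Python, not the code used to produce these files; the actual quantizer may differ in details such as zero points and which axis counts as the channel.

```python
from typing import List, Tuple

Matrix = List[List[float]]


def quantize_per_channel(weights: Matrix) -> Tuple[List[List[int]], List[float]]:
    """Symmetric int8 quantization with a separate scale for each row."""
    quantized: List[List[int]] = []
    scales: List[float] = []
    for row in weights:
        max_abs = max(abs(w) for w in row)
        # Map the largest magnitude in this row to 127; avoid a zero scale.
        scale = max_abs / 127.0 if max_abs > 0 else 1.0
        scales.append(scale)
        quantized.append([max(-127, min(127, round(w / scale))) for w in row])
    return quantized, scales


def dequantize(quantized: List[List[int]], scales: List[float]) -> Matrix:
    """Recover approximate float weights from int8 values and row scales."""
    return [[q * s for q in row] for row, s in zip(quantized, scales)]


# Rows with very different magnitudes each keep their own resolution.
weights = [[0.5, -1.0], [0.01, 0.02]]
q, scales = quantize_per_channel(weights)
restored = dequantize(q, scales)
```

With a single scale for the whole tensor, the small-magnitude second row would collapse onto only a couple of integer levels; a scale per channel avoids that. The "dynamic" part refers to activations being quantized at run time rather than from a calibration set.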

Tokenizer

  • tokenizer.json (Fast / Xenova / transformers.js)
  • source.spm, target.spm, vocab.json (Marian raw)

Source: Helsinki-NLP/opus-mt-ca-fr
Repo: R4kSo1997/opus-mt-ca-fr-onnx-int8
