opus-mt-fr-ca ONNX int8 (movil offline)

Conversion movil-ready de Helsinki-NLP/opus-mt-fr-ca.

ONNX

  • encoder_model_quantized.onnx
  • decoder_model_quantized.onnx
  • decoder_with_past_model_quantized.onnx (KV cache, decoding O(1) por token)

Quant: int8 dynamic ARM64 + per_channel.

Tokenizer

  • tokenizer.json (Fast / Xenova / transformers.js)
  • source.spm, target.spm, vocab.json (Marian raw)

Source: Helsinki-NLP/opus-mt-fr-ca Repo: R4kSo1997/opus-mt-fr-ca-onnx-int8

Downloads last month
20
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support