base model encoder choice

by sugintama - opened Jul 31, 2025

Jul 31, 2025

For the base model's encoder, is an encoder with a different FastConformer-like structure, such as the one in nvidia/parakeet-tdt-0.6b-v2, compatible with this SALM? Or must it strictly be an encoder that is paired with a transformer decoder in its base model?

piotrzelasko

NVIDIA org · Aug 3, 2025 (edited Aug 3, 2025)

For Canary-Qwen-2.5B specifically, it has to be the encoder architecture and parameters in this checkpoint; otherwise it won’t work.

But if you want to train your own SALM with NeMo, you can of course use any pretrained ASR model (just change "pretrained_asr" to a different model name or .nemo checkpoint).
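
To illustrate the kind of change meant here, below is a minimal sketch in plain Python that swaps the "pretrained_asr" entry in a training config represented as a nested dict. Only the "pretrained_asr" key comes from this thread; the nesting under "model", the helper name `with_asr_encoder`, the starting value, and the local checkpoint path are assumptions made for illustration, so check your actual NeMo SALM training config for the real layout.

```python
import copy


def with_asr_encoder(config: dict, pretrained_asr: str) -> dict:
    """Return a copy of a SALM-style config that points at a different ASR model.

    `pretrained_asr` may be a model name or a path to a .nemo checkpoint.
    The "model" nesting is an assumed layout, not a confirmed NeMo schema.
    """
    new_config = copy.deepcopy(config)  # leave the caller's config untouched
    new_config["model"]["pretrained_asr"] = pretrained_asr
    return new_config


# Hypothetical starting config; the value is a placeholder.
base_config = {"model": {"pretrained_asr": "some-org/some-asr-model"}}

# Swap in the model named in the question above.
parakeet_config = with_asr_encoder(base_config, "nvidia/parakeet-tdt-0.6b-v2")

# Or point at a local checkpoint (hypothetical path).
local_config = with_asr_encoder(base_config, "/path/to/my_encoder.nemo")
```

This only applies when training a new SALM from scratch; as noted above, the released Canary-Qwen-2.5B checkpoint requires its own encoder.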
