Problem with Enlgish Speaking

by dove88 - opened Oct 15, 2025

Oct 15, 2025

When trying to use this multilingual-onnx scripts to do tts task for English text, the result is not good. I remember the original torch multillingual version is good for both English and other language.

Is the languagemodel.onnx correct? Kindly pelase share the scripts for converting lalnguage_model.onnx.

Thank you~

vladislavbro

ONNX Community org Oct 17, 2025

Hi @dove88 ! Thank you for reported this issue. It seems the issue was in tokenizer.config that was introduced by accident during replacement. I uploaded a new one, so you could try again

dove88

Oct 17, 2025

@vladislavbro , thank you very much, multilingual model also working for english now!

How about the speed from your side? The float16 lang_model version has performance of Time-To-First-Token around 4.5s with streaming_size = 30 on A100 GPU card, seems not good enough for real-time setttings.
I dont know if it is my problem?

I aslo tried to convert the onnx to tensorRT, but failed due to dynamic shape and customized operators.

vladislavbro

ONNX Community org Oct 18, 2025

Hmm good question, I did not measure performance for these models tbh, probably @Xenova did some tests

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment