Instructions to use microsoft/speecht5_tts with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use microsoft/speecht5_tts with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-to-speech", model="microsoft/speecht5_tts")# Load model directly from transformers import AutoProcessor, AutoModelForTextToSpectrogram processor = AutoProcessor.from_pretrained("microsoft/speecht5_tts") model = AutoModelForTextToSpectrogram.from_pretrained("microsoft/speecht5_tts") - Notebooks
- Google Colab
- Kaggle
fix missing import
#2
by pete-rrr - opened
README.md
CHANGED
|
@@ -33,6 +33,7 @@ Use the code below to convert text into a mono 16 kHz speech waveform.
|
|
| 33 |
from transformers import SpeechT5Processor, SpeechT5ForTextToSpeech, SpeechT5HifiGan
|
| 34 |
import torch
|
| 35 |
import soundfile as sf
|
|
|
|
| 36 |
|
| 37 |
processor = SpeechT5Processor.from_pretrained("microsoft/speecht5_tts")
|
| 38 |
model = SpeechT5ForTextToSpeech.from_pretrained("microsoft/speecht5_tts")
|
|
|
|
| 33 |
from transformers import SpeechT5Processor, SpeechT5ForTextToSpeech, SpeechT5HifiGan
|
| 34 |
import torch
|
| 35 |
import soundfile as sf
|
| 36 |
+
from datasets import load_dataset
|
| 37 |
|
| 38 |
processor = SpeechT5Processor.from_pretrained("microsoft/speecht5_tts")
|
| 39 |
model = SpeechT5ForTextToSpeech.from_pretrained("microsoft/speecht5_tts")
|