mozilla-foundation/common_voice_17_0
Updated • 5.5k • 16
How to use 6x16/whisper-small-nan-tw-quicktrain with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="6x16/whisper-small-nan-tw-quicktrain") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("6x16/whisper-small-nan-tw-quicktrain")
model = AutoModelForSpeechSeq2Seq.from_pretrained("6x16/whisper-small-nan-tw-quicktrain")This model is a fine-tuned version of openai/whisper-small on the Common Voice 17.0 (nan-tw) dataset. It achieves the following results on the evaluation set:
(Example Source: https://sutian.moe.edu.tw/zh-hant/su/27169/)
Original sentence: 萬事起頭難。
Inference by Whisper-Small: บันซู ขี้เท่าหลัน
Inference by this model: 萬事起頭難
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Cer |
|---|---|---|---|---|
| 0.3511 | 2.9240 | 1000 | 0.7512 | 125.6361 |
| 0.0117 | 5.8480 | 2000 | 0.7479 | 141.2850 |
| 0.001 | 8.7719 | 3000 | 0.7629 | 136.0814 |
| 0.0006 | 11.6959 | 4000 | 0.7699 | 138.9186 |
Base model
openai/whisper-small