Breeze-ASR-26-GGML
This repo contains GGML exports of MediaTek-Research/Breeze-ASR-26 for use with whisper.cpp.
The base model is a Whisper large-v2 fine-tune for Taiwanese Hokkien / Taigi ASR, producing mostly Mandarin Chinese characters.
Included files
| File | Format | Size | SHA256 |
|---|---|---|---|
ggml-model.bin |
fp16 GGML | 2.9 GB | 6d58f81d79155deb5037f995a048856f6deaa9e06f59a89183cc421fa37cb1ad |
ggml-model-q5_0.bin |
q5_0 GGML | 1.0 GB | 60f25e3a21feca12ec082e6d36f08f94455d9900d6343f7fcb2906f71cc7c449 |
ggml-model-q8_0.bin |
q8_0 GGML | 1.5 GB | e878f611bf3ef407cff3c1e2ecf5f60dc92bc6dd09a85b22166731c0d92764a1 |
Checksums are also provided in SHA256SUMS.
Usage with whisper.cpp
Build whisper.cpp, then run:
./build/bin/whisper-cli \
-m ggml-model-q5_0.bin \
-f audio.wav \
-l zh
Or use the fp16 model:
./build/bin/whisper-cli \
-m ggml-model.bin \
-f audio.wav \
-l zh
Notes
whisper.cppcurrently uses GGML model files for Whisper models.- I am not publishing a GGUF variant here because that is not the standard/runtime-compatible Whisper artifact for
whisper.cpptoday. - If an official Whisper GGUF path becomes stable, this export pipeline can be extended.
Provenance
- Source model:
MediaTek-Research/Breeze-ASR-26 - Conversion path: HF Transformers checkpoint →
whisper.cppGGML - Quantization:
whisper-quantizewithq5_0andq8_0
Model tree for doggy8088/Breeze-ASR-26-GGML
Base model
openai/whisper-large-v2 Finetuned
MediaTek-Research/Breeze-ASR-26