UBC-NLP/Casablanca
Viewer • Updated • 13.6k • 962 • 32
This model is a fine-tuned version of OpenAI's Whisper-tiny specifically for Algerian dialect automatic speech recognition (ASR).
from transformers import WhisperForConditionalGeneration, WhisperProcessor
import torch
import numpy as np
# Load model and processor
model = WhisperForConditionalGeneration.from_pretrained("MohammedNasri/whisper-algerian-dialect")
processor = WhisperProcessor.from_pretrained("MohammedNasri/whisper-algerian-dialect")
# Process audio (audio_array should be 16kHz mono)
inputs = processor(audio_array, sampling_rate=16000, return_tensors="pt")
# Generate transcription
with torch.no_grad():
predicted_ids = model.generate(inputs["input_features"])
transcription = processor.batch_decode(predicted_ids, skip_special_tokens=True)[0]
print(transcription)
The model was trained on the Algeria subset of the UBC-NLP/Casablanca dataset, which contains authentic Algerian dialect audio recordings with transcriptions.
If you use this model, please cite:
@misc{whisper-algerian-dialect,
title={Whisper Fine-tuned for Algerian Dialect},
author={Mohammed Nasri},
year={2024},
publisher={Hugging Face},
url={https://huggingface.co/MohammedNasri/whisper-algerian-dialect}
}