Model Overview

Dependencies

To train, fine-tune, or experiment with the model, you need to install NVIDIA NeMo.

For inference, install the toolkit with:

pip install "nemo_toolkit[all]"

How to Use this Model

The model is available for use in the NeMo toolkit, and can be used as a pre-trained checkpoint for inference or for fine-tuning on another dataset.

Load the model weights

import nemo.collections.asr as nemo_asr
asr_model = nemo_asr.models.ASRModel.from_pretrained("DigitalUmuganda/Mbaza-ASR-Afrivoice-660h")

Transcribing using Python

asr_model.transcribe(['<audio_sample>'])
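
Depending on the installed NeMo version, `transcribe` may return plain strings or hypothesis objects that expose a `.text` attribute. The following is a minimal sketch, assuming the result is a list with items of one of those two shapes, that normalizes the output to a list of strings. `to_text` is a hypothetical helper for illustration, not part of NeMo.

```python
def to_text(results):
    """Normalize transcribe() output to a list of plain strings.

    Assumption: each item is either a str or an object with a `.text`
    attribute. Other return shapes are not handled here.
    """
    texts = []
    for item in results:
        if isinstance(item, str):
            # Already a plain transcript string.
            texts.append(item)
        else:
            # Assumed hypothesis-like object carrying the transcript in `.text`.
            texts.append(item.text)
    return texts
```

With the model loaded as above, it could be used as `to_text(asr_model.transcribe(['<audio_sample>']))`.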

Transcribing many audio files

python [NEMO_GIT_FOLDER]/examples/asr/transcribe_speech.py pretrained_name="DigitalUmuganda/Mbaza-ASR-Afrivoice-660h" audio_dir="<DIRECTORY CONTAINING AUDIO FILES>"

Input

This model accepts 16 kHz (16,000 Hz) mono-channel audio (WAV files) as input.
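
Before transcribing, it can help to confirm that a file matches this format. The sketch below uses only Python's standard `wave` module and assumes uncompressed PCM WAV input. `check_wav` is a hypothetical helper name, not part of NeMo.

```python
import wave


def check_wav(path, expected_rate=16000, expected_channels=1):
    """Return (ok, rate, channels) for a PCM WAV file.

    `ok` is True only when the file is sampled at `expected_rate` Hz
    and has `expected_channels` channels.
    """
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        channels = wav.getnchannels()
    ok = rate == expected_rate and channels == expected_channels
    return ok, rate, channels
```

A file that fails the check would need to be resampled to 16 kHz or downmixed to mono with an audio tool of your choice before being passed to the model.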
