Automatic Speech Recognition
NeMo
Safetensors
PyTorch
sortformer
speaker-diarization
speaker-recognition
speech
audio
Transformer
FastConformer
Conformer
NEST
NeMo
Eval Results (legacy)
Instructions to use nvidia/diar_sortformer_4spk-v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- NeMo
How to use nvidia/diar_sortformer_4spk-v1 with NeMo:
import nemo.collections.asr as nemo_asr asr_model = nemo_asr.models.ASRModel.from_pretrained("nvidia/diar_sortformer_4spk-v1") transcriptions = asr_model.transcribe(["file.wav"]) - Notebooks
- Google Colab
- Kaggle
| { | |
| "feature_extractor": { | |
| "feature_extractor_type": "SortformerFeatureExtractor", | |
| "feature_size": 80, | |
| "hop_length": 160, | |
| "n_fft": 512, | |
| "padding_side": "right", | |
| "padding_value": 0.0, | |
| "preemphasis": 0.97, | |
| "processor_class": "SortformerProcessor", | |
| "return_attention_mask": true, | |
| "sampling_rate": 16000, | |
| "win_length": 400 | |
| }, | |
| "processor_class": "SortformerProcessor" | |
| } | |