Verified Story Studio creative-audio models and runtimes for Vox Jot.
Kimani James
IrieDinamik
·
AI & ML interests
None yet
Recent Activity
updated a collection 8 days ago
Vox Jot - Creative Audio Verified updated a collection 8 days ago
Vox Jot - Creative Audio Verified updated a collection 8 days ago
Vox Jot - Creative Audio VerifiedOrganizations
None yet
Vox Jot – Speech Analysis Runtime
Public runtime group for Vox Jot Python-sidecar file ASR and speaker isolation dependencies.
Vox Jot – Speaker Isolation Verified
Curated speaker diarization and isolation models verified for Vox Jot file transcription.
-
pyannote/speaker-diarization-community-1
Automatic Speech Recognition • Updated • 2.76M • 461 -
pyannote/speaker-diarization-3.1
Automatic Speech Recognition • Updated • 9.65M • 2.12k -
BUT-FIT/diarizen-wavlm-large-s80-md-v2
Voice Activity Detection • Updated • 1.64k • 13 -
nvidia/diar_sortformer_4spk-v1
Automatic Speech Recognition • 0.1B • Updated • 13.3k • 139
Vox Jot – OCR Verified
Curated on-device OCR models verified for Vox Jot. Image-to-text and document scanning models optimized for local inference on-device.
Vox Jot – TTS Verified
Curated on-device TTS models verified for Vox Jot speech synthesis. Ranked: Fastest → Balanced → Best Quality → Voice Cloning. MIT/Apache licensed.
Vox Jot - TTS Candidates
Candidate on-device TTS models under evaluation for Vox Jot. Models here are not ranked or verified until full Vox Jot benchmark suites pass.
ML Models
Machine Learning Models
Vox Jot – File ASR Verified
Curated file-transcription ASR models verified for Vox Jot. File/audio engines, not live dictation hot path.
Vox Jot – LLM Verified
Curated on-device LLM/GGUF models verified for Vox Jot. Small, fast instruct models optimized for local inference on-device.
-
IrieDinamik/LiquidAI-LFM2.5-Audio-1.5B-GGUF
1B • Updated • 225 -
IrieDinamik/LiquidAI-LFM2-1.2B-Tool-GGUF
Text Generation • 1B • Updated • 96 -
bartowski/Llama-3.2-3B-Instruct-GGUF
Text Generation • 3B • Updated • 265k • 208 -
bartowski/Llama-3.2-1B-Instruct-GGUF
Text Generation • 1B • Updated • 111k • 164
Vox Jot – STT Verified
Curated CTranslate2 Whisper models verified for Vox Jot speech-to-text. Ranked: Fastest → Balanced → Best Quality → Experimental. MIT/Apache licensed,
-
Systran/faster-whisper-tiny
Automatic Speech Recognition • Updated • 507k • 21 -
Systran/faster-whisper-tiny.en
Automatic Speech Recognition • Updated • 1.17M • 10 -
Systran/faster-whisper-base
Automatic Speech Recognition • Updated • 1.36M • 27 -
Systran/faster-whisper-base.en
Automatic Speech Recognition • Updated • 58.1k • 4
Vox Jot - Creative Audio Verified
Verified Story Studio creative-audio models and runtimes for Vox Jot.
Vox Jot - TTS Candidates
Candidate on-device TTS models under evaluation for Vox Jot. Models here are not ranked or verified until full Vox Jot benchmark suites pass.
Vox Jot – Speech Analysis Runtime
Public runtime group for Vox Jot Python-sidecar file ASR and speaker isolation dependencies.
ML Models
Machine Learning Models
Vox Jot – Speaker Isolation Verified
Curated speaker diarization and isolation models verified for Vox Jot file transcription.
-
pyannote/speaker-diarization-community-1
Automatic Speech Recognition • Updated • 2.76M • 461 -
pyannote/speaker-diarization-3.1
Automatic Speech Recognition • Updated • 9.65M • 2.12k -
BUT-FIT/diarizen-wavlm-large-s80-md-v2
Voice Activity Detection • Updated • 1.64k • 13 -
nvidia/diar_sortformer_4spk-v1
Automatic Speech Recognition • 0.1B • Updated • 13.3k • 139
Vox Jot – File ASR Verified
Curated file-transcription ASR models verified for Vox Jot. File/audio engines, not live dictation hot path.
Vox Jot – OCR Verified
Curated on-device OCR models verified for Vox Jot. Image-to-text and document scanning models optimized for local inference on-device.
Vox Jot – LLM Verified
Curated on-device LLM/GGUF models verified for Vox Jot. Small, fast instruct models optimized for local inference on-device.
-
IrieDinamik/LiquidAI-LFM2.5-Audio-1.5B-GGUF
1B • Updated • 225 -
IrieDinamik/LiquidAI-LFM2-1.2B-Tool-GGUF
Text Generation • 1B • Updated • 96 -
bartowski/Llama-3.2-3B-Instruct-GGUF
Text Generation • 3B • Updated • 265k • 208 -
bartowski/Llama-3.2-1B-Instruct-GGUF
Text Generation • 1B • Updated • 111k • 164
Vox Jot – TTS Verified
Curated on-device TTS models verified for Vox Jot speech synthesis. Ranked: Fastest → Balanced → Best Quality → Voice Cloning. MIT/Apache licensed.
Vox Jot – STT Verified
Curated CTranslate2 Whisper models verified for Vox Jot speech-to-text. Ranked: Fastest → Balanced → Best Quality → Experimental. MIT/Apache licensed,
-
Systran/faster-whisper-tiny
Automatic Speech Recognition • Updated • 507k • 21 -
Systran/faster-whisper-tiny.en
Automatic Speech Recognition • Updated • 1.17M • 10 -
Systran/faster-whisper-base
Automatic Speech Recognition • Updated • 1.36M • 27 -
Systran/faster-whisper-base.en
Automatic Speech Recognition • Updated • 58.1k • 4