Kimani James

IrieDinamik

AI & ML interests

None yet

Recent Activity

updated a collection 8 days ago

Vox Jot - Creative Audio Verified

updated a collection 8 days ago

Vox Jot - Creative Audio Verified

updated a collection 8 days ago

Vox Jot - Creative Audio Verified

View all activity

Organizations

None yet

IrieDinamik 's collections 10

Vox Jot - Creative Audio Verified

Verified Story Studio creative-audio models and runtimes for Vox Jot.

IrieDinamik/vox-jot-creative-audio-runtime

Updated 8 days ago
stabilityai/stable-audio-3-optimized

Text-to-Audio • Updated 4 days ago • 16
cvssp/audioldm2-music

Updated Apr 16, 2024 • 2.22k • 29
cvssp/audioldm2

Updated Apr 16, 2024 • 22.9k • 69

Vox Jot – Speech Analysis Runtime

Public runtime group for Vox Jot Python-sidecar file ASR and speaker isolation dependencies.

IrieDinamik/vox-jot-speech-analysis-runtime

Automatic Speech Recognition • Updated 21 days ago

Vox Jot – Speaker Isolation Verified

Curated speaker diarization and isolation models verified for Vox Jot file transcription.

pyannote/speaker-diarization-community-1

Automatic Speech Recognition • Updated Sep 29, 2025 • 2.76M • 461
pyannote/speaker-diarization-3.1

Automatic Speech Recognition • Updated May 10, 2024 • 9.65M • 2.12k
BUT-FIT/diarizen-wavlm-large-s80-md-v2

Voice Activity Detection • Updated Dec 9, 2025 • 1.64k • 13
nvidia/diar_sortformer_4spk-v1

Automatic Speech Recognition • 0.1B • Updated Dec 15, 2025 • 13.3k • 139

Vox Jot – OCR Verified

Curated on-device OCR models verified for Vox Jot. Image-to-text and document scanning models optimized for local inference on-device.

IrieDinamik/ocr-tessdata-best

Updated 18 days ago
IrieDinamik/ocr-nemotron-ocr-v2

Updated 18 days ago • 15
IrieDinamik/ocr-qwen2-5-vl-3b

Image-Text-to-Text • 4B • Updated 18 days ago • 22
IrieDinamik/ocr-glm-ocr

Image-Text-to-Text • 1B • Updated 18 days ago • 32

Vox Jot – TTS Verified

Curated on-device TTS models verified for Vox Jot speech synthesis. Ranked: Fastest → Balanced → Best Quality → Voice Cloning. MIT/Apache licensed.

rhasspy/piper-voices

Updated 18 days ago • 532
microsoft/speecht5_tts

Text-to-Speech • Updated Nov 8, 2023 • 121k • 832
onnx-community/Kokoro-82M-v1.0-ONNX

Text-to-Speech • Updated Feb 8, 2025 • 633k • 224
hexgrad/Kokoro-82M

Text-to-Speech • Updated Apr 10, 2025 • 13.4M • • 6.25k

Vox Jot - TTS Candidates

Candidate on-device TTS models under evaluation for Vox Jot. Models here are not ranked or verified until full Vox Jot benchmark suites pass.

Supertone/supertonic-3

Text-to-Speech • Updated 15 days ago • 59k • 774

ML Models

Machine Learning Models

AngelSlim/Hy-MT1.5-1.8B-1.25bit

Translation • 2B • Updated 7 days ago • 17.6k • 189
Supertone/supertonic-3

Text-to-Speech • Updated 15 days ago • 59k • 774

Vox Jot – File ASR Verified

Curated file-transcription ASR models verified for Vox Jot. File/audio engines, not live dictation hot path.

ibm-granite/granite-speech-4.1-2b

Automatic Speech Recognition • 2B • Updated 14 days ago • 604k • 114
CohereLabs/cohere-transcribe-03-2026

Automatic Speech Recognition • 2B • Updated 4 days ago • 315k • 969
Systran/faster-whisper-large-v3

Automatic Speech Recognition • Updated Nov 23, 2023 • 963k • 589

Vox Jot – LLM Verified

Curated on-device LLM/GGUF models verified for Vox Jot. Small, fast instruct models optimized for local inference on-device.

IrieDinamik/LiquidAI-LFM2.5-Audio-1.5B-GGUF

1B • Updated Apr 28 • 225
IrieDinamik/LiquidAI-LFM2-1.2B-Tool-GGUF

Text Generation • 1B • Updated Apr 28 • 96
bartowski/Llama-3.2-3B-Instruct-GGUF

Text Generation • 3B • Updated Oct 8, 2024 • 265k • 208
bartowski/Llama-3.2-1B-Instruct-GGUF

Text Generation • 1B • Updated Oct 8, 2024 • 111k • 164

Vox Jot – STT Verified

Curated CTranslate2 Whisper models verified for Vox Jot speech-to-text. Ranked: Fastest → Balanced → Best Quality → Experimental. MIT/Apache licensed,

Systran/faster-whisper-tiny

Automatic Speech Recognition • Updated Nov 23, 2023 • 507k • 21
Systran/faster-whisper-tiny.en

Automatic Speech Recognition • Updated Nov 23, 2023 • 1.17M • 10
Systran/faster-whisper-base

Automatic Speech Recognition • Updated Nov 23, 2023 • 1.36M • 27
Systran/faster-whisper-base.en

Automatic Speech Recognition • Updated Nov 23, 2023 • 58.1k • 4

Vox Jot - Creative Audio Verified

Verified Story Studio creative-audio models and runtimes for Vox Jot.

IrieDinamik/vox-jot-creative-audio-runtime

Updated 8 days ago
stabilityai/stable-audio-3-optimized

Text-to-Audio • Updated 4 days ago • 16
cvssp/audioldm2-music

Updated Apr 16, 2024 • 2.22k • 29
cvssp/audioldm2

Updated Apr 16, 2024 • 22.9k • 69

Vox Jot - TTS Candidates

Candidate on-device TTS models under evaluation for Vox Jot. Models here are not ranked or verified until full Vox Jot benchmark suites pass.

Supertone/supertonic-3

Text-to-Speech • Updated 15 days ago • 59k • 774

Vox Jot – Speech Analysis Runtime

Public runtime group for Vox Jot Python-sidecar file ASR and speaker isolation dependencies.

IrieDinamik/vox-jot-speech-analysis-runtime

Automatic Speech Recognition • Updated 21 days ago

ML Models

Machine Learning Models

AngelSlim/Hy-MT1.5-1.8B-1.25bit

Translation • 2B • Updated 7 days ago • 17.6k • 189
Supertone/supertonic-3

Text-to-Speech • Updated 15 days ago • 59k • 774

Vox Jot – Speaker Isolation Verified

Curated speaker diarization and isolation models verified for Vox Jot file transcription.

pyannote/speaker-diarization-community-1

Automatic Speech Recognition • Updated Sep 29, 2025 • 2.76M • 461
pyannote/speaker-diarization-3.1

Automatic Speech Recognition • Updated May 10, 2024 • 9.65M • 2.12k
BUT-FIT/diarizen-wavlm-large-s80-md-v2

Voice Activity Detection • Updated Dec 9, 2025 • 1.64k • 13
nvidia/diar_sortformer_4spk-v1

Automatic Speech Recognition • 0.1B • Updated Dec 15, 2025 • 13.3k • 139

Vox Jot – File ASR Verified

Curated file-transcription ASR models verified for Vox Jot. File/audio engines, not live dictation hot path.

ibm-granite/granite-speech-4.1-2b

Automatic Speech Recognition • 2B • Updated 14 days ago • 604k • 114
CohereLabs/cohere-transcribe-03-2026

Automatic Speech Recognition • 2B • Updated 4 days ago • 315k • 969
Systran/faster-whisper-large-v3

Automatic Speech Recognition • Updated Nov 23, 2023 • 963k • 589

Vox Jot – OCR Verified

Curated on-device OCR models verified for Vox Jot. Image-to-text and document scanning models optimized for local inference on-device.

IrieDinamik/ocr-tessdata-best

Updated 18 days ago
IrieDinamik/ocr-nemotron-ocr-v2

Updated 18 days ago • 15
IrieDinamik/ocr-qwen2-5-vl-3b

Image-Text-to-Text • 4B • Updated 18 days ago • 22
IrieDinamik/ocr-glm-ocr

Image-Text-to-Text • 1B • Updated 18 days ago • 32

Vox Jot – LLM Verified

Curated on-device LLM/GGUF models verified for Vox Jot. Small, fast instruct models optimized for local inference on-device.

IrieDinamik/LiquidAI-LFM2.5-Audio-1.5B-GGUF

1B • Updated Apr 28 • 225
IrieDinamik/LiquidAI-LFM2-1.2B-Tool-GGUF

Text Generation • 1B • Updated Apr 28 • 96
bartowski/Llama-3.2-3B-Instruct-GGUF

Text Generation • 3B • Updated Oct 8, 2024 • 265k • 208
bartowski/Llama-3.2-1B-Instruct-GGUF

Text Generation • 1B • Updated Oct 8, 2024 • 111k • 164

Vox Jot – TTS Verified

Curated on-device TTS models verified for Vox Jot speech synthesis. Ranked: Fastest → Balanced → Best Quality → Voice Cloning. MIT/Apache licensed.

rhasspy/piper-voices

Updated 18 days ago • 532
microsoft/speecht5_tts

Text-to-Speech • Updated Nov 8, 2023 • 121k • 832
onnx-community/Kokoro-82M-v1.0-ONNX

Text-to-Speech • Updated Feb 8, 2025 • 633k • 224
hexgrad/Kokoro-82M

Text-to-Speech • Updated Apr 10, 2025 • 13.4M • • 6.25k

Vox Jot – STT Verified

Curated CTranslate2 Whisper models verified for Vox Jot speech-to-text. Ranked: Fastest → Balanced → Best Quality → Experimental. MIT/Apache licensed,

Systran/faster-whisper-tiny

Automatic Speech Recognition • Updated Nov 23, 2023 • 507k • 21
Systran/faster-whisper-tiny.en

Automatic Speech Recognition • Updated Nov 23, 2023 • 1.17M • 10
Systran/faster-whisper-base

Automatic Speech Recognition • Updated Nov 23, 2023 • 1.36M • 27
Systran/faster-whisper-base.en

Automatic Speech Recognition • Updated Nov 23, 2023 • 58.1k • 4