Alan Gunning
AI & ML interests
Organizations
-
Runtime errorFeatured491
YOLO World
π₯491Detect objects in images or videos
-
Runtime errorFeatured277
CoTracker
π¨277Track points in a video
-
PausedFeatured314
PaliGemma Demo
π€²314Annotate and describe images with text prompts
-
Running on ZeroFeatured827
Florence 2
π827Generate captions, detect objects, and segment images with AI
-
partypress/partypress-monolingual-ireland
Text Classification β’ Updated β’ 6 -
pyannote/speaker-diarization-3.1
Automatic Speech Recognition β’ Updated β’ 13M β’ 1.54k -
Runtime errorFeatured330
Video Dubbing
π330Create dubbed videos in multiple languages
-
speechbrain/sepformer-wham-enhancement
Audio-to-Audio β’ Updated β’ 145 β’ 32
-
Running2
Pdf To Clean Txt
π’2Takes a pdf, cleans and outputs a txt
-
Running4
Historical OCR
β4advanced OCR application for historical document analysis
-
Running3
FadedTextRestoration
π₯3Restore faded text from images
-
Running14
Scanned Document Denoise Reconstruct
β‘14Clean and restore noisy scanned documents
-
Runtime errorFeatured2.77k
XTTS
πΈ2.77kGenerate speech from text using a reference voice
-
coqui/XTTS-v2
Text-to-Speech β’ Updated β’ 6.5M β’ 3.39k -
myshell-ai/OpenVoice
Text-to-Speech β’ Updated β’ 488 -
RunningFeatured1.12k
OpenVoice
π€1.12kClone a voice and generate speech from your text
-
h94/IP-Adapter-FaceID
Text-to-Image β’ Updated β’ 221k β’ 1.82k -
Lykon/dreamshaper-xl-v2-turbo
Text-to-Image β’ Updated β’ 20.6k β’ 71 -
RunDiffusion/Juggernaut-XL-v9
Text-to-Image β’ Updated β’ 74.8k β’ 299 -
Running2.78k
OutfitAnyone
π’2.78kGenerate virtual tryβon images for any person and clothing
-
openai/whisper-large-v3
Automatic Speech Recognition β’ Updated β’ 6.08M β’ β’ 5.39k -
nvidia/canary-1b
Automatic Speech Recognition β’ Updated β’ 1.83k β’ 457 -
Running61
Insanelyfastwhisper
π»61Convert audio to subtitles
-
j-macnamara/wav2vec2-large-xls-r-2b-Irish-gaIE
Automatic Speech Recognition β’ 2B β’ Updated β’ 1
-
speechbrain/sepformer-wham-enhancement
Audio-to-Audio β’ Updated β’ 145 β’ 32 -
speechbrain/sepformer-whamr-enhancement
Audio-to-Audio β’ Updated β’ 132 β’ 13 -
speechbrain/sepformer-dns4-16k-enhancement
Audio-to-Audio β’ Updated β’ 164 β’ 27 -
speechbrain/sepformer-wham16k-enhancement
Audio-to-Audio β’ Updated β’ 250 β’ 32
-
Running2
Pdf To Clean Txt
π’2Takes a pdf, cleans and outputs a txt
-
Running4
Historical OCR
β4advanced OCR application for historical document analysis
-
Running3
FadedTextRestoration
π₯3Restore faded text from images
-
Running14
Scanned Document Denoise Reconstruct
β‘14Clean and restore noisy scanned documents
-
Runtime errorFeatured2.77k
XTTS
πΈ2.77kGenerate speech from text using a reference voice
-
coqui/XTTS-v2
Text-to-Speech β’ Updated β’ 6.5M β’ 3.39k -
myshell-ai/OpenVoice
Text-to-Speech β’ Updated β’ 488 -
RunningFeatured1.12k
OpenVoice
π€1.12kClone a voice and generate speech from your text
-
h94/IP-Adapter-FaceID
Text-to-Image β’ Updated β’ 221k β’ 1.82k -
Lykon/dreamshaper-xl-v2-turbo
Text-to-Image β’ Updated β’ 20.6k β’ 71 -
RunDiffusion/Juggernaut-XL-v9
Text-to-Image β’ Updated β’ 74.8k β’ 299 -
Running2.78k
OutfitAnyone
π’2.78kGenerate virtual tryβon images for any person and clothing
-
Runtime errorFeatured491
YOLO World
π₯491Detect objects in images or videos
-
Runtime errorFeatured277
CoTracker
π¨277Track points in a video
-
PausedFeatured314
PaliGemma Demo
π€²314Annotate and describe images with text prompts
-
Running on ZeroFeatured827
Florence 2
π827Generate captions, detect objects, and segment images with AI
-
openai/whisper-large-v3
Automatic Speech Recognition β’ Updated β’ 6.08M β’ β’ 5.39k -
nvidia/canary-1b
Automatic Speech Recognition β’ Updated β’ 1.83k β’ 457 -
Running61
Insanelyfastwhisper
π»61Convert audio to subtitles
-
j-macnamara/wav2vec2-large-xls-r-2b-Irish-gaIE
Automatic Speech Recognition β’ 2B β’ Updated β’ 1
-
partypress/partypress-monolingual-ireland
Text Classification β’ Updated β’ 6 -
pyannote/speaker-diarization-3.1
Automatic Speech Recognition β’ Updated β’ 13M β’ 1.54k -
Runtime errorFeatured330
Video Dubbing
π330Create dubbed videos in multiple languages
-
speechbrain/sepformer-wham-enhancement
Audio-to-Audio β’ Updated β’ 145 β’ 32
-
speechbrain/sepformer-wham-enhancement
Audio-to-Audio β’ Updated β’ 145 β’ 32 -
speechbrain/sepformer-whamr-enhancement
Audio-to-Audio β’ Updated β’ 132 β’ 13 -
speechbrain/sepformer-dns4-16k-enhancement
Audio-to-Audio β’ Updated β’ 164 β’ 27 -
speechbrain/sepformer-wham16k-enhancement
Audio-to-Audio β’ Updated β’ 250 β’ 32