Qwen/Qwen3-VL-30B-A3B-Instruct Image-Text-to-Text β’ 31B β’ Updated Nov 26, 2025 β’ 1.72M β’ β’ 535
Running Featured 560 Vision Arena (Testing VLMs side-by-side) πΌ 560 Analyze images with multiple vision models for labels and boxes
Running on CPU Upgrade Featured 3k The Smol Training Playbook π 3k The secrets to building world-class LLMs
yayayaaa/florence-2-large-ft-moredetailed Image-to-Text β’ 0.8B β’ Updated Dec 13, 2025 β’ 85 β’ 16
meta-llama/Llama-3.2-11B-Vision Image-Text-to-Text β’ 11B β’ Updated Sep 27, 2024 β’ 9.42k β’ 580
Runtime error Featured 515 Florence2 + SAM2 π₯ 515 Segment and caption objects in images and videos
Running on Zero Featured 5.05k FLUX.1 [Schnell] π 5.05k Generate images from text prompts in seconds