RedHatAI/Llama-4-Scout-17B-16E-Instruct-NVFP4 Text Generation • 64B • Updated Nov 21, 2025 • 10.8k • 1
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-FP8-dynamic Image-Text-to-Text • 24B • Updated Oct 29, 2025 • 2.6k • 9
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8 Image-Text-to-Text • 24B • Updated Oct 29, 2025 • 219 • 5
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16 Image-Text-to-Text • 5B • Updated Oct 29, 2025 • 2.46k • 10
RedHatAI/Mistral-Small-24B-Instruct-2501-FP8-dynamic Text Generation • 24B • Updated Oct 29, 2025 • 24.1k • 13
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w8a8 Text Generation • 24B • Updated Oct 29, 2025 • 19.5k • 1
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w4a16 Text Generation • 4B • Updated Oct 29, 2025 • 306 • 1
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-block Text Generation • 236B • Updated Oct 27, 2025 • 17 • 3
RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-block Text Generation • 109B • Updated Oct 27, 2025 • 24 • 3
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8-block Text Generation • 402B • Updated Oct 27, 2025 • 9 • 1
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic Text Generation • 71B • Updated Oct 23, 2025 • 4.16k • 15
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-dynamic Text Generation • 236B • Updated Oct 3, 2025 • 124 • 4
RedHatAI/Qwen2.5-VL-7B-Instruct-quantized.w8a8 Image-Text-to-Text • 8B • Updated Oct 2, 2025 • 46.1k • 9
RedHatAI/granite-3.1-8b-instruct-quantized.w8a8 Text Generation • 8B • Updated Sep 25, 2025 • 139 • 2
RedHatAI/Apertus-70B-Instruct-2509-quantized.w4a16 Text Generation • 11B • Updated Sep 23, 2025 • 105 • 1