RedHatAI/Qwen2-0.5B-Instruct-quantized.w8a8
Text Generation
• 0.6B • Updated • 10
RedHatAI/Phi-3-mini-128k-instruct-quantized.w4a16
Text Generation
• 0.7B • Updated • 11
• 1
RedHatAI/Qwen2-1.5B-Instruct-quantized.w8a8
Text Generation
• 2B • Updated • 299
RedHatAI/Meta-Llama-3-8B-Instruct-quantized.w8a8
Text Generation
• 8B • Updated • 365
• 2
RedHatAI/Llama-2-7b-chat-quantized.w8a8
Text Generation
• 7B • Updated • 49
• 1
RedHatAI/Phi-3-mini-128k-instruct-quantized.w8a16
Text Generation
• 1B • Updated • 19
RedHatAI/Phi-3-mini-128k-instruct-FP8
Text Generation
• 4B • Updated • 8
RedHatAI/Llama-3.2-3B-Instruct-FP8-dynamic
Text Generation
• 4B • Updated • 6.48k
• 3
RedHatAI/Llama-3.2-1B-Instruct-FP8-dynamic
Text Generation
• 1B • Updated • 1.51M
• 3
RedHatAI/gemma-2-9b-it-quantized.w8a8
Text Generation
• 10B • Updated • 35
• 2
RedHatAI/Phi-3-medium-128k-instruct-quantized.w8a8
Text Generation
• 14B • Updated • 15
• 2
RedHatAI/Phi-3-medium-128k-instruct-quantized.w8a16
Text Generation
• 4B • Updated • 7
• 2
RedHatAI/Phi-3-medium-128k-instruct-FP8
Text Generation
• 14B • Updated • 203
• 5
RedHatAI/Qwen2.5-32B-Instruct-quantized.w8a16
9B • Updated • 3
RedHatAI/Qwen2.5-7B-Instruct-quantized.w8a16
3B • Updated • 58
RedHatAI/Qwen2.5-0.5B-Instruct-quantized.w8a16
0.4B • Updated
RedHatAI/Qwen2.5-72B-Instruct-quantized.w8a8
73B • Updated • 49
RedHatAI/Qwen2.5-32B-Instruct-quantized.w8a8
33B • Updated • 50
RedHatAI/Qwen2.5-32B-quantized.w8a8
33B • Updated
RedHatAI/Meta-Llama-3.1-405B-Instruct-FP8
Text Generation
• 406B • Updated • 586
• 31
RedHatAI/Qwen2.5-3B-Instruct-quantized.w8a8
3B • Updated • 70
RedHatAI/Qwen2.5-1.5B-Instruct-quantized.w8a8
2B • Updated • 98
RedHatAI/SparseLlama-3-8B-pruned_50.2of4
Text Generation
• 8B • Updated • 5
RedHatAI/Llama-3.2-90B-Vision-Instruct-FP8-dynamic
Text Generation
• 89B • Updated • 242
• 11
RedHatAI/Phi-3.5-mini-instruct-FP8-KV
Text Generation
• 4B • Updated • 477
• 2
RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w4a16
Text Generation
• 71B • Updated • 15
• 2
RedHatAI/Mixtral-8x22B-Instruct-v0.1-AutoFP8
Text Generation
• 141B • Updated • 165
• 3
RedHatAI/DeepSeek-Coder-V2-Base-FP8
Text Generation
• 236B • Updated • 10
RedHatAI/DeepSeek-Coder-V2-Instruct-FP8
Text Generation
• 236B • Updated • 330
• 7