How to use from Ollama:
ollama run hf.co/Severian/Jamba-900M-GGUF:BF16

Jamba 900M GGUF

This is the first GGUF of the new Jamba architecture, support for which was recently hacked into llama.cpp. It was made using this branch: https://github.com/ggerganov/llama.cpp/tree/compilade/refactor-kv-cache

Model: pszemraj/jamba-900M-v0.13-KIx2
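
Because this GGUF comes from an experimental branch, it can be worth sanity-checking a downloaded file before loading it. The following is a minimal sketch, not part of this repository: it assumes the little-endian GGUF header layout used by format version 2 and later (a 4-byte `GGUF` magic, a 32-bit version, then 64-bit tensor and metadata counts). The function name `read_gguf_header` is hypothetical.

```python
import struct

GGUF_MAGIC = b"GGUF"
HEADER_SIZE = 24  # 4-byte magic + uint32 version + two uint64 counts


def read_gguf_header(path):
    """Return (version, tensor_count, metadata_kv_count) from a GGUF file.

    Assumption: little-endian header as laid out in GGUF version 2 and later.
    """
    with open(path, "rb") as f:
        header = f.read(HEADER_SIZE)

    # Reject files that are too short or do not start with the GGUF magic.
    if len(header) < HEADER_SIZE or header[:4] != GGUF_MAGIC:
        raise ValueError(f"{path} does not look like a GGUF file")

    version, tensor_count, kv_count = struct.unpack("<IQQ", header[4:])

    # Version 1 stored the counts as 32-bit values, so the fields parsed
    # above would be wrong for it; refuse rather than return bad numbers.
    if version < 2:
        raise ValueError(f"unsupported GGUF version: {version}")

    return version, tensor_count, kv_count
```

Calling `read_gguf_header("model.gguf")` (the file name is a placeholder for whichever file you downloaded) returns the three header fields, or raises `ValueError` if the file is truncated or is not GGUF at all. It does not check that the tensors themselves are intact.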

Format: GGUF
Model size: 0.9B params
Architecture: jamba
Precision: 16-bit
