SmolLM-360M GGUF

This is a GGUF conversion of HuggingFaceTB/SmolLM-360M, generated using llama.cpp's convert_hf_to_gguf.py.

Files

  • smollm360.gguf โ€“ FP16 GGUF weights
  • smollm360.Q4_K_M.gguf โ€“ Quantized GGUF weights (optional)
  • config.json โ€“ Model architecture metadata
  • Tokenizer files โ€“ copied from the original repo

Usage with Ollama

Create a Modelfile:

FROM ./smollm360.gguf
PARAMETER temperature 0.1
SYSTEM "You are a helpful assistant. Answer questions directly and clearly."

Then run

ollama create smollm360 -f modelfile

and finally

ollama run smollm360

Downloads last month
37
GGUF
Model size
0.4B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for rohits98/smollm360-ollama-test

Quantized
(87)
this model