How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="vagrillo/syncro-qwen3-4b-instruct-2507",
	filename="",
)
llm.create_chat_completion(
	messages = "No input example has been defined for this model task."
)

syncro-qwen3-4b-instruct-2507 - GGUF

This model was finetuned and converted to GGUF format using Unsloth. and using a dataset extracted from "Syncro" a novel written by Salvatore Sanfilippo.

Ask to the model "Who is Emily?" and start your journey. Example usage:

  • For text only LLMs: llama-cli --hf repo_id/model_name -p "Who is studying the GW190521 event?"

This is an allucination generator!

image

image

image

Available Model files:

  • qwen3-4b-instruct-2507.Q8_0.gguf
  • qwen3-4b-instruct-2507.Q4_K_M.gguf

Ollama

An Ollama Modelfile is included for easy deployment.

Downloads last month
227
GGUF
Model size
4B params
Architecture
qwen3
Hardware compatibility
Log In to add your hardware

4-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support