Text Generation
Transformers
Safetensors
qwen2
conversational
text-generation-inference
How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="2stacks/s1.1-0.5B")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("2stacks/s1.1-0.5B")
model = AutoModelForCausalLM.from_pretrained("2stacks/s1.1-0.5B")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
Quick Links

Model Summary

s1.1-0.5B is a sucessor of s1 with better reasoning performance by leveraging reasoning traces from r1 instead of Gemini. This model was created simply to test the process used to train the original s1.1 cited below using consumer grade GPUs.

Thanks to Ryan Marten for helping generate r1 traces for s1K.

Use

The model usage is documented here.

Downloads last month
1
Safetensors
Model size
0.6B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for 2stacks/s1.1-0.5B

Finetuned
(790)
this model
Quantizations
1 model

Dataset used to train 2stacks/s1.1-0.5B

Paper for 2stacks/s1.1-0.5B