Zen Meta & Tools
Collection
Meta repos for the Zen family β index, blog, training recipes. β’ 3 items β’ Updated
How to use zenlm/zen-training with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-generation", model="zenlm/zen-training") # Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("zenlm/zen-training", dtype="auto")How to use zenlm/zen-training with vLLM:
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "zenlm/zen-training"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "zenlm/zen-training",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'docker model run hf.co/zenlm/zen-training
How to use zenlm/zen-training with SGLang:
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
--model-path "zenlm/zen-training" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "zenlm/zen-training",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'docker run --gpus all \
--shm-size 32g \
-p 30000:30000 \
-v ~/.cache/huggingface:/root/.cache/huggingface \
--env "HF_TOKEN=<secret>" \
--ipc=host \
lmsysorg/sglang:latest \
python3 -m sglang.launch_server \
--model-path "zenlm/zen-training" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "zenlm/zen-training",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'How to use zenlm/zen-training with Docker Model Runner:
docker model run hf.co/zenlm/zen-training
Unified Training Platform for All Zen Models
Train any Zen model with any dataset combination from HuggingFace. Everything runs directly from HF datasets - no local storage needed!
Language Models:
zen-nano (0.6B) - Edge deploymentzen-eco (4B) - Balanced performancezen-omni (7B) - Multi-taskzen-coder (14B) - Code generationzen-next (32B) - Frontier performanceVision-Language Models:
zen-vl-4b - Efficient VL with function callingzen-vl-8b - Enhanced VL capabilitieszen-vl-30b - Maximum VL performanceAgent Training (ADP):
Function Calling:
Instruction Tuning:
4B Models (A10G - 24GB):
8B Models (A100 - 40GB):
32B Models (A100 - 80GB):
ADP Synatra (80%) + xLAM (20%)
= Strong agent + quality function calling
Code Feedback (70%) + Alpaca (30%)
= Code expertise + general instruction following
ADP (all configs) + xLAM
= Complete vision-language agent training
Apache 2.0
@software{zen-training-2025,
title={Zen Training: Unified Training Platform for Zen Models},
author={Zen AI Team},
year={2025},
url={https://huggingface.co/spaces/zenlm/zen-training}
}
@article{adp2024,
title={Agent Data Protocol},
author={NeuLab},
journal={arXiv preprint arXiv:2510.24702},
year={2024}
}
@dataset{xlam2024,
title={xLAM Function Calling Dataset},
author={Salesforce Research},
year={2024}
}