Instructions to use Qwen/Qwen3.6-27B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Qwen/Qwen3.6-27B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-text-to-text", model="Qwen/Qwen3.6-27B") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] pipe(text=messages)# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("Qwen/Qwen3.6-27B") model = AutoModelForImageTextToText.from_pretrained("Qwen/Qwen3.6-27B") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] inputs = processor.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Inference
- Notebooks
- Google Colab
- Kaggle
- AMD Developer Cloud
- Local Apps
- vLLM
How to use Qwen/Qwen3.6-27B with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "Qwen/Qwen3.6-27B" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Qwen/Qwen3.6-27B", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker
docker model run hf.co/Qwen/Qwen3.6-27B
- SGLang
How to use Qwen/Qwen3.6-27B with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "Qwen/Qwen3.6-27B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Qwen/Qwen3.6-27B", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "Qwen/Qwen3.6-27B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Qwen/Qwen3.6-27B", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }' - Docker Model Runner
How to use Qwen/Qwen3.6-27B with Docker Model Runner:
docker model run hf.co/Qwen/Qwen3.6-27B
My deepest respect and admiration goes out to the Qwen team!!
I've been waiting for so long, so long, so long. I particularly love the 27B model in Qwen 3.5; it was the only model that provided satisfactory answers to my scientific structure benchmarks and material property inferences, so I've believed since the voting stage that Qwen 3.6 should include the 27B. I was a little disappointed when Qwen 3.6-35B-A3B was released first, but it was high-performance, so I told myself that even if the 27B might not be released, I should be patient since they'd made a high-performance model open-source. But the Qwen team has made my dream come true! Thank you!!
absolutely!
thanks to the Qwen team!
I bow to your magnificence. Thank you for freeing us from closed/capitalist AI prison.
HONEY WAKE UP QWEN JUST DROPPED AGAIN !!!
Big love to Qwen team! I run AI communities in northern Thailand, would love to do a developer QA if you'd be open to it
源神,启动!
Many thanks from my side as well. Cant wait for the 122b variant as well :D
Absolutly. Thank you so much ❤️
What the fuck are they putting in those models, HOW IS THIS THIS GOOD???
Blown away from what I'm seeing in early testing. Reasoning and problem solving approach is noticeably better than the 35B-A3B variant
i was sad cause they ended free qwen api but 3.6 not bad for local coding.
And I support what Alibaba is doing! It's pure joy... and absolute excellence! I bow to you! Thank you!
感谢阿里的开源!
LiSAN-AL-GAiB!! 🙇♂️ Now i have a choice to say No to the hoarding rituals of closed-source API. Something offline & reliable!! 🎉
great job for human