Thinking can't be disabled

#2
by CodeFault - opened

Thinking can't be disabled with chat-template-kwargs = {"enable_thinking":false}, as it can with the standard Qwen/Qwen3.5-4B. Modifying the end of the chat template from:

{%- if add_generation_prompt %}
    {{- '<|im_start|>assistant
<think>
' }}
{%- endif %}

to:

{%- if add_generation_prompt %}
    {{- '<|im_start|>assistant\n' }}
    {%- if enable_thinking is defined and enable_thinking is false %}
        {{- '<think>\n\n</think>\n\n' }}
    {%- else %}
        {{- '<think>\n' }}
    {%- endif %}
{%- endif %}

allows you to disable thinking. The replacement block was copied from https://huggingface.co/Qwen/Qwen3.5-4B/blob/main/chat_template.jinja, which was itself fixed on 2026-03-01. This affects your single quantization, Qwen3.5-4B-MicroCoder-Q6_K.gguf.
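
For anyone checking what the patched block should emit, here is a rough plain-Python mirror of its branching. It is only a sketch: generation_prompt is a hypothetical helper name, not part of any library, and passing None is my stand-in for the Jinja variable being undefined. It does not render the real template.

```python
from typing import Optional


def generation_prompt(add_generation_prompt: bool = True,
                      enable_thinking: Optional[bool] = None) -> str:
    """Hypothetical helper mirroring the final block of the patched template.

    enable_thinking=None stands in for the variable being undefined in Jinja.
    """
    if not add_generation_prompt:
        return ""
    prompt = "<|im_start|>assistant\n"
    # The template checks `is defined and ... is false`, which matches only the
    # boolean false. Python's `is False` is equally strict: None, 0 and ""
    # all fall through to the thinking branch.
    if enable_thinking is False:
        # Pre-filled empty think block, so generation starts after </think>.
        prompt += "<think>\n\n</think>\n\n"
    else:
        # Open think block: the model writes its reasoning first.
        prompt += "<think>\n"
    return prompt


if __name__ == "__main__":
    print(repr(generation_prompt(enable_thinking=False)))
    print(repr(generation_prompt()))
```

With the unpatched template both calls would produce the second form, which is why the kwarg has no effect there.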
