prithivMLmods commited on
Commit
8c9d76b
·
verified ·
1 Parent(s): 511c8ee

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -23,6 +23,8 @@ datasets:
23
  - Jackrong/GLM-5.1-Reasoning-1M-Cleaned
24
  ---
25
 
 
 
26
  # **Q3.5-9B-GLM-5.1-DA**
27
 
28
  > **Q3.5-9B-GLM-5.1-DA** (Qwen3.5 GLM Distilled-Abliterated) is a reasoning-focused model built on top of **Qwen/Qwen3.5-9B** through the **prithivMLmods/Qwen3.5-9B-Unredacted-MAX** base. The model is optimized for long-context mathematical reasoning, structured problem solving, and context-aware generation using distilled reasoning traces derived from GLM-5.1 reasoning datasets combined with refusal direction analysis and ablation-based training strategies to reduce internal refusal behaviors while preserving strong reasoning and instruction-following performance.
@@ -137,4 +139,4 @@ print(output_text)
137
  * **User Responsibility**: Requires careful and ethical usage
138
  * **Mathematical Hallucinations**: Complex reasoning tasks may still contain logical or numerical inconsistencies
139
  * **Abliteration Trade-offs**: Reduced refusal behaviors may impact safety alignment and output filtering
140
- * **High Compute Demand**: Optimized inference or quantization may still be required for efficient deployment
 
23
  - Jackrong/GLM-5.1-Reasoning-1M-Cleaned
24
  ---
25
 
26
+ ![1](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/7igbBtaI66zhkqycS63WV.png)
27
+
28
  # **Q3.5-9B-GLM-5.1-DA**
29
 
30
  > **Q3.5-9B-GLM-5.1-DA** (Qwen3.5 GLM Distilled-Abliterated) is a reasoning-focused model built on top of **Qwen/Qwen3.5-9B** through the **prithivMLmods/Qwen3.5-9B-Unredacted-MAX** base. The model is optimized for long-context mathematical reasoning, structured problem solving, and context-aware generation using distilled reasoning traces derived from GLM-5.1 reasoning datasets combined with refusal direction analysis and ablation-based training strategies to reduce internal refusal behaviors while preserving strong reasoning and instruction-following performance.
 
139
  * **User Responsibility**: Requires careful and ethical usage
140
  * **Mathematical Hallucinations**: Complex reasoning tasks may still contain logical or numerical inconsistencies
141
  * **Abliteration Trade-offs**: Reduced refusal behaviors may impact safety alignment and output filtering
142
+ * **High Compute Demand**: Optimized inference or quantization may still be required for efficient deployment