Lexsi/audit-recover-apply_lox-llama31-8b-dolly
8B • Updated • 32
Frontier research around Safe and aligned intelligence
Forgetting That Sticks: Quantization-Permanent Unlearning via Circuit Attribution
$C$-$ΔΘ$: Circuit-Restricted Weight Arithmetic for Selective Refusal