MAAT: Multi-phase Adapter-Aware Targeted Unlearning
Paper • 2605.30514 • Published • 5
How to use Novaspree/tofu-Gemma3-adapter-1 with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("google/gemma-3-4b-it")
model = PeftModel.from_pretrained(base_model, "Novaspree/tofu-Gemma3-adapter-1")This repository contains a LoRA adapter for google/gemma-3-4b-it fine-tuned using the MAAT (Multi-phase Adapter-Aware Targeted Unlearning) framework. This model was developed to address challenges in machine unlearning, particularly concerning "Why-type" questions that involve complex causal and relational knowledge.
MAAT is a three-phase unlearning framework that operates exclusively on LoRA adapter weights. It aims to achieve high forgetting on specific targeted facts while maintaining high retention on other knowledge.
The framework consists of:
@article{yagnik2024maat,
title={MAAT: Multi-phase Adapter-Aware Targeted Unlearning},
author={Yagnik, Suryash and Gaur, Shubham and Thakur, Saksham and Jain, Vinija and Chadha, Aman and Das, Amitava},
journal={arXiv preprint arXiv:2605.30514},
year={2024}
}