MAAT: Multi-phase Adapter-Aware Targeted Unlearning

This repository contains a LoRA adapter for Gemma-3-4b-it trained using MAAT (Multi-phase Adapter-Aware Targeted Unlearning), a three-phase framework designed for effective machine unlearning.

Model Description

MAAT targets causal unlearning, the removal of knowledge needed to answer "Why"-type questions, while maintaining high retention of other knowledge. The framework operates exclusively on LoRA adapter weights and reaches a new operating point on the forget-retain Pareto frontier by combining three phases:

Gradient-Projected Ascent: Orthogonally projects forget gradients to remove components that conflict with the retain set.
Structural Compression and Task Negation: Uses SVD rank-dimension pruning and task vector negation to selectively erase knowledge.
Multi-Objective Utility Repair Engine: A joint parameter alignment loop over the retain set to recover utility using KL divergence and hidden-state alignment.
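
The core operations named in the three phases above can be illustrated on plain Python lists. This is a minimal sketch under stated assumptions, not the MAAT implementation: the function names are hypothetical, the projection is applied only when the forget and retain gradients conflict, the task-vector scaling factor alpha is an illustrative parameter, and SVD pruning and hidden-state alignment are left out.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def project_forget_gradient(g_forget, g_retain):
    """Drop the part of g_forget that would raise the retain loss.

    Assumption: the forget step is gradient ascent (theta += lr * g_forget),
    so the step conflicts with retention when g_forget . g_retain > 0.
    """
    overlap = dot(g_forget, g_retain)
    norm_sq = dot(g_retain, g_retain)
    if overlap <= 0 or norm_sq == 0:
        return list(g_forget)  # no conflict: leave the gradient unchanged
    scale = overlap / norm_sq
    # Subtract the projection onto g_retain; the result is orthogonal to it.
    return [gf - scale * gr for gf, gr in zip(g_forget, g_retain)]


def negate_task_vector(reference, forget_tuned, alpha=1.0):
    """Task negation: reference - alpha * (forget_tuned - reference)."""
    return [r - alpha * (f - r) for r, f in zip(reference, forget_tuned)]


def kl_divergence(p, q):
    """KL(p || q) for discrete distributions; assumes q > 0 wherever p > 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


projected = project_forget_gradient([1.0, 1.0], [1.0, 0.0])
negated = negate_task_vector([1.0, 2.0], [1.5, 1.0])
```

After projection, a small ascent step along the forget gradient leaves the retain loss unchanged to first order, which is the intuition behind removing the conflicting component. In practice these operations would run on flattened LoRA adapter tensors rather than Python lists.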

Developed by: Suryash Yagnik, Shubham Gaur, Saksham Thakur, Vinija Jain, Aman Chadha, Amitava Das
Base Model: google/gemma-3-4b-it
Paper: MAAT: Multi-phase Adapter-Aware Targeted Unlearning
Repository: GitHub: Machine-Unlearning

Evaluation

The model was evaluated using 5WBENCH, a balanced 5,000-sample benchmark covering Who, What, When, Where, and Why categories, specifically designed to quantify causal unlearning failures.
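
Because the benchmark is organized by question category, results are most informative when reported per category rather than as a single aggregate. The sketch below shows one way to do that. The record fields `category` and `correct` are hypothetical and do not reflect the actual 5WBENCH schema or scoring protocol.

```python
from collections import defaultdict

CATEGORIES = ("Who", "What", "When", "Where", "Why")


def per_category_accuracy(records):
    """Return accuracy per question category for the categories present."""
    totals = defaultdict(int)
    hits = defaultdict(int)
    for record in records:
        totals[record["category"]] += 1
        hits[record["category"]] += int(record["correct"])
    return {c: hits[c] / totals[c] for c in CATEGORIES if totals[c]}


# Toy records for illustration only; these are not benchmark results.
sample = [
    {"category": "Who", "correct": False},
    {"category": "Who", "correct": False},
    {"category": "Why", "correct": True},
    {"category": "Why", "correct": False},
]
scores = per_category_accuracy(sample)
```

On a forget set, a "Why" score that stays well above the other categories would suggest that causal knowledge survived unlearning.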

Citation

@article{yagnik2025maat,
  title={MAAT: Multi-phase Adapter-Aware Targeted Unlearning},
  author={Yagnik, Suryash and Gaur, Shubham and Thakur, Saksham and Jain, Vinija and Chadha, Aman and Das, Amitava},
  journal={arXiv preprint arXiv:2605.30514},
  year={2025}
}

Model tree for Novaspree/factify-Gemma3-adapter-1

Base model: google/gemma-3-4b-pt
Finetuned: google/gemma-3-4b-it
Adapter: this model
