MAAT: Multi-phase Adapter-Aware Targeted Unlearning

This repository contains a LoRA adapter for Gemma-3-4b-it trained using MAAT (Multi-phase Adapter-Aware Targeted Unlearning), a three-phase framework designed for effective machine unlearning.

Model Description

MAAT is designed to address the challenges of causal unlearning (answering "Why"-type questions) while maintaining high retention of other knowledge. The framework operates exclusively on LoRA adapter weights and achieves a new operating point on the forget-retain Pareto frontier by combining:

  1. Gradient-Projected Ascent: Orthogonally projects forget gradients to remove components that conflict with the retain set.
  2. Structural Compression and Task Negation: Uses SVD rank-dimension pruning and task vector negation to selectively erase knowledge.
  3. Multi-Objective Utility Repair Engine: A joint parameter alignment loop over the retain set to recover utility using KL divergence and hidden-state alignment.

Evaluation

The model was evaluated using 5WBENCH, a balanced 5,000-sample benchmark covering Who, What, When, Where, and Why categories, specifically designed to quantify causal unlearning failures.

Citation

@article{yagnik2025maat,
  title={MAAT: Multi-phase Adapter-Aware Targeted Unlearning},
  author={Yagnik, Suryash and Gaur, Shubham and Thakur, Saksham and Jain, Vinija and Chadha, Aman and Das, Amitava},
  journal={arXiv preprint arXiv:2605.30514},
  year={2025}
}
Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Novaspree/factify-Gemma3-adapter-1

Adapter
(381)
this model

Paper for Novaspree/factify-Gemma3-adapter-1