DP2Unlearning: An efficient and guaranteed unlearning framework for LLMs

Al Mahmud, T; Jebreel, N; Domingo-Ferrer, J; Sánchez, D

doi:10.1016/j.neunet.2025.107879

Identification data

Identifier: imarina:9463649

Handle: https://hdl.handle.net/20.500.11797/imarina9463649

Authors: Al Mahmud, T; Jebreel, N; Domingo-Ferrer, J; Sánchez, D

Abstract:
Large language models (LLMs) have recently revolutionized language processing tasks but have also brought ethical and legal issues. LLMs have a tendency to memorize potentially private or copyrighted information present in the training data, which might then be delivered to end users at inference time. When this happens, a naive solution is to retrain the model from scratch after excluding the undesired data. Although this guarantees that the target data have been forgotten, it is also prohibitively expensive for LLMs. Approximate unlearning offers a more efficient alternative, as it consists of expost modifications of the trained model itself to prevent undesirable results, but it lacks forgetting guarantees because it relies solely on empirical evidence. In this work, we present DP2Unlearning, a novel LLM unlearning framework that offers formal forgetting guarantees at a significantly lower cost than retraining from scratch on the data to be retained. DP2Unlearning involves training LLMs on textual data protected using epsilon-differential privacy (DP), which later enables efficient unlearning with the guarantees against disclosure associated with the chosen epsilon. Our experiments demonstrate that DP2Unlearning achieves similar model performance post-unlearning, compared to an LLM retraining from scratch on retained data-the gold standard exact unlearning-but at approximately half the unlearning cost. In addition, with a reasonable computational cost, it outperforms approximate unlearning methods at both preserving the utility of the model post-unlearning and effectively forgetting the targeted information. The code of our experiments is available at https://github.com/tamimalmahmud/LLM-Unlearning/tree/main/ DP2Unlearning.
Others:

Link to the original source: https://www.sciencedirect.com/science/article/pii/S0893608025007592?via%3Dihub
APA: Al Mahmud, T; Jebreel, N; Domingo-Ferrer, J; Sánchez, D (2025). DP2Unlearning: An efficient and guaranteed unlearning framework for LLMs. Neural Networks, 192(), 107879-. DOI: 10.1016/j.neunet.2025.107879
Paper original source: Neural Networks. 192 107879-
Article's DOI: 10.1016/j.neunet.2025.107879
Journal publication year: 2025-12-01
Entity: Universitat Rovira i Virgili
Paper version: info:eu-repo/semantics/publishedVersion
Record's date: 2026-02-09
URV's Author/s: Domingo Ferrer, Josep / Sánchez Ruenes, David
Department: Enginyeria Informàtica i Matemàtiques
Licence document URL: https://repositori.urv.cat/ca/proteccio-de-dades/
Publication Type: Journal Publications
Author, as appears in the article.: Al Mahmud, T; Jebreel, N; Domingo-Ferrer, J; Sánchez, D
licence for use: https://creativecommons.org/licenses/by/3.0/es/
Thematic Areas: Artificial intelligence, Astronomia / física, Biotecnología, Ciência da computação, Ciências agrárias i, Ciencias sociales, Cognitive neuroscience, Computer science, artificial intelligence, Economia, Engenharias iii, Engenharias iv, Filosofia/teologia:subcomissão filosofia, General medicine, Interdisciplinar, Matemática / probabilidade e estatística, Neurosciences, Psicología, Psychology, Química
Author's mail: josep.domingo@urv.cat, josep.domingo@urv.cat, josep.domingo@urv.cat, david.sanchez@urv.cat

Keywords:

Approximate unlearning
Differential privacy
Divergence
Exact unlearning
Llm unlearning
Mode
Privacy-preserving ll
Privacy-preserving llm
Artificial Intelligence
Cognitive Neuroscience
Computer Science
Neurosciences
Astronomia / física
Biotecnología
Ciência da computação
Ciências agrárias i
Ciencias sociales
Economia
Engenharias iii
Engenharias iv
Filosofia/teologia:subcomissão filosofia
General medicine
Interdisciplinar
Matemática / probabilidade e estatística
Psicología
Psychology
Química
Documents:

DocumentPrincipal
Cerca a google

DP2Unlearning: An efficient and guaranteed unlearning framework for LLMs

Identification data

Others:

Keywords:

Documents:

Cerca a google