Digital Forgetting in Large Language Models: A Survey of Unlearning Methods
April 3, 2024, 4:10 a.m. | Alberto Blanco-Justicia, Najeeb Jebreel, Benet Manzanares, David Sánchez, Josep Domingo-Ferrer, Guillem Collell, Kuan Eeik Tan
cs.CR updates on arXiv.org arxiv.org
Abstract: The objective of digital forgetting is, given a model with undesirable knowledge or behavior, to obtain a new model where the detected issues are no longer present. Motivations for forgetting include privacy protection, copyright protection, elimination of biases and discrimination, and prevention of harmful content generation. Digital forgetting has to be effective (that is, the new model must actually have forgotten the undesired knowledge/behavior), retain the performance of the original model on the desirable tasks, …
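To make the trade-off the abstract describes concrete (forget undesired data while retaining performance on the rest), here is a minimal, illustrative sketch of one of the simplest approximate-unlearning baselines: gradient ascent on a forget set. This is not taken from the survey; the toy logistic-regression model, the data, and the learning rates are all assumptions for illustration only.

```python
# Illustrative unlearning sketch (assumed setup, not the survey's method):
# train a tiny logistic-regression model, then run gradient *ascent* on a
# "forget" subset while continuing descent on the retained data.
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def grad(w, X, y):
    # Gradient of the mean cross-entropy loss w.r.t. the weights.
    return X.T @ (sigmoid(X @ w) - y) / len(y)

# Synthetic data: two Gaussian blobs, one per class.
X = np.vstack([rng.normal(-1, 1, (100, 2)), rng.normal(1, 1, (100, 2))])
y = np.concatenate([np.zeros(100), np.ones(100)])
X = np.hstack([X, np.ones((200, 1))])  # bias column

# 1) Train on all the data.
w = np.zeros(3)
for _ in range(500):
    w -= 0.5 * grad(w, X, y)

# 2) "Forget" the first 20 samples: ascend their loss for a few steps,
#    while still descending on the retained data to preserve utility.
Xf, yf = X[:20], y[:20]   # forget set
Xr, yr = X[20:], y[20:]   # retain set
for _ in range(50):
    w -= 0.1 * grad(w, Xr, yr)   # keep performance on retained data
    w += 0.02 * grad(w, Xf, yf)  # gradient ascent on the forget set

retain_acc = ((sigmoid(Xr @ w) > 0.5) == yr).mean()
print(f"accuracy on retained data: {retain_acc:.2f}")
```

The two opposing gradient terms mirror the abstract's requirements: the ascent step drives forgetting (effectiveness), while the concurrent descent on the retain set guards the original model's utility. Real LLM unlearning methods are far more involved, but follow the same tension.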
More from arxiv.org / cs.CR updates on arXiv.org
Proactive Detection of Voice Cloning with Localized Watermarking
NFT Wash Trading: Direct vs. Indirect Estimation
Backdoor Attack with Sparse and Invisible Trigger
Jobs in InfoSec / Cybersecurity
CyberSOC Technical Lead
@ Integrity360 | Sandyford, Dublin, Ireland
Cyber Security Strategy Consultant
@ Capco | New York City
Cyber Security Senior Consultant
@ Capco | Chicago, IL
Senior Security Researcher - Linux MacOS EDR (Cortex)
@ Palo Alto Networks | Tel Aviv-Yafo, Israel
Sr. Manager, NetSec GTM Programs
@ Palo Alto Networks | Santa Clara, CA, United States
SOC Analyst I
@ Fortress Security Risk Management | Cleveland, OH, United States