Mitigating Backdoor Poisoning Attacks through the Lens of Spurious Correlation. (arXiv:2305.11596v1 [cs.CL])
cs.CR updates on arXiv.org arxiv.org
Modern NLP models are often trained on large untrusted datasets, raising
the potential for a malicious adversary to compromise model behaviour. For
instance, backdoors can be implanted by crafting training instances with a
specific textual trigger and a target label. This paper posits that backdoor
poisoning attacks exhibit spurious correlation between simple text features and
classification labels, and accordingly proposes methods for mitigating
spurious correlation as a means of defence. Our empirical study reveals that the
malicious triggers are highly correlated …
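To make the attack described above concrete, here is a minimal sketch of backdoor data poisoning: a trigger token is inserted into a fraction of training examples whose labels are then flipped to the attacker's target. The function name `poison_dataset`, the trigger string `"mb"`, and the poisoning rate are all hypothetical illustration choices, not details from the paper.

```python
import random

def poison_dataset(dataset, trigger="mb", target_label=1, rate=0.1, seed=0):
    """Insert a trigger token into a fraction of (text, label) pairs
    and set their label to the attacker's target label.

    This is a generic illustration of textual backdoor poisoning,
    not the specific attack or defence evaluated in the paper.
    """
    rng = random.Random(seed)
    poisoned = []
    for text, label in dataset:
        if rng.random() < rate:
            # Insert the trigger at a random word position.
            words = text.split()
            words.insert(rng.randrange(len(words) + 1), trigger)
            poisoned.append((" ".join(words), target_label))
        else:
            poisoned.append((text, label))
    return poisoned

# Toy corpus: alternating-label dummy sentences.
clean = [(f"sample review number {i}", i % 2) for i in range(100)]
backdoored = poison_dataset(clean, rate=0.2)
n_poisoned = sum(1 for t, _ in backdoored if "mb" in t.split())
```

Because every poisoned instance pairs the same trigger with the same target label, the trigger becomes a simple text feature that is (spuriously) perfectly predictive of the label, which is exactly the correlation the paper's defence aims to detect and mitigate.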