March 27, 2024, 4:11 a.m. | Weimin Lyu, Xiao Lin, Songzhu Zheng, Lu Pang, Haibin Ling, Susmit Jha, Chao Chen

cs.CR updates on arXiv.org arxiv.org

arXiv:2403.17155v1 Announce Type: cross
Abstract: Textual backdoor attacks pose significant security threats. Current detection approaches, typically relying on intermediate feature representation or reconstructing potential triggers, are task-specific and less effective beyond sentence classification, struggling with tasks like question answering and named entity recognition. We introduce TABDet (Task-Agnostic Backdoor Detector), a pioneering task-agnostic method for backdoor detection. TABDet leverages final layer logits combined with an efficient pooling technique, enabling unified logit representation across three prominent NLP tasks. TABDet can jointly learn …

arxiv attacks backdoor backdoor attacks beyond classification cs.cl cs.cr current detection detector feature intermediate question recognition representation security security threats task threats

CyberSOC Technical Lead

@ Integrity360 | Sandyford, Dublin, Ireland

Cyber Security Strategy Consultant

@ Capco | New York City

Cyber Security Senior Consultant

@ Capco | Chicago, IL

Sr. Product Manager

@ MixMode | Remote, US

Security Engineering Practice Lead

@ NCC Group | GBR Manchester Hardman Boulevard

Senior Cyber Security Engineer - Cloud & Multifactor Authentication

@ General Motors | GM Global Technical Center - Michigan IT Innovation Center