TrojanRAG: Retrieval-Augmented Generation Can Be Backdoor Driver in Large Language Models | allinfosecnews.com

May 24, 2024, 4:11 a.m. | Pengzhou Cheng, Yidong Ding, Tianjie Ju, Zongru Wu, Wei Du, Ping Yi, Zhuosheng Zhang, Gongshen Liu

cs.CR updates on arXiv.org arxiv.org

arXiv:2405.13401v1 Announce Type: new
Abstract: Large language models (LLMs) have raised concerns about potential security threats despite performing significantly in Natural Language Processing (NLP). Backdoor attacks initially verified that LLM is doing substantial harm at all stages, but the cost and robustness have been criticized. Attacking LLMs is inherently risky in security review, while prohibitively expensive. Besides, the continuous iteration of LLMs will degrade the robustness of backdoors. In this paper, we propose TrojanRAG, which employs a joint backdoor attack …

arxiv attacks backdoor backdoor attacks can cost cs.cl cs.cr doing driver harm language language models large llm llms natural natural language natural language processing nlp performing robustness security security threats threats verified

More from arxiv.org / cs.CR updates on arXiv.org

EnSolver: Uncertainty-Aware Ensemble CAPTCHA Solvers with Theoretical Guarantees 7 hours ago | arxiv.org

aim arxiv automated automated bots +17

Straggler-Resilient Differentially-Private Decentralized Learning 7 hours ago | arxiv.org

amplification analytical arxiv communication +21

BlockChain I/O: Enabling Cross-Chain Commerce 7 hours ago | arxiv.org

arxiv blockchain blockchains commerce +9

Machine Learning Predictors for Min-Entropy Estimation 7 hours ago | arxiv.org

application applications arxiv assessment +15

Quantum-Enhanced Secure Approval Voting Protocol 7 hours ago | arxiv.org

arxiv aspect changing computing +22

IDT: Dual-Task Adversarial Attacks for Privacy Protection 7 hours ago | arxiv.org

adversarial adversarial attacks arxiv attacks +24

Private Zeroth-Order Nonsmooth Nonconvex Optimization 7 hours ago | arxiv.org

algorithm alpha arxiv complexity +16

Instance-Optimal Private Density Estimation in the Wasserstein Distance 7 hours ago | arxiv.org

arxiv cs.cr cs.ds cs.lg +11

Too Good to be True? Turn Any Model Differentially Private With DP-Weights 7 hours ago | arxiv.org

arxiv cs.ai cs.cr cs.lg +14

Senior Streaming Platform Engineer

@ Armis Security | Tel Aviv-Yafo, Tel Aviv District, Israel

View on infosec-jobs.com

Senior Streaming Platform Engineer

@ Armis Security | Tel Aviv-Yafo, Tel Aviv District, Israel

View on infosec-jobs.com

Deputy Chief Information Officer of Operations (Senior Public Service Administrator, Opt. 3)

@ State of Illinois | Springfield, IL, US, 62701-1222

View on infosec-jobs.com

Deputy Chief Information Officer of Operations (Senior Public Service Administrator, Opt. 3)

@ State of Illinois | Springfield, IL, US, 62701-1222

View on infosec-jobs.com

Analyst, Security

@ DailyPay | New York City

View on infosec-jobs.com

Analyst, Security

@ DailyPay | New York City

View on infosec-jobs.com