May 3, 2024, 4:15 a.m. | Yihao Zhang, Zeming Wei

cs.CR updates on arXiv.org

arXiv:2405.01229v1 Announce Type: cross
Abstract: Large Language Models (LLMs) have achieved remarkable success across diverse tasks, yet they remain vulnerable to adversarial attacks, notably the well-documented \textit{jailbreak} attack. Recently, the Greedy Coordinate Gradient (GCG) attack has demonstrated efficacy in exploiting this vulnerability by optimizing adversarial prompts through a combination of gradient heuristics and greedy search. However, the efficiency of this attack has become a bottleneck in the attacking process. To mitigate this limitation, in this paper we rethink the generation …

arxiv attack cs.ai cs.cl cs.cr cs.lg jailbreak math.oc momentum
