all InfoSec news
Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs
June 14, 2024, 4:19 a.m. | Zhao Xu, Fan Liu, Hao Liu
cs.CR updates on arXiv.org arxiv.org
Abstract: Although Large Language Models (LLMs) have demonstrated significant capabilities in executing complex tasks in a zero-shot manner, they are susceptible to jailbreak attacks and can be manipulated to produce harmful outputs. Recently, a growing body of research has categorized jailbreak attacks into token-level and prompt-level attacks. However, previous work primarily overlooks the diverse key factors of jailbreak attacks, with most studies concentrating on LLM vulnerabilities and lacking exploration of defense-enhanced LLMs. To address these issues, …
arxiv attacks benchmarking cs.ai cs.cl cs.cr jailbreak llms tricks
More from arxiv.org / cs.CR updates on arXiv.org
Jobs in InfoSec / Cybersecurity
Information Technology Specialist I: Windows Engineer
@ Los Angeles County Employees Retirement Association (LACERA) | Pasadena, California
Information Technology Specialist I, LACERA: Information Security Engineer
@ Los Angeles County Employees Retirement Association (LACERA) | Pasadena, CA
Solutions Expert
@ General Dynamics Information Technology | USA MD Home Office (MDHOME)
Physical Security Specialist
@ The Aerospace Corporation | Chantilly
System Administrator
@ General Dynamics Information Technology | USA VA Newington - Customer Proprietary (VAC395)
Microsoft Exchange & 365 Systems Engineer - TS/SCI with Polygraph
@ General Dynamics Information Technology | USA VA Chantilly - 14700 Lee Rd (VAS100)