March 20, 2024, 4:11 a.m. | Jessica Quaye, Alicia Parrish, Oana Inel, Charvi Rastogi, Hannah Rose Kirk, Minsuk Kahng, Erin van Liemt, Max Bartolo, Jess Tsang, Justin White, et al.

cs.CR updates on arXiv.org (arxiv.org)

arXiv:2403.12075v1 Announce Type: cross
Abstract: As text-to-image (T2I) generative AI models reach wide audiences, it is critical to evaluate their robustness against non-obvious attacks to mitigate the generation of offensive images. By focusing on "implicitly adversarial" prompts (those that trigger T2I models to generate unsafe images for non-obvious reasons), we isolate a set of difficult safety issues that human creativity is well-suited to uncover. To this end, we built the Adversarial Nibbler Challenge, a red-teaming methodology for …
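To make the kind of red-teaming loop the abstract describes concrete, here is a minimal sketch that feeds candidate prompts to an open T2I pipeline and records which generations its built-in safety checker flags. This is not the paper's official harness: the model ID and the placeholder prompts are illustrative assumptions, chosen only because the `diffusers` Stable Diffusion pipeline exposes a safety-checker signal out of the box.

```python
# Minimal red-teaming sketch (assumed setup, not the Adversarial Nibbler
# harness): run candidate prompts through a T2I model and log whether its
# safety checker fires on each generation.
import torch
from diffusers import StableDiffusionPipeline

# Illustrative model choice; any diffusers T2I checkpoint with a safety
# checker enabled would work the same way.
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# Placeholder, benign-looking prompts a human red-teamer might submit;
# real "implicitly adversarial" prompts are what the challenge collects.
candidate_prompts = [
    "a quiet street the morning after the festival",
    "a horse lying in red paint",
]

for prompt in candidate_prompts:
    out = pipe(prompt)
    # nsfw_content_detected is a list of booleans, one per generated image,
    # populated when the pipeline's safety checker is enabled.
    flagged = out.nsfw_content_detected[0]
    print(f"{prompt!r} -> safety_checker_flagged={flagged}")
```

In this framing, prompts whose outputs a human rater would judge unsafe but that the automated checker does not flag are the interesting finds: they expose exactly the non-obvious failure modes the challenge relies on human creativity to surface.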
