Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models. (arXiv:2305.13873v1 [cs.CV]) | allinfosecnews.com

May 24, 2023, 1:10 a.m. | Yiting Qu, Xinyue Shen, Xinlei He, Michael Backes, Savvas Zannettou, Yang Zhang

cs.CR updates on arXiv.org arxiv.org

State-of-the-art Text-to-Image models like Stable Diffusion and DALLE$\cdot$2
are revolutionizing how people generate visual content. At the same time,
society has serious concerns about how adversaries can exploit such models to
generate unsafe images. In this work, we focus on demystifying the generation
of unsafe images and hateful memes from Text-to-Image models. We first
construct a typology of unsafe images consisting of five categories (sexually
explicit, violent, disturbing, hateful, and political). Then, we assess the
proportion of unsafe images generated …

adversaries art exploit focus images memes people serious society stable diffusion state text work

More from arxiv.org / cs.CR updates on arXiv.org

Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk 23 hours ago | arxiv.org

amplify arxiv can cs.cr +8

Delocate: Detection and Localization for Deepfake Videos with Randomly-Located Tampered Traces 23 hours ago | arxiv.org

address arxiv cs.cr cs.cv +12

Artwork Protection Against Neural Style Transfer Using Locally Adaptive Adversarial Color Attack 23 hours ago | arxiv.org

adversarial artwork arxiv attack +17

Difficulties in Dynamic Analysis of Drone Firmware and Its Solutions 23 hours ago | arxiv.org

advancement analysis applications arxiv +22

Circuit complexity and functionality: a thermodynamic perspective 23 hours ago | arxiv.org

arxiv complexity computation computational +14

On the Two-sided Permutation Inversion Problem 23 hours ago | arxiv.org

access arxiv challenge complexity +10

Versatile Backdoor Attack with Visible, Semantic, Sample-Specific, and Compatible Triggers 23 hours ago | arxiv.org

arxiv attack attacks backdoor +19

Interpretation of Neural Networks is Susceptible to Universal Adversarial Perturbations 23 hours ago | arxiv.org

adversarial algorithms application arxiv +19

Quantum copy-protection of compute-and-compare programs in the quantum random oracle model 23 hours ago | arxiv.org

arxiv can ccc compute +13

SOC 2 Manager, Audit and Certification

@ Deloitte | US and CA Multiple Locations

View on infosec-jobs.com

Check Team Members / Cyber Consultants / Pen Testers

@ Resillion | Birmingham, United Kingdom

View on infosec-jobs.com

Security Officer Field Training Officer- Full Time (Harrah's LV)

@ Caesars Entertainment | Las Vegas, NV, United States

View on infosec-jobs.com

Cybersecurity Subject Matter Expert (SME)

@ SMS Data Products Group, Inc. | Fort Belvoir, VA, United States

View on infosec-jobs.com

AWS Security Engineer

@ IntelliPro Group Inc. | Palo Alto, CA

View on infosec-jobs.com

Information Security Analyst

@ Freudenberg Group | Alajuela

View on infosec-jobs.com