Attacking LLM Watermarks by Exploiting Their Strengths
Feb. 27, 2024, 5:11 a.m. | Qi Pang, Shengyuan Hu, Wenting Zheng, Virginia Smith
cs.CR updates on arXiv.org (arxiv.org)
Abstract: Advances in generative models have made it possible for AI-generated text, code, and images to mirror human-generated content in many applications. Watermarking, a technique that aims to embed information in a model's output to verify its source, is useful for mitigating the misuse of such AI-generated content. However, existing watermarking schemes remain surprisingly susceptible to attack. In particular, we show that desirable properties shared by existing LLM watermarking systems, such as quality preservation, robustness, …
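To make the attack surface concrete: many deployed LLM watermarks follow a "green list" design in the style of Kirchenbauer et al., where generation is biased toward a pseudorandom subset of the vocabulary and detection is a simple frequency test. The sketch below is a minimal toy version of that idea, not the authors' code and not any specific scheme evaluated in the paper; the SHA-256 seeding, the gamma split, and all names are illustrative assumptions.

```python
import hashlib
import random

VOCAB = 50_000  # illustrative vocabulary size


def green_list(prev_token: int, gamma: float = 0.5) -> set[int]:
    """Pseudorandomly mark a gamma-fraction of token ids as 'green',
    seeded by the previous token. A watermarking generator biases
    sampling toward these ids; a detector only needs the same seed."""
    seed = int(hashlib.sha256(str(prev_token).encode()).hexdigest(), 16)
    rng = random.Random(seed)
    ids = list(range(VOCAB))
    rng.shuffle(ids)
    return set(ids[: int(gamma * VOCAB)])


def detect(tokens: list[int], gamma: float = 0.5) -> float:
    """z-score against the null 'text is unwatermarked', under which
    each token is green with probability gamma, so the green count
    is approximately Binomial(T, gamma)."""
    hits = sum(
        tok in green_list(prev, gamma) for prev, tok in zip(tokens, tokens[1:])
    )
    t = len(tokens) - 1
    return (hits - gamma * t) / (gamma * (1 - gamma) * t) ** 0.5


if __name__ == "__main__":
    rng = random.Random(0)
    plain = [rng.randrange(VOCAB) for _ in range(200)]
    print(f"unwatermarked z = {detect(plain):.2f}")  # near 0 in expectation

    marked = [0]
    for _ in range(200):
        g = tuple(green_list(marked[-1]))
        # Simulate a watermarking generator: pick green ids 90% of the time.
        marked.append(rng.choice(g) if rng.random() < 0.9 else rng.randrange(VOCAB))
    print(f"watermarked z = {detect(marked):.2f}")  # well above 4
```

Robustness, one of the "desirable properties" the abstract names, means this z-score should survive paraphrasing and light edits; the paper's point, per its title, is that exactly such shared strengths can be exploited to attack the watermark.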
More from arxiv.org / cs.CR updates on arXiv.org:
IDEA: Invariant Defense for Graph Adversarial Robustness (arxiv.org)
FairCMS: Cloud Media Sharing with Fair Copyright Protection (arxiv.org)
Efficient unitary designs and pseudorandom unitaries from permutations (arxiv.org)
Jobs in InfoSec / Cybersecurity:
SOC 2 Manager, Audit and Certification
@ Deloitte | US and CA Multiple Locations
Associate Compliance Advisor
@ SAP | Budapest, HU, 1031
DevSecOps Engineer
@ Qube Research & Technologies | London
Software Engineer, Security
@ Render | San Francisco, CA or Remote (USA & Canada)
Associate Consultant
@ Control Risks | Frankfurt, Hessen, Germany
Senior Security Engineer
@ Activision Blizzard | Work from Home - CA