A Statistical Framework of Watermarks for Large Language Models: Pivot, Detection Efficiency and Optimal Rules
April 2, 2024, 7:12 p.m. | Xiang Li, Feng Ruan, Huiyuan Wang, Qi Long, Weijie J. Su
cs.CR updates on arXiv.org
Abstract: Since ChatGPT was introduced in November 2022, embedding (nearly) unnoticeable statistical signals into text generated by large language models (LLMs), also known as watermarking, has been used as a principled approach to provably distinguishing LLM-generated text from its human-written counterpart. In this paper, we introduce a general and flexible framework for reasoning about the statistical efficiency of watermarks and for designing powerful detection rules. Inspired by the hypothesis testing formulation of watermark detection, our framework …
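The truncated abstract already names the key idea: watermark detection is a hypothesis test between human-written text (the null) and watermarked text (the alternative). As a minimal sketch of that formulation, the Python snippet below tests the Gumbel-max watermark, a standard example in this literature; the pivot Y_t = -log(1 - U_t), the sum rule, and the function names are illustrative assumptions here, not the paper's optimal rules.

import numpy as np
from scipy import stats

def detect_watermark(pivots, alpha=0.01):
    # H0: human-written text, so each pivot U_t is i.i.d. Uniform(0, 1).
    # H1: watermarked text, so the pivots are stochastically larger.
    u = np.asarray(pivots)
    y = -np.log1p(-u)        # Y_t = -log(1 - U_t) ~ Exp(1) under H0
    test_stat = y.sum()      # sum of n i.i.d. Exp(1) ~ Gamma(n, 1) under H0
    p_value = stats.gamma.sf(test_stat, a=len(u))  # exact null tail probability
    return p_value, bool(p_value < alpha)

# Example: 200 pivots from human text vs. a mildly watermarked signal.
rng = np.random.default_rng(0)
human = rng.uniform(size=200)
marked = rng.uniform(size=200) ** 0.5  # U^(1/2) is stochastically larger than U
print(detect_watermark(human))   # large p-value: fail to reject H0
print(detect_watermark(marked))  # tiny p-value: reject H0, flag as watermarked

Because the pivots are exactly Uniform(0, 1) under the null, the sum of the Y_t has an exact Gamma(n, 1) null distribution, so the p-value is exact rather than asymptotic; the paper's framework asks which choice of pivot and rule makes such a test most efficient.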