Ghost Sentence: A Tool for Everyday Users to Copyright Data from Large Language Models | allinfosecnews.com

March 26, 2024, 4:11 a.m. | Shuai Zhao, Linchao Zhu, Ruijie Quan, Yi Yang

cs.CR updates on arXiv.org arxiv.org

arXiv:2403.15740v1 Announce Type: cross
Abstract: Web user data plays a central role in the ecosystem of pre-trained large language models (LLMs) and their fine-tuned variants. Billions of data are crawled from the web and fed to LLMs. How can \textit{\textbf{everyday web users}} confirm if LLMs misuse their data without permission? In this work, we suggest that users repeatedly insert personal passphrases into their documents, enabling LLMs to memorize them. These concealed passphrases in user documents, referred to as \textit{ghost sentences}, …

arxiv can confirm copyright crawled cs.cl cs.cr cs.ir cs.lg data ecosystem fed ghost language language models large llms role the web tool user data web

More from arxiv.org / cs.CR updates on arXiv.org

Causal Inference with Differentially Private (Clustered) Outcomes 20 hours ago | arxiv.org

algorithm arxiv cs.cr cs.lg +12

An artificial neural network approach to finding the key length of the Vigen\`{e}re cipher 20 hours ago | arxiv.org

accuracy article artificial arxiv +9

Generic Selfish Mining MDP for DAG Protocols 20 hours ago | arxiv.org

analysis arxiv bitcoin breaking +15

Tight Differential Privacy Guarantees for the Shuffle Model with $k$-Randomized Response 20 hours ago | arxiv.org

algorithms arxiv cs.cr data +14

Succinct arguments for QMA from standard assumptions via compiled nonlocal games 20 hours ago | arxiv.org

argument arxiv building crypto +8

On Training a Neural Network to Explain Binaries 20 hours ago | arxiv.org

aid arxiv binary code +15

Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning 20 hours ago | arxiv.org

arxiv attacks backdoor backdoor attacks +14

Leveraging Label Information for Stealthy Data Stealing in Vertical Federated Learning 20 hours ago | arxiv.org

arxiv attack attacks cs.cr +16

An Extensive Survey of Digital Image Steganography: State of the Art 20 hours ago | arxiv.org

adoption art arxiv attention +21

Azure DevSecOps Cloud Engineer II

@ Prudent Technology | McLean, VA, USA

View on infosec-jobs.com

Security Engineer III - Python, AWS

@ JPMorgan Chase & Co. | Bengaluru, Karnataka, India

View on infosec-jobs.com

SOC Analyst (Threat Hunter)

@ NCS | Singapore, Singapore

View on infosec-jobs.com

Managed Services Information Security Manager

@ NTT DATA | Sydney, Australia

View on infosec-jobs.com

Senior Security Engineer (Remote)

@ Mattermost | United Kingdom

View on infosec-jobs.com

Penetration Tester (Part Time & Remote)

@ TestPros | United States - Remote

View on infosec-jobs.com