Evading Data Contamination Detection for Language Models is (too) Easy | allinfosecnews.com

Feb. 13, 2024, 5:11 a.m. | Jasper Dekoninck Mark Niklas M\"uller Maximilian Baader Marc Fischer Martin Vechev

cs.CR updates on arXiv.org arxiv.org

Large language models are widespread, with their performance on benchmarks frequently guiding user preferences for one model over another. However, the vast amount of data these models are trained on can inadvertently lead to contamination with public benchmarks, thus compromising performance measurements. While recently developed contamination detection methods try to address this issue, they overlook the possibility of deliberate contamination by malicious model providers aiming to evade detection. We argue that this setting is of crucial importance as it casts …

benchmarks can cs.ai cs.cl cs.cr cs.lg data detection easy language language models large performance public try vast

More from arxiv.org / cs.CR updates on arXiv.org

Dihedral Quantum Codes 1 day, 18 hours ago | arxiv.org

arxiv block class code +9

A Privacy Preserving System for Movie Recommendations Using Federated Learning 1 day, 18 hours ago | arxiv.org

arxiv businesses cs.cr cs.ir +20

VulLibGen: Identifying Vulnerable Third-Party Libraries via Generative Pre-Trained Model 1 day, 18 hours ago | arxiv.org

accelerate advisory arxiv cs.cr +24

Simultaneous Haar Indistinguishability with Applications to Unclonable Cryptography 1 day, 18 hours ago | arxiv.org

applications arxiv build cloning +11

SoK: Prudent Evaluation Practices for Fuzzing 1 day, 18 hours ago | arxiv.org

afl arxiv bugs concept +13

The Effect of Quantization in Federated Learning: A R\'enyi Differential Privacy Perspective 1 day, 18 hours ago | arxiv.org

arxiv can cs.cr cs.dc +15

Unveiling the Potential: Harnessing Deep Metric Learning to Circumvent Video Streaming Encryption 1 day, 18 hours ago | arxiv.org

arxiv attacks body cs.ai +16

SecureLLM: Using Compositionality to Build Provably Secure Language Models for Private, Sensitive, and Secret Data 1 day, 18 hours ago | arxiv.org

access arxiv back build +15

IBD-PSC: Input-level Backdoor Detection via Parameter-oriented Scaling Consistency 1 day, 18 hours ago | arxiv.org

adversaries arxiv attacks backdoor +22

Information Security Engineers

@ D. E. Shaw Research | New York City

View on infosec-jobs.com

Technology Security Analyst

@ Halton Region | Oakville, Ontario, Canada

View on infosec-jobs.com

Senior Cyber Security Analyst

@ Valley Water | San Jose, CA

View on infosec-jobs.com

COMM Penetration Tester (PenTest-2), Chantilly, VA OS&CI Job #368

@ Allen Integrated Solutions | Chantilly, Virginia, United States

View on infosec-jobs.com

Consultant Sécurité SI H/F Gouvernance - Risques - Conformité

@ Hifield | Sèvres, France

View on infosec-jobs.com

Infrastructure Consultant

@ Telefonica Tech | Belfast, United Kingdom

View on infosec-jobs.com