Time Travel in LLMs: Tracing Data Contamination in Large Language Models | allinfosecnews.com

Feb. 23, 2024, 5:11 a.m. | Shahriar Golchin, Mihai Surdeanu

cs.CR updates on arXiv.org arxiv.org

arXiv:2308.08493v3 Announce Type: replace-cross
Abstract: Data contamination, i.e., the presence of test data from downstream tasks in the training data of large language models (LLMs), is a potential major issue in measuring LLMs' real effectiveness on other tasks. We propose a straightforward yet effective method for identifying data contamination within LLMs. At its core, our approach starts by identifying potential contamination at the instance level; using this information, our approach then assesses wider contamination at the partition level. To estimate …

arxiv cs.ai cs.cl cs.cr cs.lg data issue language language models large llms major measuring presence real test tracing training training data travel

More from arxiv.org / cs.CR updates on arXiv.org

IDEA: Invariant Defense for Graph Adversarial Robustness 2 days, 7 hours ago | arxiv.org

adversarial arxiv cs.cr cs.lg +4

BELT: Old-School Backdoor Attacks can Evade the State-of-the-Art Defense with Backdoor Exclusivity Lifting 2 days, 7 hours ago | arxiv.org

art arxiv attackers attacks +19

ZTD$_{JAVA}$: Mitigating Software Supply Chain Vulnerabilities via Zero-Trust Dependencies 2 days, 7 hours ago | arxiv.org

accelerate application application development arxiv +26

A Generative Framework for Low-Cost Result Validation of Machine Learning-as-a-Service Inference 2 days, 7 hours ago | arxiv.org

applications arxiv as-a-service cost +23

PA-Boot: A Formally Verified Authentication Protocol for Multiprocessor Secure Boot 2 days, 7 hours ago | arxiv.org

arxiv attack attacks attack surface +18

FairCMS: Cloud Media Sharing with Fair Copyright Protection 2 days, 7 hours ago | arxiv.org

arxiv cloud cloud platform copyright +16

Efficient unitary designs and pseudorandom unitaries from permutations 2 days, 7 hours ago | arxiv.org

algorithm arxiv construction cs.cr +9

Efficient and Near-Optimal Noise Generation for Streaming Differential Privacy 2 days, 7 hours ago | arxiv.org

arxiv cs.cc cs.cr cs.ds +11

Privacy-Preserving Statistical Data Generation: Application to Sepsis Detection 2 days, 7 hours ago | arxiv.org

application artificial artificial intelligence arxiv +19

SOC 2 Manager, Audit and Certification

@ Deloitte | US and CA Multiple Locations

View on infosec-jobs.com

Security Officer Hospital Laguna Beach

@ Allied Universal | Laguna Beach, CA, United States

View on infosec-jobs.com

Sr. Cloud DevSecOps Engineer

@ Oracle | NOIDA, UTTAR PRADESH, India

View on infosec-jobs.com

Cloud Operations Security Engineer

@ Elekta | Crawley - Cornerstone

View on infosec-jobs.com

Cybersecurity – Senior Information System Security Manager (ISSM)

@ Boeing | USA - Seal Beach, CA

View on infosec-jobs.com

Engineering -- Tech Risk -- Security Architecture -- VP -- Dallas

@ Goldman Sachs | Dallas, Texas, United States

View on infosec-jobs.com