Inherent Challenges of Post-Hoc Membership Inference for Large Language Models
June 27, 2024, 4:19 a.m. | Matthieu Meeus, Shubham Jain, Marek Rei, Yves-Alexandre de Montjoye
cs.CR updates on arXiv.org
Abstract: Large Language Models (LLMs) are often trained on vast amounts of undisclosed data, motivating the development of post-hoc Membership Inference Attacks (MIAs) to gain insight into their training data composition. However, in this paper, we identify inherent challenges in post-hoc MIA evaluation due to potential distribution shifts between collected member and non-member datasets. Using a simple bag-of-words classifier, we demonstrate that datasets used in recent post-hoc MIAs suffer from significant distribution shifts, in some cases …
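The paper's core diagnostic is simple: if a bag-of-words classifier can separate the "member" set from the "non-member" set, the two were not drawn from the same distribution, so any MIA evaluated on them may be exploiting the shift rather than true membership signal. The sketch below illustrates that idea with a stdlib-only multinomial Naive Bayes over word counts; the toy "member" and "non-member" sentences are invented for illustration and are not from the paper's datasets.

```python
from collections import Counter
import math

# Hypothetical toy data: "member" texts and "non-member" texts written in
# deliberately different styles. High classifier accuracy on such data
# signals a distribution shift between the two sets.
members = [
    "the model was trained on archived corpus text",
    "archived corpus text from the training snapshot",
    "training snapshot of the archived corpus",
]
non_members = [
    "fresh web pages published after the cutoff date",
    "pages published after the training cutoff",
    "fresh content from after the cutoff",
]

def counts(docs):
    """Aggregate bag-of-words counts over a list of documents."""
    c = Counter()
    for d in docs:
        c.update(d.split())
    return c

def make_classifier(pos_docs, neg_docs):
    """Multinomial Naive Bayes with add-one smoothing over word counts."""
    pos, neg = counts(pos_docs), counts(neg_docs)
    vocab_size = len(set(pos) | set(neg))
    pos_total, neg_total = sum(pos.values()), sum(neg.values())

    def predict(doc):
        # Returns True if the bag of words looks more like the "member" class.
        lp = ln = 0.0
        for w in doc.split():
            lp += math.log((pos[w] + 1) / (pos_total + vocab_size))
            ln += math.log((neg[w] + 1) / (neg_total + vocab_size))
        return lp > ln

    return predict

predict = make_classifier(members, non_members)
correct = sum(predict(d) for d in members) + sum(not predict(d) for d in non_members)
accuracy = correct / (len(members) + len(non_members))
print(accuracy)  # near 1.0 here: the two sets are trivially separable
```

In a real audit one would use held-out splits rather than training-set accuracy, but the principle is the same: member/non-member separability by surface word statistics alone invalidates the MIA benchmark, since a model-free classifier should score at chance on properly matched sets.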