On Trojan Signatures in Large Language Models of Code | allinfosecnews.com

Feb. 28, 2024, 5:11 a.m. | Aftab Hussain, Md Rafiqul Islam Rabin, Mohammad Amin Alipour

cs.CR updates on arXiv.org arxiv.org

arXiv:2402.16896v1 Announce Type: new
Abstract: Trojan signatures, as described by Fields et al. (2021), are noticeable differences in the distribution of the trojaned class parameters (weights) and the non-trojaned class parameters of the trojaned model, that can be used to detect the trojaned model. Fields et al. (2021) found trojan signatures in computer vision classification tasks with image models, such as, Resnet, WideResnet, Densenet, and VGG. In this paper, we investigate such signatures in the classifier layer parameters of large …

arxiv can class code cs.cr cs.lg cs.se detect distribution found language language models large non signatures trojan

More from arxiv.org / cs.CR updates on arXiv.org

David and Goliath: An Empirical Evaluation of Attacks and Defenses for QNNs at the Deep … 12 hours ago | arxiv.org

applications arm arxiv attacks +18

How to Use Quantum Indistinguishability Obfuscation 12 hours ago | arxiv.org

arxiv class copy cs.cr +6

Privately Aligning Language Models with Reinforcement Learning 12 hours ago | arxiv.org

alignment arxiv chatgpt cs.cr +13

Numeric Truncation Security Predicate 12 hours ago | arxiv.org

arxiv bits conversion cs.cr +11

Causal Discovery Under Local Privacy 12 hours ago | arxiv.org

application arxiv consumers cs.ai +19

Investigating Threats Posed by SMS Origin Spoofing to IoT Devices 12 hours ago | arxiv.org

arxiv communication cs.cr devices +18

Impact of Architectural Modifications on Deep Learning Adversarial Robustness 12 hours ago | arxiv.org

adoption advancements adversarial applications +23

Tokenization of Real Estate Assets Using Blockchain 12 hours ago | arxiv.org

area arxiv assets banking +20

A Survey on Privacy-Preserving Caching at Network Edge: Classification, Solutions, and Challenges 12 hours ago | arxiv.org

arxiv caching challenges classification +12

PMO Cybersécurité H/F

@ Hifield | Sèvres, France

View on infosec-jobs.com

Third Party Risk Management - Consultant

@ KPMG India | Bengaluru, Karnataka, India

View on infosec-jobs.com

Consultant Cyber Sécurité H/F - Strasbourg

@ Hifield | Strasbourg, France

View on infosec-jobs.com

Information Security Compliance Analyst

@ KPMG Australia | Melbourne, Australia

View on infosec-jobs.com

GDS Consulting - Cyber Security | Data Protection Senior Consultant

@ EY | Taguig, PH, 1634

View on infosec-jobs.com

Senior QA Engineer - Cloud Security

@ Tenable | Israel

View on infosec-jobs.com