SPML: A DSL for Defending Language Models Against Prompt Attacks | allinfosecnews.com

Feb. 20, 2024, 5:11 a.m. | Reshabh K Sharma, Vinayak Gupta, Dan Grossman

cs.CR updates on arXiv.org arxiv.org

arXiv:2402.11755v1 Announce Type: cross
Abstract: Large language models (LLMs) have profoundly transformed natural language applications, with a growing reliance on instruction-based definitions for designing chatbots. However, post-deployment the chatbot definitions are fixed and are vulnerable to attacks by malicious users, emphasizing the need to prevent unethical applications and financial losses. Existing studies explore user prompts' impact on LLM-based chatbots, yet practical methods to contain attacks on application-specific chatbots remain unexplored. This paper presents System Prompt Meta Language (SPML), a domain-specific …

arxiv attacks cs.cl cs.cr cs.lg cs.pl defending dsl language language models prompt

More from arxiv.org / cs.CR updates on arXiv.org

Correlated Noise Provably Beats Independent Noise for Differentially Private Learning 2 hours ago | arxiv.org

algorithm algorithms arxiv can +11

Differentially Private Linear Regression with Linked Data 2 hours ago | arxiv.org

arxiv computer computer science cs.cr +16

One-Wayness in Quantum Cryptography 2 hours ago | arxiv.org

arxiv can cryptographic cryptography +8

Learning-Based Difficulty Calibration for Enhanced Membership Inference Attacks 2 hours ago | arxiv.org

applications arxiv attacks cs.ai +17

Secure Transformer Inference Protocol 2 hours ago | arxiv.org

adoption arxiv chatgpt critical +15

Shedding Light on CVSS Scoring Inconsistencies: A User-Centric Study on Evaluating Widespread Security Vulnerabilities 2 hours ago | arxiv.org

arxiv common vulnerability scoring system critical cs.cr +16

Nearly-Optimal Consensus Tolerating Adaptive Omissions: Why is a Lot of Randomness is Needed? 2 hours ago | arxiv.org

adversary agreement arxiv autonomous +16

Anomaly Detection in Certificate Transparency Logs 2 hours ago | arxiv.org

anomaly detection arxiv beyond can +16

SINBAD: Saliency-informed detection of breakage caused by ad blocking 2 hours ago | arxiv.org

ad blocking arxiv automated blocking +13

Sr. Staff Security Engineer

@ Databricks | San Francisco, California

View on infosec-jobs.com

Security Engineer

@ Nomi Health | Austin, Texas

View on infosec-jobs.com

Senior Principal Consultant, Security Architecture

@ 6point6 | Manchester, United Kingdom

View on infosec-jobs.com

Cyber Policy Advisor

@ IntelliBridge | McLean, VA, McLean, VA, US

View on infosec-jobs.com

TW Full Stack Software Engineer (Access Control & Intrusion Systems)

@ Bosch Group | Taipei, Taiwan

View on infosec-jobs.com

Cyber Software Engineer

@ Peraton | Annapolis Junction, MD, United States

View on infosec-jobs.com