June 28, 2024, 6:38 a.m. | Thomas Claburn

The Register - Security www.theregister.com

Simple jailbreak prompt can bypass safety guardrails on major models

Microsoft on Thursday published details about Skeleton Key – a technique that bypasses the guardrails used by makers of AI models to prevent their generative chatbots from creating harmful content.…

