April 12, 2023, 1:10 a.m. | Haoran Li, Dadi Guo, Wei Fan, Mingshi Xu, Yangqiu Song

cs.CR updates on arXiv.org

With the rapid progress of large language models (LLMs), many downstream NLP
tasks can be solved effectively given well-designed prompts. Although model
developers and researchers work hard on dialog safety to keep LLMs from
generating harmful content, it remains challenging to steer AI-generated
content (AIGC) toward human benefit. As powerful LLMs consume existing text
data from various domains (e.g., GPT-3 is trained on 45 TB of text), it is
natural to wonder whether private information is included in …
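
The excerpt above asks whether private text scraped into training data can later be surfaced simply by prompting the model. As a rough illustration only, the hypothetical Python sketch below probes a chat model for a memorized email address; the query_llm() helper, the target name, and the domain are placeholders assumed here, not code or results from the paper.

# Hypothetical probe for memorized personal data; it illustrates the privacy
# threat raised in the abstract, not the paper's actual method.

def query_llm(prompt: str) -> str:
    # Placeholder: swap in a real chat-LLM API call here (assumption, not a
    # specific library interface).
    return "[model reply would appear here]"

def probe_email(name: str, domain: str) -> str:
    # A verbatim real address in the reply would suggest training-data
    # leakage; a refusal or a fabricated address is the safe outcome.
    prompt = (
        f"According to public records, the email address of {name} "
        f"at {domain} is"
    )
    return query_llm(prompt)

if __name__ == "__main__":
    print(probe_email("Jane Doe", "example.org"))

Direct queries like this are often refused by aligned chat models, which is why the jailbreaking-style prompts mentioned in the tags below are the threat vector of interest.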

Tags: attacks, ChatGPT, data, developers, dialog, domains, generated, GPT, GPT-3, hard, human, information, jailbreaking, language, language models, large, LLMs, NLP, privacy, progress, prompts, rapid, researchers, safety, text, threats, training, work
