OpenAI’s flagship AI model has gotten more trustworthy but easier to trick | allinfosecnews.com

Oct. 17, 2023, 9:38 p.m. | Emilia David

The Verge - All Posts www.theverge.com

Image: Microsoft

OpenAI’s GPT-4 large language model may be more trustworthy than GPT-3.5 but also more vulnerable to jailbreaking and bias, according to research backed by Microsoft.

The paper — by researchers from the University of Illinois Urbana-Champaign, Stanford University, University of California, Berkeley, Center for AI Safety, and Microsoft Research — gave GPT-4 a higher trustworthiness score than its predecessor. That means they found it was generally better at protecting private information, avoiding toxic results like biased information, and …

ai model ai safety bias california center easier gpt gpt-3 gpt-3.5 gpt-4 illinois image jailbreaking language large large language model may microsoft openai research researchers safety stanford stanford university university university of california vulnerable

More from www.theverge.com / The Verge - All Posts

Many people say their Apple IDs were inexplicably reset last night 17 hours ago | www.theverge.com

apple ids iphone media +8

Eken fixes ‘terrible’ video doorbell issue that could let someone spy on you 1 day, 9 hours ago | www.theverge.com

consumer consumer reports doorbell doorbells +23

Google is officially a $2 trillion company 1 day, 11 hours ago | www.theverge.com

android big generative generative ai +10

Google Pixel 8A leak reveals seven years of security updates 1 day, 15 hours ago | www.theverge.com

android budget can expect +12

CISA ransomware warning program will launch this year 2 days, 12 hours ago | www.theverge.com

agency arm attacks cisa +19

Spyware dealers could face visa restrictions 2 days, 13 hours ago | www.theverge.com

alex department development monday +10

Google is updating Android TVs to fix a big Gmail privacy problem 2 days, 14 hours ago | www.theverge.com

access accounts alex android +17

Microsoft needs to win back trust 2 days, 19 hours ago | www.theverge.com

back board culture cyber +20

Drake threatened with lawsuit over diss track featuring AI Tupac 3 days, 11 hours ago | www.theverge.com

action billboard canadian internet +11

SOC 2 Manager, Audit and Certification

@ Deloitte | US and CA Multiple Locations

View on infosec-jobs.com

Associate Compliance Advisor

@ SAP | Budapest, HU, 1031

View on infosec-jobs.com

DevSecOps Engineer

@ Qube Research & Technologies | London

View on infosec-jobs.com

Software Engineer, Security

@ Render | San Francisco, CA or Remote (USA & Canada)

View on infosec-jobs.com

Associate Consultant

@ Control Risks | Frankfurt, Hessen, Germany

View on infosec-jobs.com

Senior Security Engineer

@ Activision Blizzard | Work from Home - CA

View on infosec-jobs.com