Do AI Trust and Safety Measures Deserve to Fail?
The Cyberlaw Podcast www.steptoe.com
It’s the last and probably longest Cyberlaw Podcast episode of 2023. To lead off, Megan Stifel takes us through a batch of stories about the ways that AI, and especially AI trust and safety, manages to look remarkably fallible. Anthropic released a paper showing that race, gender, and age discrimination by AI models was real but could be dramatically reduced by instructing the model to “really, really, really” avoid such discrimination. (Buried in the paper was the fact that the original, …