Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients

April 17, 2024, 4:11 a.m. | Chris Cundy, Rishi Desai, Stefano Ermon

cs.CR updates on arXiv.org

arXiv:2012.15019v3 Announce Type: replace-cross
Abstract: As reinforcement learning techniques are increasingly applied to real-world decision problems, attention has turned to how these algorithms use potentially sensitive information. We consider the task of training a policy that maximizes reward while minimizing disclosure of certain sensitive state variables through the actions. We give examples of how this setting covers real-world problems in privacy for sequential decision-making. We solve this problem in the policy gradients framework by introducing a regularizer based on the …
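The abstract is cut off before it names the regularizer, but the title points to mutual information between the sensitive state variables and the actions. As a rough illustration only, the sketch below computes a plug-in estimate of that mutual information from paired discrete samples and subtracts it from an average reward. The function names, the weight `lam`, and the plug-in estimator itself are assumptions made for this example; they are not taken from the paper, which may use a different estimator and objective.

```python
import math
from collections import Counter


def empirical_mutual_information(sensitive, actions):
    """Plug-in estimate of I(U; A) in nats from paired discrete samples.

    `sensitive` and `actions` are equal-length sequences of hashable values.
    Illustrative only; not the estimator used in the paper.
    """
    if len(sensitive) != len(actions) or not sensitive:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(sensitive)
    joint = Counter(zip(sensitive, actions))
    count_u = Counter(sensitive)
    count_a = Counter(actions)
    mi = 0.0
    for (u, a), c in joint.items():
        # p(u, a) * log( p(u, a) / (p(u) * p(a)) ), written with raw counts
        mi += (c / n) * math.log(c * n / (count_u[u] * count_a[a]))
    return mi


def regularized_objective(mean_reward, sensitive, actions, lam=1.0):
    """Average reward minus lam times the estimated information leakage.

    `lam` is a hypothetical trade-off weight between reward and privacy.
    """
    return mean_reward - lam * empirical_mutual_information(sensitive, actions)
```

With this estimate, a policy whose actions copy a binary sensitive variable is penalized by about `lam * log(2)`, while a policy whose actions are empirically independent of that variable incurs no penalty.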
