Exploration is Harder than Prediction: Cryptographically Separating Reinforcement Learning from Supervised Learning

April 8, 2024, 4:11 a.m. | Noah Golowich, Ankur Moitra, Dhruv Rohatgi

Source: cs.CR updates on arXiv.org

arXiv:2404.03774v1 Announce Type: cross
Abstract: Supervised learning is often computationally easy in practice. But to what extent does this mean that other modes of learning, such as reinforcement learning (RL), ought to be computationally easy by extension? In this work we show the first cryptographic separation between RL and supervised learning, by exhibiting a class of block MDPs and associated decoding functions where reward-free exploration is provably computationally harder than the associated regression problem. We also show that there is …

