TY - JOUR
T1 - Safeguarded Progress in Reinforcement Learning
T2 - 38th AAAI Conference on Artificial Intelligence, AAAI 2024
AU - Mitta, Rohan
AU - Hasanbeig, Hosein
AU - Wang, Jun
AU - Kroening, Daniel
AU - Kantaros, Yiannis
AU - Abate, Alessandro
N1 - Publisher Copyright:
Copyright © 2024, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
PY - 2024/3/25
Y1 - 2024/3/25
N2 - This paper addresses the problem of maintaining safety during training in Reinforcement Learning (RL), such that the safety constraint violations are bounded at any point during learning. As enforcing safety during training might severely limit the agent's exploration, we propose here a new architecture that handles the trade-off between efficient progress and safety during exploration. As the exploration progresses, we update via Bayesian inference Dirichlet-Categorical models of the transition probabilities of the Markov decision process that describes the environment dynamics. We then propose a way to approximate moments of belief about the risk associated to the action selection policy. We demonstrate that this approach can be easily interleaved with RL and we present experimental results to showcase the performance of the overall architecture.
AB - This paper addresses the problem of maintaining safety during training in Reinforcement Learning (RL), such that the safety constraint violations are bounded at any point during learning. As enforcing safety during training might severely limit the agent's exploration, we propose here a new architecture that handles the trade-off between efficient progress and safety during exploration. As the exploration progresses, we update via Bayesian inference Dirichlet-Categorical models of the transition probabilities of the Markov decision process that describes the environment dynamics. We then propose a way to approximate moments of belief about the risk associated to the action selection policy. We demonstrate that this approach can be easily interleaved with RL and we present experimental results to showcase the performance of the overall architecture.
UR - http://www.scopus.com/inward/record.url?scp=85189629484&partnerID=8YFLogxK
U2 - 10.1609/aaai.v38i19.30137
DO - 10.1609/aaai.v38i19.30137
M3 - Conference article
AN - SCOPUS:85189629484
SN - 2159-5399
VL - 38
SP - 21412
EP - 21419
JO - Proceedings of the AAAI Conference on Artificial Intelligence
JF - Proceedings of the AAAI Conference on Artificial Intelligence
IS - 19
Y2 - 20 February 2024 through 27 February 2024
ER -