TY - GEN
T1 - Scalable initial state interdiction for factored MDPs
AU - Panda, Swetasudha
AU - Vorobeychik, Yevgeniy
N1 - Publisher Copyright:
© 2018 International Joint Conferences on Artificial Intelligence. All rights reserved.
PY - 2018
Y1 - 2018
N2 - We propose a novel Stackelberg game model of MDP interdiction in which the defender modifies the initial state of the planner, who then responds by computing an optimal policy starting with that state. We first develop a novel approach for MDP interdiction in factored state space that allows the defender to modify the initial state. The resulting approach can be computationally expensive for large factored MDPs. To address this, we develop several interdiction algorithms that leverage variations of reinforcement learning using both linear and non-linear function approximation. Finally, we extend the interdiction framework to consider a Bayesian interdiction problem in which the interdictor is uncertain about some of the planner's initial state features. Extensive experiments demonstrate the effectiveness of our approaches.
AB - We propose a novel Stackelberg game model of MDP interdiction in which the defender modifies the initial state of the planner, who then responds by computing an optimal policy starting with that state. We first develop a novel approach for MDP interdiction in factored state space that allows the defender to modify the initial state. The resulting approach can be computationally expensive for large factored MDPs. To address this, we develop several interdiction algorithms that leverage variations of reinforcement learning using both linear and non-linear function approximation. Finally, we extend the interdiction framework to consider a Bayesian interdiction problem in which the interdictor is uncertain about some of the planner's initial state features. Extensive experiments demonstrate the effectiveness of our approaches.
UR - https://www.scopus.com/pages/publications/85055689633
U2 - 10.24963/ijcai.2018/667
DO - 10.24963/ijcai.2018/667
M3 - Conference contribution
AN - SCOPUS:85055689633
T3 - IJCAI International Joint Conference on Artificial Intelligence
SP - 4801
EP - 4807
BT - Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJCAI 2018
A2 - Lang, Jerome
PB - International Joint Conferences on Artificial Intelligence
T2 - 27th International Joint Conference on Artificial Intelligence, IJCAI 2018
Y2 - 13 July 2018 through 19 July 2018
ER -