TY - JOUR
T1 - Secure Control in Partially Observable Environments to Satisfy LTL Specifications
AU - Ramasubramanian, Bhaskar
AU - Niu, Luyao
AU - Clark, Andrew
AU - Bushnell, Linda
AU - Poovendran, Radha
N1 - Publisher Copyright:
© 1963-2012 IEEE.
PY - 2021/12/1
Y1 - 2021/12/1
N2 - This article studies the synthesis of control policies for an agent that has to satisfy a temporal logic specification in a partially observable environment, in the presence of an adversary. The interaction of the agent (defender) with the adversary is modeled as a partially observable stochastic game. The goal is to generate a defender policy to maximize satisfaction of a given temporal logic specification under any adversary policy. The search for policies is limited to the space of finite-state controllers, which leads to a tractable approach to determine policies. We relate the satisfaction of the specification to reaching (a subset of) recurrent states of a Markov chain. We present an algorithm to determine a set of defender and adversary finite-state controllers of fixed sizes that will satisfy the temporal logic specification and prove that it is sound. We then propose a value-iteration algorithm to maximize the probability of satisfying the temporal logic specification under finite-state controllers of fixed sizes. Finally, we extend this setting to the scenario where the size of the finite-state controller of the defender can be increased to improve the satisfaction probability. We illustrate our approach with an example.
AB - This article studies the synthesis of control policies for an agent that has to satisfy a temporal logic specification in a partially observable environment, in the presence of an adversary. The interaction of the agent (defender) with the adversary is modeled as a partially observable stochastic game. The goal is to generate a defender policy to maximize satisfaction of a given temporal logic specification under any adversary policy. The search for policies is limited to the space of finite-state controllers, which leads to a tractable approach to determine policies. We relate the satisfaction of the specification to reaching (a subset of) recurrent states of a Markov chain. We present an algorithm to determine a set of defender and adversary finite-state controllers of fixed sizes that will satisfy the temporal logic specification and prove that it is sound. We then propose a value-iteration algorithm to maximize the probability of satisfying the temporal logic specification under finite-state controllers of fixed sizes. Finally, we extend this setting to the scenario where the size of the finite-state controller of the defender can be increased to improve the satisfaction probability. We illustrate our approach with an example.
KW - Finite-state controllers (FSCs)
KW - Global Markov chain (GMC)
KW - Linear temporal logic (LTL)
KW - Partially observable stochastic games (POSGs)
KW - Policy iteration
KW - Stackelberg equilibrium
KW - Value iteration
UR - https://www.scopus.com/pages/publications/85096833646
U2 - 10.1109/TAC.2020.3039484
DO - 10.1109/TAC.2020.3039484
M3 - Article
AN - SCOPUS:85096833646
SN - 0018-9286
VL - 66
SP - 5665
EP - 5679
JO - IEEE Transactions on Automatic Control
JF - IEEE Transactions on Automatic Control
IS - 12
ER -