TY - JOUR
T1 - Data-driven control of nonlinear systems
T2 - An online sequential approach
AU - Vu, Minh
AU - Huang, Yunshen
AU - Zeng, Shen
N1 - Publisher Copyright:
© 2024 The Authors
PY - 2024/11
Y1 - 2024/11
N2 - While data-driven control has shown its potential for solving complex tasks, current algorithms such as reinforcement learning are still data-intensive and often limited to simulated environments. Model-based learning is a promising approach to reducing the amount of data required in practical implementations, yet it suffers from a critical issue known as model exploitation. In this paper, we present a sequential approach to model-based learning that avoids model exploitation and achieves stable system behaviors during learning with minimal exploration. The advocated control design utilizes estimates of the system's local dynamics to step-by-step improve the control. During the process, when additional data is required, the program pauses the control synthesis to collect data in the surrounding area and updates the model accordingly. The local and sequential nature of this approach is the key component to regulating the system's exploration in the state–action space and, at the same time, avoiding the issue of model exploitation, which are the main challenges in model-based learning control. Through simulated examples and physical experiments, we demonstrate that the proposed approach can quickly learn a desirable control from scratch, with just a small number of trials.
AB - While data-driven control has shown its potential for solving complex tasks, current algorithms such as reinforcement learning are still data-intensive and often limited to simulated environments. Model-based learning is a promising approach to reducing the amount of data required in practical implementations, yet it suffers from a critical issue known as model exploitation. In this paper, we present a sequential approach to model-based learning that avoids model exploitation and achieves stable system behaviors during learning with minimal exploration. The advocated control design utilizes estimates of the system's local dynamics to step-by-step improve the control. During the process, when additional data is required, the program pauses the control synthesis to collect data in the surrounding area and updates the model accordingly. The local and sequential nature of this approach is the key component to regulating the system's exploration in the state–action space and, at the same time, avoiding the issue of model exploitation, which are the main challenges in model-based learning control. Through simulated examples and physical experiments, we demonstrate that the proposed approach can quickly learn a desirable control from scratch, with just a small number of trials.
KW - Iterative synthesis
KW - Model-based learning control
KW - Neural networks
KW - Nonlinear dynamics
KW - Physical experiments
UR - http://www.scopus.com/inward/record.url?scp=85206012241&partnerID=8YFLogxK
U2 - 10.1016/j.sysconle.2024.105932
DO - 10.1016/j.sysconle.2024.105932
M3 - Article
AN - SCOPUS:85206012241
SN - 0167-6911
VL - 193
JO - Systems and Control Letters
JF - Systems and Control Letters
M1 - 105932
ER -