TY - GEN
T1 - Cocktail
T2 - 58th ACM/IEEE Design Automation Conference, DAC 2021
AU - Wang, Yixuan
AU - Huang, Chao
AU - Wang, Zhilu
AU - Xu, Shichao
AU - Wang, Zhaoran
AU - Zhu, Qi
N1 - Funding Information:
We gratefully acknowledge the support from NSF grants 1834701, 1839511, 1724341, 2038853, 2048075, 2008827, 2015568, 1934931, and ONR grant N00014-19-1-2496, Simons Institute (Theory of Reinforcement Learning), Amazon, J.P. Morgan, and Two Sigma.
Publisher Copyright:
© 2021 IEEE.
PY - 2021/12/5
Y1 - 2021/12/5
N2 - Neural networks are being increasingly applied to control and decision making for learning-enabled cyber-physical systems (LE-CPSs). They have shown promising performance without requiring the development of complex physical models; however, their adoption is significantly hindered by the concerns on their safety, robustness, and efficiency. In this work, we propose COCKTAIL, a novel design framework that automatically learns a neural network based controller from multiple existing control methods (experts) that could be either model-based or neural network based. In particular, COCKTAIL first performs reinforcement learning to learn an optimal system-level adaptive mixing strategy that incorporates the underlying experts with dynamically-assigned weights, and then conducts a teacher-student distillation with probabilistic adversarial training and regularization to synthesize a student neural network controller with improved control robustness (measured by a safe control rate metric with respect to adversarial attacks or measurement noises), control energy efficiency, and verifiability (measured by the computation time for verification). Experiments on three non-linear systems demonstrate significant advantages of our approach on these properties over various baseline methods.
AB - Neural networks are being increasingly applied to control and decision making for learning-enabled cyber-physical systems (LE-CPSs). They have shown promising performance without requiring the development of complex physical models; however, their adoption is significantly hindered by the concerns on their safety, robustness, and efficiency. In this work, we propose COCKTAIL, a novel design framework that automatically learns a neural network based controller from multiple existing control methods (experts) that could be either model-based or neural network based. In particular, COCKTAIL first performs reinforcement learning to learn an optimal system-level adaptive mixing strategy that incorporates the underlying experts with dynamically-assigned weights, and then conducts a teacher-student distillation with probabilistic adversarial training and regularization to synthesize a student neural network controller with improved control robustness (measured by a safe control rate metric with respect to adversarial attacks or measurement noises), control energy efficiency, and verifiability (measured by the computation time for verification). Experiments on three non-linear systems demonstrate significant advantages of our approach on these properties over various baseline methods.
UR - http://www.scopus.com/inward/record.url?scp=85115848633&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85115848633&partnerID=8YFLogxK
U2 - 10.1109/DAC18074.2021.9586148
DO - 10.1109/DAC18074.2021.9586148
M3 - Conference contribution
AN - SCOPUS:85115848633
T3 - Proceedings - Design Automation Conference
SP - 397
EP - 402
BT - 2021 58th ACM/IEEE Design Automation Conference, DAC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 5 December 2021 through 9 December 2021
ER -