Selected Publications
Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments
Yixuan Wang, Simon Sinong Zhan, Ruochen Jiao, Zhilu Wang, Wanxin Jin, Zhuoran Yang, Zhaoran Wang, Chao Huang, Qi Zhu
ICML, Jul 2023.
Joint Differentiable Optimization and Verification for Certified Reinforcement Learning
Yixuan Wang*, Simon Sinong Zhan*, Zhilu Wang, Chao Huang, Zhaoran Wang, Zhuoran Yang, Qi Zhu
ICCPS, May 2023.
Variational Delayed Policy Optimization
Qingyuan Wu, Simon Sinong Zhan, Yixuan Wang, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Chao Huang
NeurIPS Spotlight, Nov, 2024.
State-wise Safe Reinforcement Learning with Pixel Observations
Simon Sinong Zhan, Yixuan Wang, Qingyuan Wu, Ruochen Jiao, Chao Huang, Qi Zhu
L4DC, Jul 2024.
Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays
Qingyuan Wu, Simon Sinong Zhan, Yixuan Wang, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Jürgen Schmidhuber, Chao Huang
ICML, July 2024.
Energy-Efficient Control Adaptation with Safety Guarantees for Learning-Enabled Cyber-Physical Systems
Yixuan Wang, Chao Huang, Qi Zhu
ICCAD (Best Paper Candidate), Nov 2020.
Design-while-Verify: Correct-by-Construction Control Learning with Verification in the Loop
Yixuan Wang, Chao Huang, Zhilu Wang, Zhaoran Wang, Qi Zhu
DAC, Jul 2022.
Empowering Autonomous Driving with Large Language Models: A Safety Perspective
Yixuan Wang, Ruochen Jiao, Chengtian Lang, Sinong Simon Zhan, Chao Huang, Zhaoran Wang, Zhuoran Yang, Qi Zhu
LLM Agents Workshop @ ICLR2024, May 2024.
One for Many: Transfer Learning for Building HVAC Control
Shichao Xu, Yixuan Wang, Yanzhi Wang, Zheng O'Neill, Qi Zhu
BuildSys, Nov 2020.
REGLO: Provable Neural Network Repair for Global Robustness Properties
Feisi Fu, Zhilu Wang, Weichao Zhou, Yixuan Wang, Jiameng Fan, Chao Huang, Qi Zhu, Xin Chen, Wenchao Li
AAAI, Feb 2024.
POLAR-Express: Efficient and Precise Formal Reachability Analysis of Neural-Network Controlled Systems
Yixuan Wang*, Weichao Zhou*, Jiameng Fan, Zhilu Wang, Jiajun Li, Xin Chen, Chao Huang, Wenchao Li, Qi Zhu
IEEE TCAD, Oct 2023.
Weak Adaptation Learning – Addressing Cross-domain Data Insufficiency with Weak Annotator
Shichao Xu, Lixu Wang, Yixuan Wang, Qi Zhu
ICCV, 2021.