Dynamic regret of policy optimization in non-stationary environments

Yingjie Fei, Zhuoran Yang, Zhaoran Wang, Qiaomin Xie

Research output: Contribution to journalConference articlepeer-review

33 Scopus citations

Fingerprint

Dive into the research topics of 'Dynamic regret of policy optimization in non-stationary environments'. Together they form a unique fingerprint.

Keyphrases

Computer Science