Neural proximal/trust region policy optimization attains globally optimal policy

Boyi Liu*, Qi Cai, Zhuoran Yang, Zhaoran Wang

*Corresponding author for this work

Research output: Contribution to journal, Conference article, peer-review

33 Scopus citations
