Provably efficient neural GTD algorithm for off-policy learning

Hoi To Wai, Zhuoran Yang, Zhaoran Wang, Mingyi Hong

Research output: Contribution to journalConference articlepeer-review

4 Scopus citations

Fingerprint

Dive into the research topics of 'Provably efficient neural GTD algorithm for off-policy learning'. Together they form a unique fingerprint.

Keyphrases

Mathematics