Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets

Han Zhong*, Wei Xiong, Jiyuan Tan, Liwei Wang, Tong Zhang, Zhaoran Wang*, Zhuoran Yang*

*Corresponding author for this work

Research output: Contribution to journal, Conference article, peer-reviewed

14 Scopus citations
