GRAPH-ASSISTED PREDICTIVE STATE REPRESENTATIONS FOR MULTI-AGENT PARTIALLY OBSERVABLE SYSTEMS

Zhi Zhang, Zhuoran Yang, Han Liu, Pratap Tokekar, Furong Huang

Research output: Contribution to conferencePaperpeer-review

Abstract

We study reinforcement learning for partially observable multi-agent systems where each agent only has access to its own observation and reward and aims to maximize its cumulative rewards. To handle partial observations, we propose graph-assisted predictive state representations (GAPSR), a scalable multi-agent representation learning framework that leverages the agent connectivity graphs to aggregate local representations computed by each agent. In addition, our representations are readily able to incorporate dynamic interaction graphs and kernel space embeddings of the predictive states, and thus have strong flexibility and representation power. Based on GAPSR, we propose an end-to-end MARL algorithm that simultaneously infers the predictive representations and uses the representations as the input of a policy optimization algorithm. Empirically, we demonstrate the efficacy of the proposed algorithm provided on both a MAMuJoCo robotic learning experiment and a multi-agent particle learning environment.

Original languageEnglish (US)
StatePublished - 2022
Event10th International Conference on Learning Representations, ICLR 2022 - Virtual, Online
Duration: Apr 25 2022Apr 29 2022

Conference

Conference10th International Conference on Learning Representations, ICLR 2022
CityVirtual, Online
Period4/25/224/29/22

ASJC Scopus subject areas

  • Language and Linguistics
  • Computer Science Applications
  • Education
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'GRAPH-ASSISTED PREDICTIVE STATE REPRESENTATIONS FOR MULTI-AGENT PARTIALLY OBSERVABLE SYSTEMS'. Together they form a unique fingerprint.

Cite this