TY - GEN
T1 - Correlation-Based Algorithm for Team-Maxmin Equilibrium in Multiplayer Extensive-Form Games
AU - Zhang, Youzhi
AU - An, Bo
AU - Subrahmanian, V. S.
N1 - Funding Information:
We gratefully acknowledge support from ONR grant N00014-20-1-2407. Bo An is supported by the National Re-
Funding Information:
search Foundation, Singapore under its AI Singapore Programme (AISG Award No: AISG-RP-2019-0013).
Funding Information:
We gratefully acknowledge support from ONR grant N00014-20-1-2407. Bo An is supported by the National Research Foundation, Singapore under its AI Singapore Pro-gramme (AISG Award No: AISG-RP-2019-0013).
Publisher Copyright:
© 2022 International Joint Conferences on Artificial Intelligence. All rights reserved.
PY - 2022
Y1 - 2022
N2 - Efficient algorithms computing a Nash equilibrium have been successfully applied to large zero-sum two-player extensive-form games (e.g., poker). However, in multiplayer games, computing a Nash equilibrium is generally hard, and the equilibria are not exchangeable, which makes players face the problem of selecting one of many different Nash equilibria. In this paper, we focus on an alternative solution concept in zero-sum multiplayer extensive-form games called Team-Maxmin Equilibrium (TME). It is a Nash equilibrium that maximizes each team member's utility. As TME is unique in general, it avoids the equilibrium selection problem. However, it is still difficult (FNP-hard) to find a TME. Computing it can be formulated as a non-convex program, but existing algorithms are capable of solving this program for only very small games. In this paper, we first refine the complexity result for computing a TME by using a correlation plan to show that a TME can be found in polynomial time in a specific class of games according to our boundary for complexity. Second, we propose an efficient correlation-based algorithm to solve the non-convex program for TME in games not belonging to this class. The algorithm combines two special correlation plans based on McCormick envelopes for convex relaxation and von Stengel-Forges polytope for correlated equilibria. We show that restricting the feasible solution space to von Stengel-Forges polytope will strictly reduce the feasible solution space after convex relaxation of nonlinear terms. Finally, experiments show that our algorithm is about four orders of magnitude faster than the prior state of the art and can solve many previously unsolvable games.
AB - Efficient algorithms computing a Nash equilibrium have been successfully applied to large zero-sum two-player extensive-form games (e.g., poker). However, in multiplayer games, computing a Nash equilibrium is generally hard, and the equilibria are not exchangeable, which makes players face the problem of selecting one of many different Nash equilibria. In this paper, we focus on an alternative solution concept in zero-sum multiplayer extensive-form games called Team-Maxmin Equilibrium (TME). It is a Nash equilibrium that maximizes each team member's utility. As TME is unique in general, it avoids the equilibrium selection problem. However, it is still difficult (FNP-hard) to find a TME. Computing it can be formulated as a non-convex program, but existing algorithms are capable of solving this program for only very small games. In this paper, we first refine the complexity result for computing a TME by using a correlation plan to show that a TME can be found in polynomial time in a specific class of games according to our boundary for complexity. Second, we propose an efficient correlation-based algorithm to solve the non-convex program for TME in games not belonging to this class. The algorithm combines two special correlation plans based on McCormick envelopes for convex relaxation and von Stengel-Forges polytope for correlated equilibria. We show that restricting the feasible solution space to von Stengel-Forges polytope will strictly reduce the feasible solution space after convex relaxation of nonlinear terms. Finally, experiments show that our algorithm is about four orders of magnitude faster than the prior state of the art and can solve many previously unsolvable games.
UR - http://www.scopus.com/inward/record.url?scp=85137921633&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85137921633&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85137921633
T3 - IJCAI International Joint Conference on Artificial Intelligence
SP - 606
EP - 612
BT - Proceedings of the 31st International Joint Conference on Artificial Intelligence, IJCAI 2022
A2 - De Raedt, Luc
A2 - De Raedt, Luc
PB - International Joint Conferences on Artificial Intelligence
T2 - 31st International Joint Conference on Artificial Intelligence, IJCAI 2022
Y2 - 23 July 2022 through 29 July 2022
ER -