Publications

Temporal Induced Self-Play for Stochastic Bayesian Games

Published in IJCAI 2021, 2021

We proposed a temporal-induced self-play algorithm for stochastic Bayesian games. We showed that our algorithm converges efficiently to a policy close to a sequential perfect Bayesian equilibrium, which makes the learned policy robust.

Download here

Bi-level Actor-Critic for Multi-agent Coordination

Published in AAAI 2020, 2020

We proposed a novel bi-level actor-critic learning method for multi-agent reinforcement learning. It allows agents to have different knowledge bases while their actions can still be executed simultaneously and in a distributed manner, and it yields a Stackelberg equilibrium as the solution.

Download here