Recursive Small-Step Multi-Agent A* for Dec-POMDPs

Wietze Koops; Nils Jansen; Sebastian Junges; Thiago D. Simão

doi:10.24963/ijcai.2023/600

Recursive Small-Step Multi-Agent A* for Dec-POMDPs

Wietze Koops, Nils Jansen, Sebastian Junges, Thiago D. Simão

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence

Main Track. Pages 5402-5410. https://doi.org/10.24963/ijcai.2023/600

PDF BibTeX

We present recursive small-step multi-agent A* (RS-MAA*), an exact algorithm that optimizes the expected reward in decentralized partially observable Markov decision processes (Dec-POMDPs). RS-MAA* builds on multi-agent A* (MAA*), an algorithm that finds policies by exploring a search tree, but tackles two major scalability concerns. First, we employ a modified, small-step variant of the search tree that avoids the double exponential outdegree of the classical formulation. Second, we use a tight and recursive heuristic that we compute on-the-fly, thereby avoiding an expensive precomputation. The resulting algorithm is conceptually simple, yet it shows superior performance on a rich set of standard benchmarks.

Keywords:

Planning and Scheduling: PS: Distributed and multi-agent planning

Agent-based and Multi-agent Systems: MAS: Multi-agent planning

Planning and Scheduling: PS: Planning under uncertainty