Equi-Reward Utility Maximizing Design in Stochastic Environments

Sarah Keren; Luis Pineda; Avigdor Gal; Erez Karpas; Shlomo Zilberstein

Equi-Reward Utility Maximizing Design in Stochastic Environments

Sarah Keren, Luis Pineda, Avigdor Gal, Erez Karpas, Shlomo Zilberstein

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence

Main track. Pages 4353-4360. https://doi.org/10.24963/ijcai.2017/608

PDF BibTeX

We present the Equi Reward Utility Maximizing Design (ER-UMD) problem for redesigning stochastic environments to maximize agent performance. ER-UMD fits well contemporary applications that require offline design of environments where robots and humans act and cooperate. To find an optimal modification sequence we present two novel solution techniques: a compilation that embeds design into a planning problem, allowing use of off-the-shelf solvers to find a solution, and a heuristic search in the modifications space, for which we present an admissible heuristic. Evaluation shows the feasibility of the approach using standard benchmarks from the probabilistic planning competition and a benchmark we created for a vacuum cleaning robot setting.

Keywords:

Planning and Scheduling: Markov Decisions Processes

Planning and Scheduling: Planning under Uncertainty

Combinatorial & Heuristic Search: Heuristic Search