Best Heuristic Identification for Constraint Satisfaction

Frederic Koriche; Christophe Lecoutre; Anastasia Paparrizou; Hugues Wattez

doi:10.24963/ijcai.2022/258

Best Heuristic Identification for Constraint Satisfaction

Frederic Koriche, Christophe Lecoutre, Anastasia Paparrizou, Hugues Wattez

Watch video

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence

Main Track. Pages 1859-1865. https://doi.org/10.24963/ijcai.2022/258

PDF BibTeX

In constraint satisfaction problems, the variable ordering heuristic takes a central place by selecting the variables to branch on during backtrack search. As many hand-crafted branching heuristics have been proposed in the literature, a key issue is to identify, from a pool of candidate heuristics, which one is the best for solving a given constraint satisfaction task. Based on the observation that modern constraint solvers are using restart sequences, the best heuristic identification problem can be cast in the context of multi-armed bandits as a non-stochastic best arm identification problem. Namely, during each run of some given restart sequence, the bandit algorithm selects a branching heuristic and receives a reward for this heuristic before proceeding to the next run. The goal is to identify the best heuristic using few runs, and without any stochastic assumption about the constraint solver. In this study, we propose an adaptive variant of Successive Halving that exploits Luby's universal restart sequence. We analyze the convergence of this bandit algorithm in the non-stochastic setting, and we demonstrate its empirical effectiveness on various constraint satisfaction benchmarks.

Keywords:

Constraint Satisfaction and Optimization: Constraints and Machine Learning

Constraint Satisfaction and Optimization: Constraint Satisfaction

Machine Learning: Reinforcement Learning