Search-based Reinforcement Learning through Bandit Linear Optimization

Milan Peelman, Antoon Bronselaer, Guy De Tré

Video #1 Length : 00:01:32
Video #2 Length : 00:06:00