Combinatorial Multi-armed Bandits for Real-Time Strategy Games
[摘要] Games with large branching factors pose a significant challenge for game tree search algorithms. In this paper, we address this problem with asampling strategy for Monte Carlo Tree Search (MCTS) algorithms called "naive sampling", based on a variant of the Multi-armed Bandit problem called "Combinatorial Multi-armed Bandits" (CMAB). We analyze the theoretical properties of several variants of naive sampling, and empirically compare it against the other existing strategies in the literature for CMABs. We then evaluate these strategies in the context of real-time strategy (RTS) games, a genre of computer games characterized by their very large branching factors. Our results show that as the branching factor grows, naive sampling outperforms the other sampling strategies.
[发布日期] [发布机构]
[效力级别] [学科分类] 人工智能
[关键词] [时效性]