Reinforcement learning for routing in communication networks

[摘要] ENGLISH ABSTRACT:Routing policies for packet-switched communication networks must be able to adaptto changing traffic patterns and topologies. We study the feasibility of implementingan adaptive routing policy using the Q-Learning algorithm which learns sequences ofactions from delayed rewards. The Q-Routing algorithm adapts a network's routingpolicy based on local information alone and converges toward an optimal solution. Wedemonstrate that Q-Routing is a viable alternative to other adaptive routing methodssuch as Bellman-Ford. We also study variations of Q-Routing designed to better explorepossible routes and to take into consideration limited buffer size and optimize multipleobjectives.

[发布日期] [发布机构] Stellenbosch University

[效力级别] [学科分类]

[关键词] [时效性]

浏览次数：4

统一登录查看全文激活码登录查看全文