Monitoring plan execution in partially observable stochastic worlds

[摘要] This thesis presents two novel algorithms for monitoring plan execution in stochastic partially observable environments. The problems can beformulated as partially-observable Markov decision processes (POMDPs). Exact solutions of POMDP problems are difficult to find due to the computational complexity, so many approximate solutions are proposed instead. These POMDP solvers tend to generate an approximate policy at planning time and execute the policy without any change at run time. Our approaches will monitor the execution of the initial approximate policy and perform plan modification procedure to improve the policy’s quality at run time. This thesis considers twoapproximate POMDP solvers. One is a translation-based POMDP solver which converts a subclass of POMDP, called quasi-deterministic POMDP (QDET-POMDP) problems into classical planning problems or Markov decision processes (MDPs). The resulting approximate solution is either a contingency plan or an MDP policy that requires full observability of the world at run time. The other is a point-based POMDP solver which generates an approximate policy by utilizing sampling techniques. Study of the algorithms in simulation has shown that our execution monitoring approaches can improve the approximate POMDP solvers overall performance in terms of plan quality, plan generation time and plan execution time.

[发布日期] [发布机构] University:University of Birmingham;Department:School of Computer Science

[效力级别] [学科分类]

[关键词] Q Science;QA Mathematics;QA75 Electronic computers. Computer science [时效性]

浏览次数：19

统一登录查看全文激活码登录查看全文