Mobile agent path planning under uncertain environment using reinforcement learning and probabilistic model checking

强化学习计算机科学概率逻辑概率CTL 可靠性（半导体）马尔可夫链增强学习马尔可夫决策过程运动规划路径（计算）人工智能数学优化机器学习马尔可夫过程算法的概率分析数学机器人物理统计功率（物理）程序设计语言量子力学

作者

Xia Wang,Jun Liu,Chris Nugent,Ian Cleland,Yang Xu

出处

期刊：Knowledge Based Systems [Elsevier BV]
日期：2023-03-01 卷期号：264: 110355-110355 被引量：6

标识

DOI：10.1016/j.knosys.2023.110355

摘要

The major challenge in mobile agent path planning, within an uncertain environment, is effectively determining an optimal control model to discover the target location as quickly as possible and evaluating the control system’s reliability. To address this challenge, we introduce a learning-verification integrated mobile agent path planning method to achieve both the effectiveness and the reliability. More specifically, we first propose a modified Q-learning algorithm (a popular reinforcement learning algorithm), called QEA−learning algorithm, to find the best Q-table in the environment. We then determine the location transition probability matrix, and establish a probability model using the assumption that the agent selects a location with a higher Q-value. Secondly, the learnt behaviour of the mobile agent based on QEA−learning algorithm, is formalized as a Discrete-time Markov Chain (DTMC) model. Thirdly, the required reliability requirements of the mobile agent control system are specified using Probabilistic Computation Tree Logic (PCTL). In addition, the DTMC model and the specified properties are taken as the input of the Probabilistic Model Checker PRISM for automatic verification. This is preformed to evaluate and verify the control system’s reliability. Finally, a case study of a mobile agent walking in a grids map is used to illustrate the proposed learning algorithm. Here we have a special focus on the modelling approach demonstrating how PRISM can be used to analyse and evaluate the reliability of the mobile agent control system learnt via the proposed algorithm. The results show that the path identified using the proposed integrated method yields the largest expected reward.

求助该文献

最长约 10秒，即可获得该文献文件

Mobile agent path planning under uncertain environment using reinforcement learning and probabilistic model checking

今日热心研友