强化学习
计算机科学
初始化
任务(项目管理)
学习迁移
Dijkstra算法
软件
航程(航空)
人工智能
人机交互
机器学习
工程类
最短路径问题
系统工程
图形
理论计算机科学
程序设计语言
航空航天工程
作者
Agostinho Rosa,Mariana Cabral Falqueiro,Rodrigo Bonacin,Fábio Lúcio Lópes de Mendonça,Geraldo P. Rocha Filho,Vinícius P. Gonçalves
出处
期刊:Sensors
[MDPI AG]
日期:2023-11-01
卷期号:23 (21): 8892-8892
摘要
There is only a very short reaction time for people to find the best way out of a building in a fire outbreak. Software applications can be used to assist the rapid evacuation of people from the building; however, this is an arduous task, which requires an understanding of advanced technologies. Since well-known pathway algorithms (such as, Dijkstra, Bellman-Ford, and A*) can lead to serious performance problems, when it comes to multi-objective problems, we decided to make use of deep reinforcement learning techniques. A wide range of strategies including a random initialization of replay buffer and transfer learning were assessed in three projects involving schools of different sizes. The results showed the proposal was viable and that in most cases the performance of transfer learning was superior, enabling the learning agent to be trained in times shorter than 1 min, with 100% accuracy in the routes. In addition, the study raised challenges that had to be faced in the future.
科研通智能强力驱动
Strongly Powered by AbleSci AI