蒙特卡罗方法
统计物理学
蒙特卡罗树搜索
计算机科学
树(集合论)
算法
物理
数学
组合数学
统计
作者
Benjamin Rivière,John Lathrop,Soon‐Jo Chung
出处
期刊:Science robotics
[American Association for the Advancement of Science (AAAS)]
日期:2024-12-04
卷期号:9 (97)
标识
DOI:10.1126/scirobotics.ado1010
摘要
The ability of a robot to plan complex behaviors with real-time computation, rather than adhering to predesigned or offline-learned routines, alleviates the need for specialized algorithms or training for each problem instance. Monte Carlo Tree Search is a powerful planning algorithm that strategically explores simulated future possibilities, but it requires a discrete problem representation that is irreconcilable with the continuous dynamics of the physical world. We present Spectral Expansion Tree Search (SETS), a real-time, tree-based planner that uses the spectrum of the locally linearized system to construct a low-complexity and approximately equivalent discrete representation of the continuous world. We prove SETS converges to a bound of the globally optimal solution for continuous, deterministic and differentiable Markov Decision Processes, a broad class of problems that includes underactuated nonlinear dynamics, non-convex reward functions, and unstructured environments. We experimentally validate SETS on drone, spacecraft, and ground vehicle robots and one numerical experiment, each of which is not directly solvable with existing methods. We successfully show SETS automatically discovers a diverse set of optimal behaviors and motion trajectories in real time.
科研通智能强力驱动
Strongly Powered by AbleSci AI