部分可观测马尔可夫决策过程
抓住
计算机科学
动作(物理)
操纵杆
人机交互
马尔可夫决策过程
人工智能
自治
机器人
过程(计算)
马尔可夫过程
偏爱
对象(语法)
马尔可夫链
机器学习
模拟
马尔可夫模型
数学
统计
操作系统
物理
量子力学
程序设计语言
法学
政治学
作者
J.L. Jr. Yow,Neha Priyadarshini Garg,Wei Tech Ang
出处
期刊:IEEE Transactions on Robotics
[Institute of Electrical and Electronics Engineers]
日期:2023-11-20
卷期号:40: 332-350
被引量:6
标识
DOI:10.1109/tro.2023.3334631
摘要
In shared autonomy (SA), accurate user intent prediction is crucial for good robot assistance and avoiding user–robot conflicts. Prior works have relied on passive observation of joystick inputs to predict user intent, which works when the goals are clearly separated or when a common policy exists for multiple goals. However, they may not work well when grasping objects to perform daily activities, as there are multiple ways to grasp the same object. We demonstrate the need for active information-gathering in such cases and show how this can be done in a principled manner by formulating SA as a discrete action partially observable Markov decision process (POMDP), reasoning over high-level actions. One of our insights is that apart from having explicit information-gathering actions and goal-oriented actions, it is important to have actions that move toward a distribution of goals and provide no assistance in the POMDP action space. Compared with a method with no active information-gathering, our method performs tasks faster, requires less user input, and decreases opposing actions, especially for more complex objects, getting higher ratings and preference in our user study.
科研通智能强力驱动
Strongly Powered by AbleSci AI