机器人学
人工智能
计算机科学
机器人
钥匙(锁)
感知
模式
人机交互
心理学
计算机安全
社会学
社会科学
神经科学
作者
Yeseung Kim,Dohyun Kim,Ji‐Eun Choi,Jisang Park,Nahmsuk Oh,Daehyung Park
标识
DOI:10.1007/s11370-024-00550-5
摘要
Abstract In recent years, the integration of large language models (LLMs) has revolutionized the field of robotics, enabling robots to communicate, understand, and reason with human-like proficiency. This paper explores the multifaceted impact of LLMs on robotics, addressing key challenges and opportunities for leveraging these models across various domains. By categorizing and analyzing LLM applications within core robotics elements—communication, perception, planning, and control—we aim to provide actionable insights for researchers seeking to integrate LLMs into their robotic systems. Our investigation focuses on LLMs developed post-GPT-3.5, primarily in text-based modalities while also considering multimodal approaches for perception and control. We offer comprehensive guidelines and examples for prompt engineering, facilitating beginners’ access to LLM-based robotics solutions. Through tutorial-level examples and structured prompt construction, we illustrate how LLM-guided enhancements can be seamlessly integrated into robotics applications. This survey serves as a roadmap for researchers navigating the evolving landscape of LLM-driven robotics, offering a comprehensive overview and practical guidance for harnessing the power of language models in robotics development.
科研通智能强力驱动
Strongly Powered by AbleSci AI