Incremental Learning of Humanoid Robot Behavior from Natural Interaction and Large Language Models

Bärmann, Leonard; Kartmann, Rainer; Peller-Konrad, Fabian; Niehues, Jan; Waibel, Alex; Asfour, Tamim

doi:10.3389/frobt.2024.1455375

Computer Science > Robotics

arXiv:2309.04316 (cs)

[Submitted on 8 Sep 2023 (v1), last revised 16 May 2024 (this version, v3)]

Title:Incremental Learning of Humanoid Robot Behavior from Natural Interaction and Large Language Models

Authors:Leonard Bärmann, Rainer Kartmann, Fabian Peller-Konrad, Jan Niehues, Alex Waibel, Tamim Asfour

View PDF

Abstract:Natural-language dialog is key for intuitive human-robot interaction. It can be used not only to express humans' intents, but also to communicate instructions for improvement if a robot does not understand a command correctly. Of great importance is to endow robots with the ability to learn from such interaction experience in an incremental way to allow them to improve their behaviors or avoid mistakes in the future. In this paper, we propose a system to achieve incremental learning of complex behavior from natural interaction, and demonstrate its implementation on a humanoid robot. Building on recent advances, we present a system that deploys Large Language Models (LLMs) for high-level orchestration of the robot's behavior, based on the idea of enabling the LLM to generate Python statements in an interactive console to invoke both robot perception and action. The interaction loop is closed by feeding back human instructions, environment observations, and execution results to the LLM, thus informing the generation of the next statement. Specifically, we introduce incremental prompt learning, which enables the system to interactively learn from its mistakes. For that purpose, the LLM can call another LLM responsible for code-level improvements of the current interaction based on human feedback. The improved interaction is then saved in the robot's memory, and thus retrieved on similar requests. We integrate the system in the robot cognitive architecture of the humanoid robot ARMAR-6 and evaluate our methods both quantitatively (in simulation) and qualitatively (in simulation and real-world) by demonstrating generalized incrementally-learned knowledge.

Comments:	This version (v3) adds further quantitative evaluation and many improvements. v2 was presented at the Workshop on Language and Robot Learning (LangRob) at the Conference on Robot Learning (CoRL) 2023. Supplementary video available at this https URL
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2309.04316 [cs.RO]
	(or arXiv:2309.04316v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2309.04316
Journal reference:	Frontiers in Robotics and AI, Volume 11 - 2024
Related DOI:	https://doi.org/10.3389/frobt.2024.1455375

Submission history

From: Leonard Bärmann [view email]
[v1] Fri, 8 Sep 2023 13:29:05 UTC (2,108 KB)
[v2] Thu, 2 Nov 2023 17:38:37 UTC (2,116 KB)
[v3] Thu, 16 May 2024 09:07:42 UTC (2,318 KB)

Computer Science > Robotics

Title:Incremental Learning of Humanoid Robot Behavior from Natural Interaction and Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Incremental Learning of Humanoid Robot Behavior from Natural Interaction and Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators