Embodied Intelligence and Multi-Agent Collaboration

The field of artificial intelligence is witnessing a significant shift towards embodied intelligence, where agents interact with the physical world through integrated perception, cognition, action, and advanced reasoning powered by large language models (LLMs). A key area of focus is the development of multi-agent collaboration frameworks that enable LLMs to adaptively collaborate and perform complex tasks. Researchers are working to address the challenges of efficient natural language communication, task execution, and response generation in multi-agent systems.

Noteworthy papers in this area include:

  • Collaborating Action by Action: A Multi-agent LLM Framework for Embodied Reasoning, which introduces a platform and benchmark for evaluating embodied and collaborative reasoning in LLM agents.
  • Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant, which presents a comprehensive end-to-end evaluation pipeline for LLM-based multi-agent frameworks.
  • RepliBench: Evaluating the autonomous replication capabilities of language model agents, which provides a suite of evaluations to measure autonomous replication capabilities and highlights the potential safety risks of uncontrollable autonomous replication.
  • MATCHA: Can Multi-Agent Collaboration Build a Trustworthy Conversational Recommender?, which proposes a multi-agent collaboration framework for conversational recommendation systems and achieves superior or comparable performance to state-of-the-art models.
  • MuRAL: A Multi-Resident Ambient Sensor Dataset Annotated with Natural Language for Activities of Daily Living, which introduces a dataset annotated with fine-grained natural language descriptions for human activity recognition using ambient sensors.

Sources

Collaborating Action by Action: A Multi-agent LLM Framework for Embodied Reasoning

Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant

RepliBench: Evaluating the autonomous replication capabilities of language model agents

Generative AI in Embodied Systems: System-Level Analysis of Performance, Efficiency and Scalability

From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review

MATCHA: Can Multi-Agent Collaboration Build a Trustworthy Conversational Recommender?

Search-Based Interaction For Conversation Recommendation via Generative Reward Model Based Simulated User

MuRAL: A Multi-Resident Ambient Sensor Dataset Annotated with Natural Language for Activities of Daily Living

Built with on top of