Picture for Yao Mu

Yao Mu

$\textbf{EMOS}$: $\textbf{E}$mbodiment-aware Heterogeneous $\textbf{M}$ulti-robot $\textbf{O}$perating $\textbf{S}$ystem with LLM Agents

Add code
Oct 30, 2024
Viaarxiv icon

VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions

Add code
Oct 29, 2024
Viaarxiv icon

Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking

Add code
Sep 24, 2024
Viaarxiv icon

RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version)

Add code
Sep 04, 2024
Viaarxiv icon

Cog-GA: A Large Language Models-based Generative Agent for Vision-Language Navigation in Continuous Environments

Add code
Sep 04, 2024
Viaarxiv icon

HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model

Add code
Aug 18, 2024
Viaarxiv icon

CAMON: Cooperative Agents for Multi-Object Navigation with LLM-based Conversations

Add code
Jun 30, 2024
Viaarxiv icon

DAG-Plan: Generating Directed Acyclic Dependency Graphs for Dual-Arm Cooperative Planning

Add code
Jun 14, 2024
Viaarxiv icon

Learning Reward for Robot Skills Using Large Language Models via Self-Alignment

Add code
May 16, 2024
Viaarxiv icon

ManiPose: A Comprehensive Benchmark for Pose-aware Object Manipulation in Robotics

Add code
Mar 20, 2024
Viaarxiv icon