Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kalakonda Sai Shashank

MoRAG -- Multi-Fusion Retrieval Augmented Generation for Human Motion

Sep 18, 2024

Kalakonda Sai Shashank, Shubh Maheshwari, Ravi Kiran Sarvadevabhatla

Figure 1 for MoRAG -- Multi-Fusion Retrieval Augmented Generation for Human Motion

Figure 2 for MoRAG -- Multi-Fusion Retrieval Augmented Generation for Human Motion

Figure 3 for MoRAG -- Multi-Fusion Retrieval Augmented Generation for Human Motion

Figure 4 for MoRAG -- Multi-Fusion Retrieval Augmented Generation for Human Motion

Abstract:We introduce MoRAG, a novel multi-part fusion based retrieval-augmented generation strategy for text-based human motion generation. The method enhances motion diffusion models by leveraging additional knowledge obtained through an improved motion retrieval process. By effectively prompting large language models (LLMs), we address spelling errors and rephrasing issues in motion retrieval. Our approach utilizes a multi-part retrieval strategy to improve the generalizability of motion retrieval across the language space. We create diverse samples through the spatial composition of the retrieved motions. Furthermore, by utilizing low-level, part-specific motion information, we can construct motion samples for unseen text descriptions. Our experiments demonstrate that our framework can serve as a plug-and-play module, improving the performance of motion diffusion models. Code, pretrained models and sample videos will be made available at: https://motion-rag.github.io/

Via

Access Paper or Ask Questions