Mubbasir Kapadia

RoMo: A Large-Scale, Richly Organized Dataset and Semantic Taxonomy for Human Motion Generation

May 25, 2026
Via arXiv

MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory

May 14, 2026
Via arXiv

Large Sign Language Models: Toward 3D American Sign Language Translation

Nov 11, 2025
Via arXiv

StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion

Mar 27, 2025
Via arXiv

ArchSeek: Retrieving Architectural Case Studies Using Vision-Language Models

Mar 24, 2025
Via arXiv

Less is More: Improving Motion Diffusion Models with Sparse Keyframes

Mar 18, 2025
Via arXiv

Cardiverse: Harnessing LLMs for Novel Card Game Prototyping

Feb 10, 2025
Via arXiv

CASIM: Composite Aware Semantic Injection for Text to Motion Generation

Feb 04, 2025
Via arXiv

TrajDiffuse: A Conditional Diffusion Model for Environment-Aware Trajectory Prediction

Oct 14, 2024
Via arXiv

From Words to Worlds: Transforming One-line Prompt into Immersive Multi-modal Digital Stories with Communicative LLM Agent

Jun 15, 2024
Via arXiv