Ruimao Zhang

Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset

Jan 09, 2025
Via arXiv

ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning

Jan 08, 2025
Via arXiv

ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model

Dec 19, 2024
Via arXiv

Ensuring Force Safety in Vision-Guided Robotic Manipulation via Implicit Tactile Calibration

Dec 13, 2024
Via arXiv

KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension

Nov 04, 2024
Via arXiv

WorldSimBench: Towards Video Generation Models as World Simulators

Oct 23, 2024
Via arXiv

Advancing Medical Radiograph Representation Learning: A Hybrid Pre-training Paradigm with Multilevel Semantic Granularity

Oct 01, 2024
Via arXiv

Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models

Aug 21, 2024
Via arXiv

F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions

Jul 17, 2024
Via arXiv

Open-World Human-Object Interaction Detection via Multi-modal Prompts

Jun 11, 2024
Via arXiv