Picture for Mingyu Ding

Mingyu Ding

DexH2R: Task-oriented Dexterous Manipulation from Human to Robots

Add code
Nov 07, 2024
Viaarxiv icon

X-Drive: Cross-modality consistent multi-sensor data synthesis for driving scenarios

Add code
Nov 02, 2024
Viaarxiv icon

Language-Driven Policy Distillation for Cooperative Driving in Multi-Agent Reinforcement Learning

Add code
Oct 31, 2024
Viaarxiv icon

MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts

Add code
Oct 30, 2024
Viaarxiv icon

CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians

Add code
Oct 28, 2024
Viaarxiv icon

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Add code
Oct 14, 2024
Figure 1 for MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Figure 2 for MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Figure 3 for MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Figure 4 for MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Viaarxiv icon

DCP: Learning Accelerator Dataflow for Neural Network via Propagation

Add code
Oct 09, 2024
Viaarxiv icon

TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection

Add code
Sep 17, 2024
Viaarxiv icon

P2 Explore: Efficient Exploration in Unknown Clustered Environment with Floor Plan Prediction

Add code
Sep 17, 2024
Viaarxiv icon

Embodiment-Agnostic Action Planning via Object-Part Scene Flow

Add code
Sep 16, 2024
Viaarxiv icon