Picture for Pengxiang Ding

Pengxiang Ding

Score and Distribution Matching Policy: Advanced Accelerated Visuomotor Policies via Matched Distillation

Add code
Dec 13, 2024
Viaarxiv icon

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Add code
Dec 09, 2024
Viaarxiv icon

Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration

Add code
Nov 26, 2024
Viaarxiv icon

ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification

Add code
Sep 30, 2024
Figure 1 for ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification
Figure 2 for ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification
Figure 3 for ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification
Figure 4 for ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification
Viaarxiv icon

PiTe: Pixel-Temporal Alignment for Large Video-Language Model

Add code
Sep 11, 2024
Viaarxiv icon

DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation

Add code
Apr 27, 2024
Figure 1 for DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation
Figure 2 for DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation
Figure 3 for DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation
Figure 4 for DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation
Viaarxiv icon

Towards more realistic human motion prediction with attention to motion coordination

Add code
Apr 04, 2024
Viaarxiv icon

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference

Add code
Mar 22, 2024
Viaarxiv icon

GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot

Add code
Mar 20, 2024
Viaarxiv icon

QUAR-VLA: Vision-Language-Action Model for Quadruped Robots

Add code
Dec 22, 2023
Viaarxiv icon