Picture for Xuxin Cheng

Xuxin Cheng

DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

Add code
Dec 13, 2024
Viaarxiv icon

Mobile-TeleVision: Predictive Motion Priors for Humanoid Whole-Body Control

Add code
Dec 10, 2024
Viaarxiv icon

DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval

Add code
Sep 16, 2024
Viaarxiv icon

Audio-text Retrieval with Transformer-based Hierarchical Alignment and Disentangled Cross-modal Representation

Add code
Sep 14, 2024
Viaarxiv icon

ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation

Add code
Aug 21, 2024
Figure 1 for ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation
Figure 2 for ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation
Figure 3 for ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation
Figure 4 for ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation
Viaarxiv icon

FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model

Add code
Aug 18, 2024
Viaarxiv icon

MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models

Add code
Aug 06, 2024
Figure 1 for MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
Figure 2 for MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
Figure 3 for MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
Figure 4 for MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
Viaarxiv icon

MMRA: A Benchmark for Multi-granularity Multi-image Relational Association

Add code
Jul 24, 2024
Figure 1 for MMRA: A Benchmark for Multi-granularity Multi-image Relational Association
Figure 2 for MMRA: A Benchmark for Multi-granularity Multi-image Relational Association
Figure 3 for MMRA: A Benchmark for Multi-granularity Multi-image Relational Association
Figure 4 for MMRA: A Benchmark for Multi-granularity Multi-image Relational Association
Viaarxiv icon

EXCGEC: A Benchmark of Edit-wise Explainable Chinese Grammatical Error Correction

Add code
Jul 01, 2024
Viaarxiv icon

Open-TeleVision: Teleoperation with Immersive Active Visual Feedback

Add code
Jul 01, 2024
Figure 1 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Figure 2 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Figure 3 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Figure 4 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Viaarxiv icon