Errui Ding

R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models

Oct 23, 2024
Via arXiv

TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model

Oct 14, 2024
Via arXiv

MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction

Oct 10, 2024
Via arXiv

Uni$^2$Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection

Sep 30, 2024
Via arXiv

MonoFormer: One Transformer for Both Diffusion and Autoregression

Sep 24, 2024
Via arXiv

ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer

Aug 06, 2024
Via arXiv

Add-SD: Rational Generation without Manual Reference

Jul 30, 2024
Via arXiv

LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction

Jul 16, 2024
Via arXiv

OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection

Jul 15, 2024
Via arXiv

OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer

Jul 15, 2024
Via arXiv