Picture for Haoran Cheng

Haoran Cheng

VP-MEL: Visual Prompts Guided Multimodal Entity Linking

Add code
Dec 10, 2024
Viaarxiv icon

Searching Priors Makes Text-to-Video Synthesis Better

Add code
Jun 05, 2024
Figure 1 for Searching Priors Makes Text-to-Video Synthesis Better
Figure 2 for Searching Priors Makes Text-to-Video Synthesis Better
Figure 3 for Searching Priors Makes Text-to-Video Synthesis Better
Figure 4 for Searching Priors Makes Text-to-Video Synthesis Better
Viaarxiv icon

EmoSpeaker: One-shot Fine-grained Emotion-Controlled Talking Face Generation

Add code
Feb 02, 2024
Viaarxiv icon

Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving

Add code
Dec 19, 2023
Viaarxiv icon

Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning

Add code
Nov 29, 2023
Viaarxiv icon

DHOT-GM: Robust Graph Matching Using A Differentiable Hierarchical Optimal Transport Framework

Add code
Oct 18, 2023
Viaarxiv icon

MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection

Add code
Aug 18, 2023
Viaarxiv icon

Learning Occupancy for Monocular 3D Object Detection

Add code
May 25, 2023
Viaarxiv icon