Picture for Chenxu Hu

Chenxu Hu

DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models

Add code
Feb 25, 2024
Figure 1 for DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models
Figure 2 for DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models
Figure 3 for DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models
Figure 4 for DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models
Viaarxiv icon

DiCLET-TTS: Diffusion Model based Cross-lingual Emotion Transfer for Text-to-Speech -- A Study between English and Mandarin

Add code
Sep 02, 2023
Viaarxiv icon

Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models

Add code
Jun 29, 2023
Figure 1 for Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
Figure 2 for Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
Figure 3 for Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
Figure 4 for Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
Viaarxiv icon

ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory

Add code
Jun 07, 2023
Viaarxiv icon

ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries

Add code
Aug 02, 2022
Figure 1 for ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries
Figure 2 for ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries
Figure 3 for ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries
Figure 4 for ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries
Viaarxiv icon

Controllable and Lossless Non-Autoregressive End-to-End Text-to-Speech

Add code
Jul 13, 2022
Figure 1 for Controllable and Lossless Non-Autoregressive End-to-End Text-to-Speech
Figure 2 for Controllable and Lossless Non-Autoregressive End-to-End Text-to-Speech
Figure 3 for Controllable and Lossless Non-Autoregressive End-to-End Text-to-Speech
Figure 4 for Controllable and Lossless Non-Autoregressive End-to-End Text-to-Speech
Viaarxiv icon

Neural Dubber: Dubbing for Silent Videos According to Scripts

Add code
Oct 15, 2021
Figure 1 for Neural Dubber: Dubbing for Silent Videos According to Scripts
Figure 2 for Neural Dubber: Dubbing for Silent Videos According to Scripts
Figure 3 for Neural Dubber: Dubbing for Silent Videos According to Scripts
Figure 4 for Neural Dubber: Dubbing for Silent Videos According to Scripts
Viaarxiv icon

FastSpeech 2: Fast and High-Quality End-to-End Text to Speech

Add code
Jun 22, 2020
Figure 1 for FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Figure 2 for FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Figure 3 for FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Figure 4 for FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Viaarxiv icon