Picture for Cong Liu

Cong Liu

EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion

Add code
Nov 23, 2024
Figure 1 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Figure 2 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Figure 3 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Figure 4 for EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Viaarxiv icon

Transferable Adversarial Attacks against ASR

Add code
Nov 14, 2024
Viaarxiv icon

BOXR: Body and head motion Optimization framework for eXtended Reality

Add code
Oct 16, 2024
Figure 1 for BOXR: Body and head motion Optimization framework for eXtended Reality
Figure 2 for BOXR: Body and head motion Optimization framework for eXtended Reality
Figure 3 for BOXR: Body and head motion Optimization framework for eXtended Reality
Figure 4 for BOXR: Body and head motion Optimization framework for eXtended Reality
Viaarxiv icon

Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis

Add code
Aug 10, 2024
Figure 1 for Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis
Figure 2 for Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis
Figure 3 for Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis
Figure 4 for Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis
Viaarxiv icon

Cross-modulated Attention Transformer for RGBT Tracking

Add code
Aug 05, 2024
Figure 1 for Cross-modulated Attention Transformer for RGBT Tracking
Figure 2 for Cross-modulated Attention Transformer for RGBT Tracking
Figure 3 for Cross-modulated Attention Transformer for RGBT Tracking
Figure 4 for Cross-modulated Attention Transformer for RGBT Tracking
Viaarxiv icon

Cool-Fusion: Fuse Large Language Models without Training

Add code
Jul 29, 2024
Figure 1 for Cool-Fusion: Fuse Large Language Models without Training
Figure 2 for Cool-Fusion: Fuse Large Language Models without Training
Figure 3 for Cool-Fusion: Fuse Large Language Models without Training
Figure 4 for Cool-Fusion: Fuse Large Language Models without Training
Viaarxiv icon

MSP-MVS: Multi-granularity Segmentation Prior Guided Multi-View Stereo

Add code
Jul 27, 2024
Viaarxiv icon

NAMER: Non-Autoregressive Modeling for Handwritten Mathematical Expression Recognition

Add code
Jul 16, 2024
Viaarxiv icon

Multivector Neurons: Better and Faster O(n)-Equivariant Clifford Graph Neural Networks

Add code
Jun 06, 2024
Figure 1 for Multivector Neurons: Better and Faster O(n)-Equivariant Clifford Graph Neural Networks
Figure 2 for Multivector Neurons: Better and Faster O(n)-Equivariant Clifford Graph Neural Networks
Viaarxiv icon

ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization

Add code
May 23, 2024
Figure 1 for ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization
Figure 2 for ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization
Figure 3 for ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization
Figure 4 for ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization
Viaarxiv icon