Weidong Chen

Southern Methodist University

DrawSpeech: Expressive Speech Synthesis Using Prosodic Sketches as Control Conditions

Jan 08, 2025
Via arXiv

Graph Mixture of Experts and Memory-augmented Routers for Multivariate Time Series Anomaly Detection

Dec 26, 2024
Via arXiv

The Dawn of Video Generation: Preliminary Explorations with SORA-like Models

Oct 07, 2024
Via arXiv

Dual-path Collaborative Generation Network for Emotional Video Captioning

Aug 06, 2024
Via arXiv

SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds

Jul 16, 2024
Via arXiv

Multi-Modal UAV Detection, Classification and Tracking Algorithm -- Technical Report for CVPR 2024 UG2 Challenge

May 26, 2024
Via arXiv

Sentiment-oriented Transformer-based Variational Autoencoder Network for Live Video Commenting

Apr 19, 2024
Via arXiv

Incremental Joint Learning of Depth, Pose and Implicit Scene Representation on Monocular Camera in Large-scale Scenes

Apr 09, 2024
Via arXiv

NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising

Mar 29, 2024
Via arXiv

Compact 3D Gaussian Splatting For Dense Visual SLAM

Mar 17, 2024
Via arXiv