Picture for Shenghai Yuan

Shenghai Yuan

Audio Array-Based 3D UAV Trajectory Estimation with LiDAR Pseudo-Labeling

Add code
Dec 17, 2024
Viaarxiv icon

Unsupervised UAV 3D Trajectories Estimation with Sparse Point Clouds

Add code
Dec 17, 2024
Viaarxiv icon

An Efficient Scene Coordinate Encoding and Relocalization Method

Add code
Dec 09, 2024
Viaarxiv icon

Open-Sora Plan: Open-Source Large Video Generation Model

Add code
Nov 28, 2024
Figure 1 for Open-Sora Plan: Open-Source Large Video Generation Model
Figure 2 for Open-Sora Plan: Open-Source Large Video Generation Model
Figure 3 for Open-Sora Plan: Open-Source Large Video Generation Model
Figure 4 for Open-Sora Plan: Open-Source Large Video Generation Model
Viaarxiv icon

WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

Add code
Nov 26, 2024
Viaarxiv icon

Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Add code
Nov 26, 2024
Figure 1 for Identity-Preserving Text-to-Video Generation by Frequency Decomposition
Figure 2 for Identity-Preserving Text-to-Video Generation by Frequency Decomposition
Figure 3 for Identity-Preserving Text-to-Video Generation by Frequency Decomposition
Figure 4 for Identity-Preserving Text-to-Video Generation by Frequency Decomposition
Viaarxiv icon

Multiple noncooperative targets encirclement by relative distance-based positioning and neural antisynchronization control

Add code
Nov 13, 2024
Figure 1 for Multiple noncooperative targets encirclement by relative distance-based positioning and neural antisynchronization control
Figure 2 for Multiple noncooperative targets encirclement by relative distance-based positioning and neural antisynchronization control
Figure 3 for Multiple noncooperative targets encirclement by relative distance-based positioning and neural antisynchronization control
Figure 4 for Multiple noncooperative targets encirclement by relative distance-based positioning and neural antisynchronization control
Viaarxiv icon

AV-PedAware: Self-Supervised Audio-Visual Fusion for Dynamic Pedestrian Awareness

Add code
Nov 11, 2024
Viaarxiv icon

GPTR: Gaussian Process Trajectory Representation for Continuous-Time Motion Estimation

Add code
Oct 31, 2024
Viaarxiv icon

Robust Loop Closure by Textual Cues in Challenging Environments

Add code
Oct 21, 2024
Figure 1 for Robust Loop Closure by Textual Cues in Challenging Environments
Figure 2 for Robust Loop Closure by Textual Cues in Challenging Environments
Figure 3 for Robust Loop Closure by Textual Cues in Challenging Environments
Figure 4 for Robust Loop Closure by Textual Cues in Challenging Environments
Viaarxiv icon