Xiaodong Han

Vript: A Video Is Worth Thousands of Words

Jun 10, 2024

Improving Audio-Visual Segmentation with Bidirectional Generation

Aug 16, 2023

Scaling TransNormer to 175 Billion Parameters

Jul 27, 2023

Linearized Relative Positional Encoding

Jul 18, 2023

Toeplitz Neural Network for Sequence Modeling

May 08, 2023

Fine-grained Audible Video Description

Mar 27, 2023

Linear Video Transformer with Feature Fixation

Oct 15, 2022