Xing Cheng

Generation-Guided Multi-Level Unified Network for Video Grounding

Mar 14, 2023

SimViT: Exploring a Simple Vision Transformer with sliding windows

Dec 24, 2021

Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss

Sep 13, 2021

MlTr: Multi-label Classification with Transformer

Jun 11, 2021

CAT: Cross Attention in Vision Transformer

Jun 10, 2021