Dong Shen

EVLM: An Efficient Vision-Language Model for Visual Understanding

Jul 19, 2024
Via arXiv

ContentCTR: Frame-level Live Streaming Click-Through Rate Prediction with Multimodal Transformer

Jun 26, 2023
Via arXiv

Generation-Guided Multi-Level Unified Network for Video Grounding

Mar 14, 2023
Via arXiv

A Unified Model for Video Understanding and Knowledge Embedding with Heterogeneous Knowledge Graph Dataset

Nov 19, 2022
Via arXiv

Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss

Sep 13, 2021
Via arXiv

MlTr: Multi-label Classification with Transformer

Jun 11, 2021
Via arXiv

CAT: Cross Attention in Vision Transformer

Jun 10, 2021
Via arXiv

ES-Net: Erasing Salient Parts to Learn More in Re-Identification

Mar 10, 2021
Via arXiv

Complementary Pseudo Labels For Unsupervised Domain Adaptation On Person Re-identification

Feb 07, 2021
Via arXiv