Guangting Wang

Visual Perception by Large Language Model's Weights

May 30, 2024
Via arXiv

Multi-Modal Generative Embedding Model

May 29, 2024
Via arXiv

Correlation-Aware Deep Tracking

Mar 03, 2022
Via arXiv

When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism

Jan 26, 2022
Via arXiv

Learning Tracking Representations via Dual-Branch Fully Transformer Networks

Dec 05, 2021
Via arXiv

Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?

Sep 12, 2021
Via arXiv

A Battle of Network Structures: An Empirical Study of CNN, Transformer, and MLP

Aug 30, 2021
Via arXiv

Self-Supervised Visual Representations Learning by Contrastive Mask Prediction

Aug 18, 2021
Via arXiv

Unsupervised Visual Representation Learning by Tracking Patches in Video

May 06, 2021
Via arXiv

Tracking by Instance Detection: A Meta-Learning Approach

Apr 02, 2020
Via arXiv