PolyViT: Co-training Vision Transformers on Images, Videos and Audio

Add code
Nov 25, 2021
Figure 1 for PolyViT: Co-training Vision Transformers on Images, Videos and Audio
Figure 2 for PolyViT: Co-training Vision Transformers on Images, Videos and Audio
Figure 3 for PolyViT: Co-training Vision Transformers on Images, Videos and Audio
Figure 4 for PolyViT: Co-training Vision Transformers on Images, Videos and Audio

Share this with someone who'll enjoy it:

View paper onarxiv iconopen_review iconOpenReview

Share this with someone who'll enjoy it: