Picture for Mahmoud Assran

Mahmoud Assran

An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels

Add code
Jun 13, 2024
Viaarxiv icon

Modeling Caption Diversity in Contrastive Vision-Language Pretraining

Add code
Apr 30, 2024
Figure 1 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Figure 2 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Figure 3 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Figure 4 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Viaarxiv icon

Learning and Leveraging World Models in Visual Representation Learning

Add code
Mar 01, 2024
Figure 1 for Learning and Leveraging World Models in Visual Representation Learning
Figure 2 for Learning and Leveraging World Models in Visual Representation Learning
Figure 3 for Learning and Leveraging World Models in Visual Representation Learning
Figure 4 for Learning and Leveraging World Models in Visual Representation Learning
Viaarxiv icon

Predicting masked tokens in stochastic locations improves masked image modeling

Add code
Jul 31, 2023
Figure 1 for Predicting masked tokens in stochastic locations improves masked image modeling
Figure 2 for Predicting masked tokens in stochastic locations improves masked image modeling
Figure 3 for Predicting masked tokens in stochastic locations improves masked image modeling
Figure 4 for Predicting masked tokens in stochastic locations improves masked image modeling
Viaarxiv icon

DINOv2: Learning Robust Visual Features without Supervision

Add code
Apr 14, 2023
Figure 1 for DINOv2: Learning Robust Visual Features without Supervision
Figure 2 for DINOv2: Learning Robust Visual Features without Supervision
Figure 3 for DINOv2: Learning Robust Visual Features without Supervision
Figure 4 for DINOv2: Learning Robust Visual Features without Supervision
Viaarxiv icon

Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture

Add code
Jan 19, 2023
Figure 1 for Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
Figure 2 for Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
Figure 3 for Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
Figure 4 for Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
Viaarxiv icon

The Hidden Uniform Cluster Prior in Self-Supervised Learning

Add code
Oct 13, 2022
Figure 1 for The Hidden Uniform Cluster Prior in Self-Supervised Learning
Figure 2 for The Hidden Uniform Cluster Prior in Self-Supervised Learning
Figure 3 for The Hidden Uniform Cluster Prior in Self-Supervised Learning
Figure 4 for The Hidden Uniform Cluster Prior in Self-Supervised Learning
Viaarxiv icon

Masked Siamese Networks for Label-Efficient Learning

Add code
Apr 14, 2022
Figure 1 for Masked Siamese Networks for Label-Efficient Learning
Figure 2 for Masked Siamese Networks for Label-Efficient Learning
Figure 3 for Masked Siamese Networks for Label-Efficient Learning
Figure 4 for Masked Siamese Networks for Label-Efficient Learning
Viaarxiv icon

Memory Augmented Optimizers for Deep Learning

Add code
Jun 20, 2021
Figure 1 for Memory Augmented Optimizers for Deep Learning
Figure 2 for Memory Augmented Optimizers for Deep Learning
Figure 3 for Memory Augmented Optimizers for Deep Learning
Figure 4 for Memory Augmented Optimizers for Deep Learning
Viaarxiv icon

Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples

Add code
Apr 28, 2021
Figure 1 for Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples
Figure 2 for Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples
Figure 3 for Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples
Figure 4 for Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples
Viaarxiv icon