Picture for Alaaeldin El-Nouby

Alaaeldin El-Nouby

Multimodal Autoregressive Pre-training of Large Vision Encoders

Add code
Nov 21, 2024
Figure 1 for Multimodal Autoregressive Pre-training of Large Vision Encoders
Figure 2 for Multimodal Autoregressive Pre-training of Large Vision Encoders
Figure 3 for Multimodal Autoregressive Pre-training of Large Vision Encoders
Figure 4 for Multimodal Autoregressive Pre-training of Large Vision Encoders
Viaarxiv icon

DataComp-LM: In search of the next generation of training sets for language models

Add code
Jun 18, 2024
Figure 1 for DataComp-LM: In search of the next generation of training sets for language models
Figure 2 for DataComp-LM: In search of the next generation of training sets for language models
Figure 3 for DataComp-LM: In search of the next generation of training sets for language models
Figure 4 for DataComp-LM: In search of the next generation of training sets for language models
Viaarxiv icon

Scalable Pre-training of Large Autoregressive Image Models

Add code
Jan 16, 2024
Figure 1 for Scalable Pre-training of Large Autoregressive Image Models
Figure 2 for Scalable Pre-training of Large Autoregressive Image Models
Figure 3 for Scalable Pre-training of Large Autoregressive Image Models
Figure 4 for Scalable Pre-training of Large Autoregressive Image Models
Viaarxiv icon

ImageBind: One Embedding Space To Bind Them All

Add code
May 09, 2023
Viaarxiv icon

DINOv2: Learning Robust Visual Features without Supervision

Add code
Apr 14, 2023
Figure 1 for DINOv2: Learning Robust Visual Features without Supervision
Figure 2 for DINOv2: Learning Robust Visual Features without Supervision
Figure 3 for DINOv2: Learning Robust Visual Features without Supervision
Figure 4 for DINOv2: Learning Robust Visual Features without Supervision
Viaarxiv icon

Are Visual Recognition Models Robust to Image Compression?

Add code
Apr 10, 2023
Viaarxiv icon

Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models

Add code
Jan 28, 2023
Viaarxiv icon

Image Compression with Product Quantized Masked Image Modeling

Add code
Dec 14, 2022
Viaarxiv icon

OmniMAE: Single Model Masked Pretraining on Images and Videos

Add code
Jun 16, 2022
Figure 1 for OmniMAE: Single Model Masked Pretraining on Images and Videos
Figure 2 for OmniMAE: Single Model Masked Pretraining on Images and Videos
Figure 3 for OmniMAE: Single Model Masked Pretraining on Images and Videos
Figure 4 for OmniMAE: Single Model Masked Pretraining on Images and Videos
Viaarxiv icon

Three things everyone should know about Vision Transformers

Add code
Mar 18, 2022
Figure 1 for Three things everyone should know about Vision Transformers
Figure 2 for Three things everyone should know about Vision Transformers
Figure 3 for Three things everyone should know about Vision Transformers
Figure 4 for Three things everyone should know about Vision Transformers
Viaarxiv icon