Picture for Nicolas Ballas

Nicolas Ballas

VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning

Add code
Oct 04, 2024
Figure 1 for VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning
Figure 2 for VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning
Figure 3 for VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning
Figure 4 for VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning
Viaarxiv icon

Modeling Caption Diversity in Contrastive Vision-Language Pretraining

Add code
Apr 30, 2024
Figure 1 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Figure 2 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Figure 3 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Figure 4 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Viaarxiv icon

Learning and Leveraging World Models in Visual Representation Learning

Add code
Mar 01, 2024
Figure 1 for Learning and Leveraging World Models in Visual Representation Learning
Figure 2 for Learning and Leveraging World Models in Visual Representation Learning
Figure 3 for Learning and Leveraging World Models in Visual Representation Learning
Figure 4 for Learning and Leveraging World Models in Visual Representation Learning
Viaarxiv icon

Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model

Add code
Dec 19, 2023
Viaarxiv icon

Discovering environments with XRM

Add code
Sep 28, 2023
Figure 1 for Discovering environments with XRM
Figure 2 for Discovering environments with XRM
Figure 3 for Discovering environments with XRM
Figure 4 for Discovering environments with XRM
Viaarxiv icon

Predicting masked tokens in stochastic locations improves masked image modeling

Add code
Jul 31, 2023
Figure 1 for Predicting masked tokens in stochastic locations improves masked image modeling
Figure 2 for Predicting masked tokens in stochastic locations improves masked image modeling
Figure 3 for Predicting masked tokens in stochastic locations improves masked image modeling
Figure 4 for Predicting masked tokens in stochastic locations improves masked image modeling
Viaarxiv icon

DINOv2: Learning Robust Visual Features without Supervision

Add code
Apr 14, 2023
Figure 1 for DINOv2: Learning Robust Visual Features without Supervision
Figure 2 for DINOv2: Learning Robust Visual Features without Supervision
Figure 3 for DINOv2: Learning Robust Visual Features without Supervision
Figure 4 for DINOv2: Learning Robust Visual Features without Supervision
Viaarxiv icon

A surprisingly simple technique to control the pretraining bias for better transfer: Expand or Narrow your representation

Add code
Apr 11, 2023
Figure 1 for A surprisingly simple technique to control the pretraining bias for better transfer: Expand or Narrow your representation
Figure 2 for A surprisingly simple technique to control the pretraining bias for better transfer: Expand or Narrow your representation
Figure 3 for A surprisingly simple technique to control the pretraining bias for better transfer: Expand or Narrow your representation
Figure 4 for A surprisingly simple technique to control the pretraining bias for better transfer: Expand or Narrow your representation
Viaarxiv icon

A Simple Recipe for Competitive Low-compute Self supervised Vision Models

Add code
Jan 23, 2023
Viaarxiv icon

Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture

Add code
Jan 19, 2023
Figure 1 for Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
Figure 2 for Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
Figure 3 for Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
Figure 4 for Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
Viaarxiv icon