Nicolas Ballas

VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning

Oct 04, 2024

Modeling Caption Diversity in Contrastive Vision-Language Pretraining

Apr 30, 2024

Learning and Leveraging World Models in Visual Representation Learning

Mar 01, 2024

Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model

Dec 19, 2023

Discovering environments with XRM

Sep 28, 2023

Predicting masked tokens in stochastic locations improves masked image modeling

Jul 31, 2023

DINOv2: Learning Robust Visual Features without Supervision

Apr 14, 2023

A surprisingly simple technique to control the pretraining bias for better transfer: Expand or Narrow your representation

Apr 11, 2023

A Simple Recipe for Competitive Low-compute Self supervised Vision Models

Jan 23, 2023

Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture

Jan 19, 2023