Picture for Christoph Feichtenhofer

Christoph Feichtenhofer

Jack

An Empirical Study of Autoregressive Pre-training from Videos

Add code
Jan 09, 2025
Figure 1 for An Empirical Study of Autoregressive Pre-training from Videos
Figure 2 for An Empirical Study of Autoregressive Pre-training from Videos
Figure 3 for An Empirical Study of Autoregressive Pre-training from Videos
Figure 4 for An Empirical Study of Autoregressive Pre-training from Videos
Viaarxiv icon

Gaussian Masked Autoencoders

Add code
Jan 06, 2025
Figure 1 for Gaussian Masked Autoencoders
Figure 2 for Gaussian Masked Autoencoders
Figure 3 for Gaussian Masked Autoencoders
Figure 4 for Gaussian Masked Autoencoders
Viaarxiv icon

Altogether: Image Captioning via Re-aligning Alt-text

Add code
Oct 22, 2024
Figure 1 for Altogether: Image Captioning via Re-aligning Alt-text
Figure 2 for Altogether: Image Captioning via Re-aligning Alt-text
Figure 3 for Altogether: Image Captioning via Re-aligning Alt-text
Figure 4 for Altogether: Image Captioning via Re-aligning Alt-text
Viaarxiv icon

SAM 2: Segment Anything in Images and Videos

Add code
Aug 01, 2024
Figure 1 for SAM 2: Segment Anything in Images and Videos
Figure 2 for SAM 2: Segment Anything in Images and Videos
Figure 3 for SAM 2: Segment Anything in Images and Videos
Figure 4 for SAM 2: Segment Anything in Images and Videos
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

Window Attention is Bugged: How not to Interpolate Position Embeddings

Add code
Nov 09, 2023
Figure 1 for Window Attention is Bugged: How not to Interpolate Position Embeddings
Figure 2 for Window Attention is Bugged: How not to Interpolate Position Embeddings
Figure 3 for Window Attention is Bugged: How not to Interpolate Position Embeddings
Figure 4 for Window Attention is Bugged: How not to Interpolate Position Embeddings
Viaarxiv icon

Demystifying CLIP Data

Add code
Oct 02, 2023
Figure 1 for Demystifying CLIP Data
Figure 2 for Demystifying CLIP Data
Figure 3 for Demystifying CLIP Data
Figure 4 for Demystifying CLIP Data
Viaarxiv icon

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

Add code
Jun 01, 2023
Figure 1 for Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Figure 2 for Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Figure 3 for Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Figure 4 for Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Viaarxiv icon

Diffusion Models as Masked Autoencoders

Add code
Apr 06, 2023
Figure 1 for Diffusion Models as Masked Autoencoders
Figure 2 for Diffusion Models as Masked Autoencoders
Figure 3 for Diffusion Models as Masked Autoencoders
Figure 4 for Diffusion Models as Masked Autoencoders
Viaarxiv icon

On the Benefits of 3D Pose and Tracking for Human Action Recognition

Add code
Apr 03, 2023
Viaarxiv icon