Picture for Skanda Koppula

Skanda Koppula

PaliGemma: A versatile 3B VLM for transfer

Add code
Jul 10, 2024
Viaarxiv icon

TAPVid-3D: A Benchmark for Tracking Any Point in 3D

Add code
Jul 08, 2024
Viaarxiv icon

Memory Consolidation Enables Long-Context Video Understanding

Add code
Feb 08, 2024
Viaarxiv icon

BootsTAP: Bootstrapped Training for Tracking-Any-Point

Add code
Feb 01, 2024
Viaarxiv icon

A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames

Add code
Dec 12, 2023
Viaarxiv icon

Perception Test: A Diagnostic Benchmark for Multimodal Video Models

Add code
May 23, 2023
Viaarxiv icon

Lossless Adaptation of Pretrained Vision Models For Robotic Manipulation

Add code
Apr 13, 2023
Viaarxiv icon

Where Should I Spend My FLOPS? Efficiency Evaluations of Visual Pre-training Methods

Add code
Oct 06, 2022
Figure 1 for Where Should I Spend My FLOPS? Efficiency Evaluations of Visual Pre-training Methods
Figure 2 for Where Should I Spend My FLOPS? Efficiency Evaluations of Visual Pre-training Methods
Figure 3 for Where Should I Spend My FLOPS? Efficiency Evaluations of Visual Pre-training Methods
Figure 4 for Where Should I Spend My FLOPS? Efficiency Evaluations of Visual Pre-training Methods
Viaarxiv icon

Object discovery and representation networks

Add code
Mar 16, 2022
Figure 1 for Object discovery and representation networks
Figure 2 for Object discovery and representation networks
Figure 3 for Object discovery and representation networks
Figure 4 for Object discovery and representation networks
Viaarxiv icon

Hierarchical Perceiver

Add code
Feb 22, 2022
Figure 1 for Hierarchical Perceiver
Figure 2 for Hierarchical Perceiver
Figure 3 for Hierarchical Perceiver
Figure 4 for Hierarchical Perceiver
Viaarxiv icon