João Carreira

DeepMind

Scaling 4D Representations

Dec 19, 2024
Via arXiv

TRecViT: A Recurrent Video Transformer

Dec 18, 2024
Via arXiv

Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmark

Nov 29, 2024
Via arXiv

TAPVid-3D: A Benchmark for Tracking Any Point in 3D

Jul 08, 2024
Via arXiv

BootsTAP: Bootstrapped Training for Tracking-Any-Point

Feb 01, 2024
Via arXiv

Perception Test 2023: A Summary of the First Challenge And Outcome

Dec 20, 2023
Via arXiv

Learning from One Continuous Video Stream

Dec 01, 2023
Via arXiv

Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video

Oct 12, 2023
Via arXiv

Perception Test: A Diagnostic Benchmark for Multimodal Video Models

May 23, 2023
Via arXiv

TAP-Vid: A Benchmark for Tracking Any Point in a Video

Nov 07, 2022
Via arXiv