Picture for Yusuf Aytar

Yusuf Aytar

A Short Note on Evaluating RepNet for Temporal Repetition Counting in Videos

Add code
Nov 13, 2024
Figure 1 for A Short Note on Evaluating RepNet for Temporal Repetition Counting in Videos
Figure 2 for A Short Note on Evaluating RepNet for Temporal Repetition Counting in Videos
Figure 3 for A Short Note on Evaluating RepNet for Temporal Repetition Counting in Videos
Viaarxiv icon

OVR: A Dataset for Open Vocabulary Temporal Repetition Counting in Videos

Add code
Jul 24, 2024
Viaarxiv icon

Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models

Add code
Jun 13, 2024
Viaarxiv icon

FlexCap: Generating Rich, Localized, and Flexible Captions in Images

Add code
Mar 18, 2024
Viaarxiv icon

Genie: Generative Interactive Environments

Add code
Feb 23, 2024
Viaarxiv icon

Learning from One Continuous Video Stream

Add code
Dec 01, 2023
Figure 1 for Learning from One Continuous Video Stream
Figure 2 for Learning from One Continuous Video Stream
Figure 3 for Learning from One Continuous Video Stream
Figure 4 for Learning from One Continuous Video Stream
Viaarxiv icon

RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation

Add code
Aug 31, 2023
Viaarxiv icon

RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation

Add code
Jun 20, 2023
Viaarxiv icon

TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement

Add code
Jun 14, 2023
Viaarxiv icon

Perception Test: A Diagnostic Benchmark for Multimodal Video Models

Add code
May 23, 2023
Viaarxiv icon