Picture for Debidatta Dwibedi

Debidatta Dwibedi

A Short Note on Evaluating RepNet for Temporal Repetition Counting in Videos

Add code
Nov 13, 2024
Viaarxiv icon

Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation

Add code
Sep 24, 2024
Viaarxiv icon

OVR: A Dataset for Open Vocabulary Temporal Repetition Counting in Videos

Add code
Jul 24, 2024
Viaarxiv icon

Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers

Add code
Mar 19, 2024
Figure 1 for Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers
Figure 2 for Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers
Figure 3 for Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers
Figure 4 for Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers
Viaarxiv icon

FlexCap: Generating Rich, Localized, and Flexible Captions in Images

Add code
Mar 18, 2024
Viaarxiv icon

RT-H: Action Hierarchies Using Language

Add code
Mar 04, 2024
Viaarxiv icon

AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents

Add code
Jan 23, 2024
Viaarxiv icon

RoboVQA: Multimodal Long-Horizon Reasoning for Robotics

Add code
Nov 01, 2023
Viaarxiv icon

Q-Match: Self-Supervised Learning by Matching Distributions Induced by a Queue

Add code
Feb 22, 2023
Viaarxiv icon

Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations

Add code
May 12, 2022
Figure 1 for Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations
Figure 2 for Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations
Figure 3 for Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations
Figure 4 for Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations
Viaarxiv icon