Picture for Dima Damen

Dima Damen

Scaling 4D Representations

Add code
Dec 19, 2024
Figure 1 for Scaling 4D Representations
Figure 2 for Scaling 4D Representations
Figure 3 for Scaling 4D Representations
Figure 4 for Scaling 4D Representations
Viaarxiv icon

EgoPoints: Advancing Point Tracking for Egocentric Videos

Add code
Dec 05, 2024
Figure 1 for EgoPoints: Advancing Point Tracking for Egocentric Videos
Figure 2 for EgoPoints: Advancing Point Tracking for Egocentric Videos
Figure 3 for EgoPoints: Advancing Point Tracking for Egocentric Videos
Figure 4 for EgoPoints: Advancing Point Tracking for Egocentric Videos
Viaarxiv icon

ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions

Add code
Dec 02, 2024
Viaarxiv icon

Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmark

Add code
Nov 29, 2024
Viaarxiv icon

Context-Aware Multimodal Pretraining

Add code
Nov 22, 2024
Viaarxiv icon

It's Just Another Day: Unique Video Captioning by Discriminative Prompting

Add code
Oct 15, 2024
Figure 1 for It's Just Another Day: Unique Video Captioning by Discriminative Prompting
Figure 2 for It's Just Another Day: Unique Video Captioning by Discriminative Prompting
Figure 3 for It's Just Another Day: Unique Video Captioning by Discriminative Prompting
Figure 4 for It's Just Another Day: Unique Video Captioning by Discriminative Prompting
Viaarxiv icon

AMEGO: Active Memory from long EGOcentric videos

Add code
Sep 17, 2024
Figure 1 for AMEGO: Active Memory from long EGOcentric videos
Figure 2 for AMEGO: Active Memory from long EGOcentric videos
Figure 3 for AMEGO: Active Memory from long EGOcentric videos
Figure 4 for AMEGO: Active Memory from long EGOcentric videos
Viaarxiv icon

Rank2Reward: Learning Shaped Reward Functions from Passive Video

Add code
Apr 23, 2024
Figure 1 for Rank2Reward: Learning Shaped Reward Functions from Passive Video
Figure 2 for Rank2Reward: Learning Shaped Reward Functions from Passive Video
Figure 3 for Rank2Reward: Learning Shaped Reward Functions from Passive Video
Figure 4 for Rank2Reward: Learning Shaped Reward Functions from Passive Video
Viaarxiv icon

HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision

Add code
Apr 15, 2024
Figure 1 for HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision
Figure 2 for HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision
Figure 3 for HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision
Figure 4 for HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision
Viaarxiv icon

TIM: A Time Interval Machine for Audio-Visual Action Recognition

Add code
Apr 09, 2024
Figure 1 for TIM: A Time Interval Machine for Audio-Visual Action Recognition
Figure 2 for TIM: A Time Interval Machine for Audio-Visual Action Recognition
Figure 3 for TIM: A Time Interval Machine for Audio-Visual Action Recognition
Figure 4 for TIM: A Time Interval Machine for Audio-Visual Action Recognition
Viaarxiv icon