Picture for Dima Damen

Dima Damen

EgoPoints: Advancing Point Tracking for Egocentric Videos

Add code
Dec 05, 2024
Viaarxiv icon

ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions

Add code
Dec 02, 2024
Viaarxiv icon

Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmark

Add code
Nov 29, 2024
Viaarxiv icon

Context-Aware Multimodal Pretraining

Add code
Nov 22, 2024
Viaarxiv icon

It's Just Another Day: Unique Video Captioning by Discriminative Prompting

Add code
Oct 15, 2024
Figure 1 for It's Just Another Day: Unique Video Captioning by Discriminative Prompting
Figure 2 for It's Just Another Day: Unique Video Captioning by Discriminative Prompting
Figure 3 for It's Just Another Day: Unique Video Captioning by Discriminative Prompting
Figure 4 for It's Just Another Day: Unique Video Captioning by Discriminative Prompting
Viaarxiv icon

AMEGO: Active Memory from long EGOcentric videos

Add code
Sep 17, 2024
Viaarxiv icon

Rank2Reward: Learning Shaped Reward Functions from Passive Video

Add code
Apr 23, 2024
Figure 1 for Rank2Reward: Learning Shaped Reward Functions from Passive Video
Figure 2 for Rank2Reward: Learning Shaped Reward Functions from Passive Video
Figure 3 for Rank2Reward: Learning Shaped Reward Functions from Passive Video
Figure 4 for Rank2Reward: Learning Shaped Reward Functions from Passive Video
Viaarxiv icon

HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision

Add code
Apr 15, 2024
Figure 1 for HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision
Figure 2 for HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision
Figure 3 for HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision
Figure 4 for HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision
Viaarxiv icon

TIM: A Time Interval Machine for Audio-Visual Action Recognition

Add code
Apr 09, 2024
Figure 1 for TIM: A Time Interval Machine for Audio-Visual Action Recognition
Figure 2 for TIM: A Time Interval Machine for Audio-Visual Action Recognition
Figure 3 for TIM: A Time Interval Machine for Audio-Visual Action Recognition
Figure 4 for TIM: A Time Interval Machine for Audio-Visual Action Recognition
Viaarxiv icon

Spatial Cognition from Egocentric Video: Out of Sight, Not Out of Mind

Add code
Apr 07, 2024
Viaarxiv icon