Picture for Lorenzo Torresani

Lorenzo Torresani

TimeRefine: Temporal Grounding with Time Refining Video LLM

Add code
Dec 12, 2024
Viaarxiv icon

Semantic Compositions Enhance Vision-Language Contrastive Learning

Add code
Jul 01, 2024
Figure 1 for Semantic Compositions Enhance Vision-Language Contrastive Learning
Figure 2 for Semantic Compositions Enhance Vision-Language Contrastive Learning
Figure 3 for Semantic Compositions Enhance Vision-Language Contrastive Learning
Figure 4 for Semantic Compositions Enhance Vision-Language Contrastive Learning
Viaarxiv icon

Step Differences in Instructional Video

Add code
Apr 24, 2024
Viaarxiv icon

Video ReCap: Recursive Captioning of Hour-Long Videos

Add code
Feb 28, 2024
Viaarxiv icon

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Add code
Nov 30, 2023
Figure 1 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 2 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 3 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 4 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Viaarxiv icon

Multiscale Video Pretraining for Long-Term Activity Forecasting

Add code
Jul 24, 2023
Viaarxiv icon

Learning to Ground Instructional Articles in Videos through Narrations

Add code
Jun 06, 2023
Viaarxiv icon

Open-world Instance Segmentation: Top-down Learning with Bottom-up Supervision

Add code
Mar 09, 2023
Viaarxiv icon

MINOTAUR: Multi-task Video Grounding From Multimodal Queries

Add code
Feb 16, 2023
Figure 1 for MINOTAUR: Multi-task Video Grounding From Multimodal Queries
Figure 2 for MINOTAUR: Multi-task Video Grounding From Multimodal Queries
Figure 3 for MINOTAUR: Multi-task Video Grounding From Multimodal Queries
Figure 4 for MINOTAUR: Multi-task Video Grounding From Multimodal Queries
Viaarxiv icon

Egocentric Video Task Translation @ Ego4D Challenge 2022

Add code
Feb 03, 2023
Figure 1 for Egocentric Video Task Translation @ Ego4D Challenge 2022
Figure 2 for Egocentric Video Task Translation @ Ego4D Challenge 2022
Figure 3 for Egocentric Video Task Translation @ Ego4D Challenge 2022
Figure 4 for Egocentric Video Task Translation @ Ego4D Challenge 2022
Viaarxiv icon