Picture for Lorenzo Torresani

Lorenzo Torresani

Semantic Compositions Enhance Vision-Language Contrastive Learning

Add code
Jul 01, 2024
Viaarxiv icon

Step Differences in Instructional Video

Add code
Apr 24, 2024
Viaarxiv icon

Video ReCap: Recursive Captioning of Hour-Long Videos

Add code
Feb 28, 2024
Viaarxiv icon

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Add code
Nov 30, 2023
Viaarxiv icon

Multiscale Video Pretraining for Long-Term Activity Forecasting

Add code
Jul 24, 2023
Viaarxiv icon

Learning to Ground Instructional Articles in Videos through Narrations

Add code
Jun 06, 2023
Viaarxiv icon

Open-world Instance Segmentation: Top-down Learning with Bottom-up Supervision

Add code
Mar 09, 2023
Viaarxiv icon

MINOTAUR: Multi-task Video Grounding From Multimodal Queries

Add code
Feb 16, 2023
Viaarxiv icon

Egocentric Video Task Translation @ Ego4D Challenge 2022

Add code
Feb 03, 2023
Viaarxiv icon

What You Say Is What You Show: Visual Narration Detection in Instructional Videos

Add code
Jan 05, 2023
Viaarxiv icon