Mattia Soldan

Compressed-Language Models for Understanding Compressed File Formats: a JPEG Exploration

May 27, 2024

Towards Automated Movie Trailer Generation

Apr 04, 2024

Boundary-Denoising for Video Activity Localization

Apr 06, 2023

Localizing Moments in Long Video Via Multimodal Guidance

Feb 26, 2023

Egocentric Video-Language Pretraining @ Ego4D Challenge 2022

Jul 04, 2022

Egocentric Video-Language Pretraining

Jun 03, 2022

MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions

Dec 01, 2021

VLG-Net: Video-Language Graph Matching Network for Video Grounding

Nov 19, 2020

Seq2Seq RNN based Gait Anomaly Detection from Smartphone Acquired Multimodal Motion Data

Nov 19, 2019

Temporal Localization of Moments in Video Collections with Natural Language

Jul 30, 2019