Picture for Manuel Martin

Manuel Martin

Multi-modal Video Representation Alignment for Robust Self-supervised Driver Distraction Detection

Add code
Jun 01, 2026
Viaarxiv icon

Vision-language Models for Driver Monitoring Systems: A Driver Activity Description Dataset

Add code
Jun 01, 2026
Viaarxiv icon

FlowNar: Scalable Streaming Narration for Long-Form Videos

Add code
May 30, 2026
Viaarxiv icon

QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024

Add code
Jul 04, 2024
Figure 1 for QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024
Figure 2 for QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024
Figure 3 for QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024
Figure 4 for QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024
Viaarxiv icon

DiffAnt: Diffusion Models for Action Anticipation

Add code
Nov 27, 2023
Viaarxiv icon

A Survey on Deep Learning Techniques for Action Anticipation

Add code
Sep 29, 2023
Figure 1 for A Survey on Deep Learning Techniques for Action Anticipation
Figure 2 for A Survey on Deep Learning Techniques for Action Anticipation
Figure 3 for A Survey on Deep Learning Techniques for Action Anticipation
Figure 4 for A Survey on Deep Learning Techniques for Action Anticipation
Viaarxiv icon