Picture for Nakul Agarwal

Nakul Agarwal

ACE: Action Concept Enhancement of Video-Language Models in Procedural Videos

Add code
Nov 23, 2024
Viaarxiv icon

IIT Bombay Racing Driverless: Autonomous Driving Stack for Formula Student AI

Add code
Aug 12, 2024
Viaarxiv icon

M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models

Add code
Jul 19, 2024
Viaarxiv icon

Can't make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models

Add code
May 30, 2024
Viaarxiv icon

Multi-Objective Recommendation via Multivariate Policy Learning

Add code
May 03, 2024
Viaarxiv icon

Disentangled Neural Relational Inference for Interpretable Motion Prediction

Add code
Jan 07, 2024
Viaarxiv icon

Vamos: Versatile Action Models for Video Understanding

Add code
Nov 22, 2023
Viaarxiv icon

Object-centric Video Representation for Long-term Action Anticipation

Add code
Oct 31, 2023
Viaarxiv icon

Rank2Tell: A Multimodal Driving Dataset for Joint Importance Ranking and Reasoning

Add code
Sep 12, 2023
Viaarxiv icon

AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?

Add code
Jul 31, 2023
Viaarxiv icon