Picture for Manuel von Hochmeister

Manuel von Hochmeister

OLViT: Multi-Modal State Tracking via Attention-Based Embeddings for Video-Grounded Dialog

Add code
Feb 20, 2024
Viaarxiv icon