Eslam Abdelrahman

Goldfish: Vision-Language Understanding of Arbitrarily Long Videos

Jul 17, 2024
Via arXiv

InfiniBench: A Comprehensive Benchmark for Large Multimodal Models in Very Long Video Understanding

Jun 28, 2024
Via arXiv

MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens

Apr 04, 2024
Via arXiv