Hongjin Lu

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences

Jan 25, 2024
Via arXiv