Picture for Tat Seng Chua

Tat Seng Chua

MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding

Add code
Oct 25, 2024
Figure 1 for MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding
Figure 2 for MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding
Figure 3 for MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding
Figure 4 for MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding
Viaarxiv icon

Can I Trust Your Answer? Visually Grounded Video Question Answering

Add code
Sep 04, 2023
Viaarxiv icon

RDU: A Region-based Approach to Form-style Document Understanding

Add code
Jun 14, 2022
Figure 1 for RDU: A Region-based Approach to Form-style Document Understanding
Figure 2 for RDU: A Region-based Approach to Form-style Document Understanding
Figure 3 for RDU: A Region-based Approach to Form-style Document Understanding
Figure 4 for RDU: A Region-based Approach to Form-style Document Understanding
Viaarxiv icon