Picture for Minesh Mathew

Minesh Mathew

Understanding Video Scenes through Text: Insights from Text-based Video Question Answering

Add code
Sep 11, 2023
Viaarxiv icon

Reading Between the Lanes: Text VideoQA on the Road

Add code
Jul 08, 2023
Viaarxiv icon

Watching the News: Towards VideoQA Models that can Read

Add code
Nov 10, 2022
Viaarxiv icon

An empirical study of CTC based models for OCR of Indian languages

Add code
May 13, 2022
Figure 1 for An empirical study of CTC based models for OCR of Indian languages
Figure 2 for An empirical study of CTC based models for OCR of Indian languages
Figure 3 for An empirical study of CTC based models for OCR of Indian languages
Figure 4 for An empirical study of CTC based models for OCR of Indian languages
Viaarxiv icon

ICDAR 2021 Competition on Document VisualQuestion Answering

Add code
Nov 10, 2021
Figure 1 for ICDAR 2021 Competition on Document VisualQuestion Answering
Figure 2 for ICDAR 2021 Competition on Document VisualQuestion Answering
Figure 3 for ICDAR 2021 Competition on Document VisualQuestion Answering
Figure 4 for ICDAR 2021 Competition on Document VisualQuestion Answering
Viaarxiv icon

Asking questions on handwritten document collections

Add code
Oct 02, 2021
Figure 1 for Asking questions on handwritten document collections
Figure 2 for Asking questions on handwritten document collections
Figure 3 for Asking questions on handwritten document collections
Figure 4 for Asking questions on handwritten document collections
Viaarxiv icon

InfographicVQA

Add code
Apr 26, 2021
Figure 1 for InfographicVQA
Figure 2 for InfographicVQA
Figure 3 for InfographicVQA
Figure 4 for InfographicVQA
Viaarxiv icon

Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam

Add code
Apr 09, 2021
Figure 1 for Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam
Figure 2 for Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam
Figure 3 for Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam
Figure 4 for Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam
Viaarxiv icon

MMBERT: Multimodal BERT Pretraining for Improved Medical VQA

Add code
Apr 03, 2021
Figure 1 for MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
Figure 2 for MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
Figure 3 for MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
Figure 4 for MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
Viaarxiv icon

Document Visual Question Answering Challenge 2020

Add code
Aug 20, 2020
Figure 1 for Document Visual Question Answering Challenge 2020
Figure 2 for Document Visual Question Answering Challenge 2020
Figure 3 for Document Visual Question Answering Challenge 2020
Viaarxiv icon