Gagan Bhatia

From RAG to Agentic RAG for Faithful Islamic Question Answering

Jan 12, 2026

Prototypicality Bias Reveals Blindspots in Multimodal Evaluation Metrics

Jan 08, 2026

Leveraging Vision-Language Pre-training for Human Activity Recognition in Still Images

Jun 16, 2025

Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning

May 22, 2025

DateLogicQA: Benchmarking Temporal Biases in Large Language Models

Dec 17, 2024

Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks

Nov 02, 2024

Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic

Jul 26, 2024

Qalam: A Multimodal LLM for Arabic Optical Character and Handwriting Recognition

Jul 18, 2024

Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks

Mar 01, 2024

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Feb 16, 2024