Mizanur Rahman

Aligning Text, Code, and Vision: A Multi-Objective Reinforcement Learning Framework for Text-to-Visualization

Jan 08, 2026

Toward Generalized Detection of Synthetic Media: Limitations, Challenges, and the Path to Multimodal Solutions

Nov 14, 2025

Quantum-Classical Hybrid Framework for Zero-Day Time-Push GNSS Spoofing Detection

Aug 25, 2025

DashboardQA: Benchmarking Multimodal Agents for Question Answering on Interactive Dashboards

Aug 24, 2025

Vision-Based Localization and LLM-based Navigation for Indoor Environments

Aug 11, 2025

Grid2Guide: A* Enabled Small Language Model for Indoor Navigation

Aug 11, 2025

Evolution of ReID: From Early Methods to LLM Integration

Jun 16, 2025

Retrieval Augmented Generation-based Large Language Models for Bridging Transportation Cybersecurity Legal Knowledge Gaps

May 23, 2025

Judging the Judges: Can Large Vision-Language Models Fairly Evaluate Chart Comprehension and Reasoning?

May 13, 2025

ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering

Apr 10, 2025