Picture for Sunando Sengupta

Sunando Sengupta

STRIVE: Structured Spatiotemporal Exploration for Reinforcement Learning in Video Question Answering

Add code
Apr 02, 2026
Viaarxiv icon

Benchmarking at the Edge of Comprehension

Add code
Feb 15, 2026
Viaarxiv icon

DocSLM: A Small Vision-Language Model for Long Multimodal Document Understanding

Add code
Nov 17, 2025
Figure 1 for DocSLM: A Small Vision-Language Model for Long Multimodal Document Understanding
Figure 2 for DocSLM: A Small Vision-Language Model for Long Multimodal Document Understanding
Figure 3 for DocSLM: A Small Vision-Language Model for Long Multimodal Document Understanding
Figure 4 for DocSLM: A Small Vision-Language Model for Long Multimodal Document Understanding
Viaarxiv icon

Latent Directions: A Simple Pathway to Bias Mitigation in Generative AI

Add code
Jun 10, 2024
Figure 1 for Latent Directions: A Simple Pathway to Bias Mitigation in Generative AI
Figure 2 for Latent Directions: A Simple Pathway to Bias Mitigation in Generative AI
Figure 3 for Latent Directions: A Simple Pathway to Bias Mitigation in Generative AI
Figure 4 for Latent Directions: A Simple Pathway to Bias Mitigation in Generative AI
Viaarxiv icon

Cross-modal Spectrum Transformation Network For Acoustic Scene classification

Add code
Aug 13, 2021
Figure 1 for Cross-modal Spectrum Transformation Network For Acoustic Scene classification
Figure 2 for Cross-modal Spectrum Transformation Network For Acoustic Scene classification
Figure 3 for Cross-modal Spectrum Transformation Network For Acoustic Scene classification
Figure 4 for Cross-modal Spectrum Transformation Network For Acoustic Scene classification
Viaarxiv icon

Relighting Images in the Wild with a Self-Supervised Siamese Auto-Encoder

Add code
Dec 11, 2020
Figure 1 for Relighting Images in the Wild with a Self-Supervised Siamese Auto-Encoder
Figure 2 for Relighting Images in the Wild with a Self-Supervised Siamese Auto-Encoder
Figure 3 for Relighting Images in the Wild with a Self-Supervised Siamese Auto-Encoder
Figure 4 for Relighting Images in the Wild with a Self-Supervised Siamese Auto-Encoder
Viaarxiv icon