Picture for Salman Khan

Salman Khan

Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework

Add code
Apr 07, 2026
Viaarxiv icon

CoME-VL: Scaling Complementary Multi-Encoder Vision-Language Learning

Add code
Apr 03, 2026
Viaarxiv icon

The Eleventh NTIRE 2026 Efficient Super-Resolution Challenge Report

Add code
Apr 03, 2026
Viaarxiv icon

CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare

Add code
Mar 25, 2026
Viaarxiv icon

WorldCache: Content-Aware Caching for Accelerated Video World Models

Add code
Mar 23, 2026
Viaarxiv icon

From Masks to Pixels and Meaning: A New Taxonomy, Benchmark, and Metrics for VLM Image Tampering

Add code
Mar 20, 2026
Viaarxiv icon

Latent-DARM: Bridging Discrete Diffusion And Autoregressive Models For Reasoning

Add code
Mar 10, 2026
Viaarxiv icon

See, Plan, Rewind: Progress-Aware Vision-Language-Action Models for Robust Robotic Manipulation

Add code
Mar 10, 2026
Viaarxiv icon

MediX-R1: Open Ended Medical Reinforcement Learning

Add code
Feb 26, 2026
Viaarxiv icon

Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device

Add code
Feb 24, 2026
Viaarxiv icon