Picture for Philip Torr

Philip Torr

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Add code
Feb 02, 2026
Viaarxiv icon

A Fragile Guardrail: Diffusion LLM's Safety Blessing and Its Failure Mode

Add code
Jan 30, 2026
Viaarxiv icon

The Alignment Curse: Cross-Modality Jailbreak Transfer in Omni-Models

Add code
Jan 30, 2026
Viaarxiv icon

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Add code
Jan 29, 2026
Viaarxiv icon

TouchGuide: Inference-Time Steering of Visuomotor Policies via Touch Guidance

Add code
Jan 28, 2026
Viaarxiv icon

Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge

Add code
Jan 26, 2026
Viaarxiv icon

Single LLM Debate, MoLaCE: Mixture of Latent Concept Experts Against Confirmation Bias

Add code
Dec 29, 2025
Viaarxiv icon

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Add code
Dec 18, 2025
Viaarxiv icon

Memory in the Age of AI Agents

Add code
Dec 15, 2025
Viaarxiv icon

Unforgotten Safety: Preserving Safety Alignment of Large Language Models with Continual Learning

Add code
Dec 10, 2025
Viaarxiv icon