Ran He

VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation

Oct 10, 2025
Via arXiv

Towards Robust Defense against Customization via Protective Perturbation Resistant to Diffusion-based Purification

Sep 19, 2025
Via arXiv

A Comprehensive Survey on Trustworthiness in Reasoning with Large Language Models

Sep 04, 2025
Via arXiv

InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing

Aug 19, 2025
Via arXiv

Adapting Vision-Language Models Without Labels: A Comprehensive Survey

Aug 07, 2025
Via arXiv

Test-Time Immunization: A Universal Defense Framework Against Jailbreaks for (Multimodal) Large Language Models

May 28, 2025
Via arXiv

HAD: Hybrid Architecture Distillation Outperforms Teacher in Genomic Sequence Modeling

May 27, 2025
Via arXiv

T^2Agent: A Tool-augmented Multimodal Misinformation Detection Agent with Monte Carlo Tree Search

May 26, 2025
Via arXiv

Breaking Complexity Barriers: High-Resolution Image Restoration with Rank Enhanced Linear Attention

May 22, 2025
Via arXiv

Unlocking the Potential of Difficulty Prior in RL-based Multimodal Reasoning

May 19, 2025
Via arXiv