Picture for Xiufeng Song

Xiufeng Song

UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines

Add code
Mar 26, 2025
Viaarxiv icon

Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector

Add code
Mar 26, 2025
Viaarxiv icon

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

Add code
Mar 20, 2025
Viaarxiv icon

On Learning Multi-Modal Forgery Representation for Diffusion Generated Video Detection

Add code
Oct 31, 2024
Viaarxiv icon