Shuai Shao

OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention

Feb 05, 2026

Multimodal Generative Recommendation for Fusing Semantic and Collaborative Signals

Feb 03, 2026

MonoScale: Scaling Multi-Agent System with Monotonic Improvement

Jan 30, 2026

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Jan 26, 2026

The Why Behind the Action: Unveiling Internal Drivers via Agentic Attribution

Jan 21, 2026

Extreme Value Policy Optimization for Safe Reinforcement Learning

Jan 17, 2026

TAGRPO: Boosting GRPO on Image-to-Video Generation with Direct Trajectory Alignment

Jan 09, 2026

Rotate Your Character: Revisiting Video Diffusion Models for High-Quality 3D Character Generation

Jan 09, 2026

MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook

Sep 17, 2025

Style4D-Bench: A Benchmark Suite for 4D Stylization

Aug 26, 2025