Picture for Rui Liu

Rui Liu

TellWhisper: Tell Whisper Who Speaks When

Add code
Jan 08, 2026
Viaarxiv icon

MoE Adapter for Large Audio Language Models: Sparsity, Disentanglement, and Gradient-Conflict-Free

Add code
Jan 08, 2026
Viaarxiv icon

Causality-Aware Temporal Projection for Video Understanding in Video-LLMs

Add code
Jan 05, 2026
Viaarxiv icon

AEGIS: Exploring the Limit of World Knowledge Capabilities for Unified Mulitmodal Models

Add code
Jan 02, 2026
Viaarxiv icon

Scaling Reinforcement Learning for Content Moderation with Large Language Models

Add code
Dec 23, 2025
Figure 1 for Scaling Reinforcement Learning for Content Moderation with Large Language Models
Figure 2 for Scaling Reinforcement Learning for Content Moderation with Large Language Models
Figure 3 for Scaling Reinforcement Learning for Content Moderation with Large Language Models
Figure 4 for Scaling Reinforcement Learning for Content Moderation with Large Language Models
Viaarxiv icon

VisionDirector: Vision-Language Guided Closed-Loop Refinement for Generative Image Synthesis

Add code
Dec 22, 2025
Viaarxiv icon

Stable and Efficient Single-Rollout RL for Multimodal Reasoning

Add code
Dec 20, 2025
Figure 1 for Stable and Efficient Single-Rollout RL for Multimodal Reasoning
Figure 2 for Stable and Efficient Single-Rollout RL for Multimodal Reasoning
Figure 3 for Stable and Efficient Single-Rollout RL for Multimodal Reasoning
Figure 4 for Stable and Efficient Single-Rollout RL for Multimodal Reasoning
Viaarxiv icon

MMMamba: A Versatile Cross-Modal In Context Fusion Framework for Pan-Sharpening and Zero-Shot Image Enhancement

Add code
Dec 17, 2025
Viaarxiv icon

Route-DETR: Pairwise Query Routing in Transformers for Object Detection

Add code
Dec 15, 2025
Viaarxiv icon

OCCDiff: Occupancy Diffusion Model for High-Fidelity 3D Building Reconstruction from Noisy Point Clouds

Add code
Dec 09, 2025
Viaarxiv icon