Picture for Rui Liu

Rui Liu

UI-KOBE: Knowledge-Oriented Behavior Exploration for Lightweight Graph-Guided GUI Agents

Add code
May 28, 2026
Viaarxiv icon

OmniInteract: Benchmarking Real-World Streaming Interaction for Real-Time Omnimodal Assistants

Add code
May 26, 2026
Viaarxiv icon

3D Gaussian Map with Open-Set Semantic Grouping for Vision-Language Navigation

Add code
May 26, 2026
Viaarxiv icon

CodecCap: High-Fidelity Codec-Inspired Residual Modeling for Dense Video Captioning

Add code
May 26, 2026
Viaarxiv icon

Not All Tokens Matter Equally: Dynamic In-context Vector Distillation with Decisive-Token Supervision for Long-form Medical Report Generation

Add code
May 26, 2026
Viaarxiv icon

Uncertainty-Aware Gaussian Map for Vision-Language Navigation

Add code
May 26, 2026
Viaarxiv icon

Beyond Text Prompts: Visual-to-Visual Generation as A Unified Paradigm

Add code
May 12, 2026
Viaarxiv icon

Self-Distilled Trajectory-Aware Boltzmann Modeling: Bridging the Training-Inference Discrepancy in Diffusion Language Models

Add code
May 12, 2026
Viaarxiv icon

Reinforcing Multimodal Reasoning Against Visual Degradation

Add code
May 10, 2026
Viaarxiv icon

DeltaRubric: Generative Multimodal Reward Modeling via Joint Planning and Verification

Add code
May 10, 2026
Viaarxiv icon