Picture for Bin Wen

Bin Wen

ContextRL: Enhancing MLLM's Knowledge Discovery Efficiency with Context-Augmented RL

Add code
Feb 26, 2026
Viaarxiv icon

Multi-Probe Zero Collision Hash (MPZCH): Mitigating Embedding Collisions and Enhancing Model Freshness in Large-Scale Recommenders

Add code
Feb 19, 2026
Viaarxiv icon

Breaking Data Efficiency Dilemma: A Federated and Augmented Learning Framework For Alzheimer's Disease Detection via Speech

Add code
Feb 16, 2026
Viaarxiv icon

UniRef-Image-Edit: Towards Scalable and Consistent Multi-Reference Image Editing

Add code
Feb 15, 2026
Viaarxiv icon

Spatial Chain-of-Thought: Bridging Understanding and Generation Models for Spatial Reasoning Generation

Add code
Feb 12, 2026
Viaarxiv icon

VideoTemp-o3: Harmonizing Temporal Grounding and Video Understanding in Agentic Thinking-with-Videos

Add code
Feb 08, 2026
Viaarxiv icon

Joint Reward Modeling: Internalizing Chain-of-Thought for Efficient Visual Reward Models

Add code
Feb 07, 2026
Viaarxiv icon

SpatialReward: Bridging the Perception Gap in Online RL for Image Editing via Explicit Spatial Reasoning

Add code
Feb 07, 2026
Viaarxiv icon

OpenOneRec Technical Report

Add code
Dec 31, 2025
Viaarxiv icon

LiveStar: Live Streaming Assistant for Real-World Online Video Understanding

Add code
Nov 07, 2025
Viaarxiv icon