Picture for Bin Wen

Bin Wen

Wan-Image: Pushing the Boundaries of Generative Visual Intelligence

Add code
Apr 21, 2026
Viaarxiv icon

Aligning Progress and Feasibility: A Neuro-Symbolic Dual Memory Framework for Long-Horizon LLM Agents

Add code
Apr 03, 2026
Viaarxiv icon

ContextRL: Enhancing MLLM's Knowledge Discovery Efficiency with Context-Augmented RL

Add code
Feb 26, 2026
Viaarxiv icon

Multi-Probe Zero Collision Hash (MPZCH): Mitigating Embedding Collisions and Enhancing Model Freshness in Large-Scale Recommenders

Add code
Feb 19, 2026
Viaarxiv icon

Breaking Data Efficiency Dilemma: A Federated and Augmented Learning Framework For Alzheimer's Disease Detection via Speech

Add code
Feb 16, 2026
Viaarxiv icon

UniRef-Image-Edit: Towards Scalable and Consistent Multi-Reference Image Editing

Add code
Feb 15, 2026
Viaarxiv icon

Spatial Chain-of-Thought: Bridging Understanding and Generation Models for Spatial Reasoning Generation

Add code
Feb 12, 2026
Viaarxiv icon

VideoTemp-o3: Harmonizing Temporal Grounding and Video Understanding in Agentic Thinking-with-Videos

Add code
Feb 08, 2026
Viaarxiv icon

Joint Reward Modeling: Internalizing Chain-of-Thought for Efficient Visual Reward Models

Add code
Feb 07, 2026
Viaarxiv icon

SpatialReward: Bridging the Perception Gap in Online RL for Image Editing via Explicit Spatial Reasoning

Add code
Feb 07, 2026
Viaarxiv icon