Weihua Luo

AI Business, Alibaba Group

M$^2$: Dual-Memory Augmentation for Long-Horizon Web Agents via Trajectory Summarization and Insight Retrieval

Feb 28, 2026
Via arXiv

Building Autonomous GUI Navigation via Agentic-Q Estimation and Step-Wise Policy Optimization

Feb 14, 2026
Via arXiv

UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory

Feb 11, 2026
Via arXiv

Difficulty-Estimated Policy Optimization

Feb 06, 2026
Via arXiv

Table-as-Search: Formulate Long-Horizon Agentic Information Seeking as Table Completion

Feb 06, 2026
Via arXiv

A State-Transition Framework for Efficient LLM Reasoning

Feb 01, 2026
Via arXiv

Vectra: A New Metric, Dataset, and Model for Visual Quality Assessment in E-Commerce In-Image Machine Translation

Jan 31, 2026
Via arXiv

Finding the Translation Switch: Discovering and Exploiting the Task-Initiation Features in LLMs

Jan 16, 2026
Via arXiv

Triplets Better Than Pairs: Towards Stable and Effective Self-Play Fine-Tuning for LLMs

Jan 13, 2026
Via arXiv

Can LLMs Track Their Output Length? A Dynamic Feedback Mechanism for Precise Length Regulation

Jan 07, 2026
Via arXiv