Pengjun Xie

Evidence-Augmented Policy Optimization with Reward Co-Evolution for Long-Context Reasoning

Jan 15, 2026
Via arXiv

ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking

Jan 10, 2026
Via arXiv

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Jan 08, 2026
Via arXiv

WebAnchor: Anchoring Agent Planning to Stabilize Long-Horizon Web Reasoning

Jan 07, 2026
Via arXiv

Nested Browser-Use Learning for Agentic Information Seeking

Dec 29, 2025
Via arXiv

AutoForge: Automated Environment Synthesis for Agentic Reinforcement Learning

Dec 28, 2025
Via arXiv

EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce

Dec 11, 2025
Via arXiv

IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction

Nov 10, 2025
Via arXiv

Towards Universal Video Retrieval: Generalizing Video Embedding via Synthesized Multimodal Pyramid Curriculum

Oct 31, 2025
Via arXiv

$\text{E}^2\text{Rank}$: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker

Oct 26, 2025
Via arXiv