Picture for Xian Li

Xian Li

Bridging Offline and Online Reinforcement Learning for LLMs

Add code
Jun 26, 2025
Viaarxiv icon

PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time

Add code
Jun 06, 2025
Viaarxiv icon

Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models

Add code
Jun 05, 2025
Viaarxiv icon

DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction

Add code
May 27, 2025
Viaarxiv icon

J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning

Add code
May 15, 2025
Viaarxiv icon

Scalable Multi-task Edge Sensing via Task-oriented Joint Information Gathering and Broadcast

Add code
Apr 16, 2025
Viaarxiv icon

Transferable Deployment of Semantic Edge Inference Systems via Unsupervised Domain Adaption

Add code
Apr 16, 2025
Viaarxiv icon

Pillar-Voxel Fusion Network for 3D Object Detection in Airborne Hyperspectral Point Clouds

Add code
Apr 13, 2025
Viaarxiv icon

Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models

Add code
Mar 19, 2025
Viaarxiv icon

SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks

Add code
Mar 19, 2025
Viaarxiv icon