Picture for Xiaoxi Li

Xiaoxi Li

ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment

Add code
Mar 24, 2026
Viaarxiv icon

Deep Autocorrelation Modeling for Time-Series Forecasting: Progress and Prospects

Add code
Mar 20, 2026
Viaarxiv icon

CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks

Add code
Mar 19, 2026
Viaarxiv icon

OmniGAIA: Towards Native Omni-Modal AI Agents

Add code
Feb 26, 2026
Viaarxiv icon

Deep Time-series Forecasting Needs Kernelized Moment Balancing

Add code
Jan 31, 2026
Viaarxiv icon

TourPlanner: A Competitive Consensus Framework with Constraint-Gated Reinforcement Learning for Travel Planning

Add code
Jan 08, 2026
Viaarxiv icon

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Add code
Oct 24, 2025
Viaarxiv icon

Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search

Add code
Jul 03, 2025
Viaarxiv icon

Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation

Add code
Jun 26, 2025
Figure 1 for Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation
Figure 2 for Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation
Figure 3 for Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation
Viaarxiv icon

Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

Add code
May 22, 2025
Viaarxiv icon