Picture for Caiming Xiong

Caiming Xiong

Salesforce AI Research

Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts

Add code
Jan 23, 2026
Viaarxiv icon

From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models

Add code
Jan 22, 2026
Viaarxiv icon

Agentic Uncertainty Quantification

Add code
Jan 22, 2026
Viaarxiv icon

Agentic Confidence Calibration

Add code
Jan 22, 2026
Viaarxiv icon

MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks

Add code
Jan 21, 2026
Viaarxiv icon

Future Optical Flow Prediction Improves Robot Control & Video Generation

Add code
Jan 15, 2026
Viaarxiv icon

SweRank+: Multilingual, Multi-Turn Code Ranking for Software Issue Localization

Add code
Dec 23, 2025
Viaarxiv icon

Robotic VLA Benefits from Joint Learning with Motion Image Diffusion

Add code
Dec 19, 2025
Viaarxiv icon

LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering

Add code
Nov 17, 2025
Figure 1 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Figure 2 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Figure 3 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Figure 4 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Viaarxiv icon

SSR: Socratic Self-Refine for Large Language Model Reasoning

Add code
Nov 13, 2025
Figure 1 for SSR: Socratic Self-Refine for Large Language Model Reasoning
Figure 2 for SSR: Socratic Self-Refine for Large Language Model Reasoning
Figure 3 for SSR: Socratic Self-Refine for Large Language Model Reasoning
Figure 4 for SSR: Socratic Self-Refine for Large Language Model Reasoning
Viaarxiv icon