Xianfeng Tang

How Far Are LLMs from Professional Poker Players? Revisiting Game-Theoretic Reasoning with Agentic Tool Use

Jan 31, 2026

Position: Agentic Evolution is the Path to Evolving LLMs

Jan 30, 2026

Trajectory2Task: Training Robust Tool-Calling Agents with Synthesized Yet Verifiable Data for Complex User Intents

Jan 28, 2026

Agentic Reasoning for Large Language Models

Jan 18, 2026

All You Need Are Random Visual Tokens? Demystifying Token Pruning in VLLMs

Dec 08, 2025

MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs

Oct 29, 2025

TRAJECT-Bench: A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use

Oct 06, 2025

Can Large Language Models Adequately Perform Symbolic Reasoning Over Time Series?

Aug 05, 2025

Bradley-Terry and Multi-Objective Reward Modeling Are Complementary

Jul 10, 2025

RRO: LLM Agent Optimization Through Rising Reward Trajectories

May 27, 2025