Picture for Zhixun Li

Zhixun Li

Exploring Reasoning Reward Model for Agents

Add code
Jan 29, 2026
Viaarxiv icon

Rethinking Sample Polarity in Reinforcement Learning with Verifiable Rewards

Add code
Dec 25, 2025
Viaarxiv icon

AdaTooler-V: Adaptive Tool-Use for Images and Videos

Add code
Dec 19, 2025
Figure 1 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Figure 2 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Figure 3 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Figure 4 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Viaarxiv icon

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Add code
Oct 21, 2025
Figure 1 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 2 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 3 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 4 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Viaarxiv icon

EraRAG: Efficient and Incremental Retrieval Augmented Generation for Growing Corpora

Add code
Jun 26, 2025
Viaarxiv icon

Can LLMs Alleviate Catastrophic Forgetting in Graph Continual Learning? A Systematic Study

Add code
May 24, 2025
Viaarxiv icon

Materials Generation in the Era of Artificial Intelligence: A Comprehensive Survey

Add code
May 22, 2025
Figure 1 for Materials Generation in the Era of Artificial Intelligence: A Comprehensive Survey
Figure 2 for Materials Generation in the Era of Artificial Intelligence: A Comprehensive Survey
Figure 3 for Materials Generation in the Era of Artificial Intelligence: A Comprehensive Survey
Figure 4 for Materials Generation in the Era of Artificial Intelligence: A Comprehensive Survey
Viaarxiv icon

IceBerg: Debiased Self-Training for Class-Imbalanced Node Classification

Add code
Feb 10, 2025
Figure 1 for IceBerg: Debiased Self-Training for Class-Imbalanced Node Classification
Figure 2 for IceBerg: Debiased Self-Training for Class-Imbalanced Node Classification
Figure 3 for IceBerg: Debiased Self-Training for Class-Imbalanced Node Classification
Figure 4 for IceBerg: Debiased Self-Training for Class-Imbalanced Node Classification
Viaarxiv icon

GDeR: Safeguarding Efficiency, Balancing, and Robustness via Prototypical Graph Pruning

Add code
Oct 17, 2024
Figure 1 for GDeR: Safeguarding Efficiency, Balancing, and Robustness via Prototypical Graph Pruning
Figure 2 for GDeR: Safeguarding Efficiency, Balancing, and Robustness via Prototypical Graph Pruning
Figure 3 for GDeR: Safeguarding Efficiency, Balancing, and Robustness via Prototypical Graph Pruning
Figure 4 for GDeR: Safeguarding Efficiency, Balancing, and Robustness via Prototypical Graph Pruning
Viaarxiv icon

Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems

Add code
Oct 03, 2024
Figure 1 for Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems
Figure 2 for Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems
Figure 3 for Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems
Figure 4 for Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems
Viaarxiv icon