Picture for Zhiwei Xu

Zhiwei Xu

Bridging the Capability Gap: Joint Alignment Tuning for Harmonizing LLM-based Multi-Agent Systems

Add code
Sep 11, 2025
Viaarxiv icon

SLIM: Subtrajectory-Level Elimination for More Effective Reasoning

Add code
Aug 27, 2025
Viaarxiv icon

Graph of Verification: Structured Verification of LLM Reasoning with Directed Acyclic Graphs

Add code
Jun 14, 2025
Viaarxiv icon

NGENT: Next-Generation AI Agents Must Integrate Multi-Domain Abilities to Achieve Artificial General Intelligence

Add code
Apr 30, 2025
Viaarxiv icon

QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?

Add code
Apr 17, 2025
Figure 1 for QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?
Figure 2 for QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?
Figure 3 for QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?
Figure 4 for QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?
Viaarxiv icon

Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild

Add code
Apr 17, 2025
Viaarxiv icon

Let Me Grok for You: Accelerating Grokking via Embedding Transfer from a Weaker Model

Add code
Apr 17, 2025
Viaarxiv icon

Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition

Add code
Apr 14, 2025
Viaarxiv icon

Large Language Model Guided Self-Debugging Code Generation

Add code
Feb 05, 2025
Viaarxiv icon

Psychometric-Based Evaluation for Theorem Proving with Large Language Models

Add code
Feb 02, 2025
Figure 1 for Psychometric-Based Evaluation for Theorem Proving with Large Language Models
Figure 2 for Psychometric-Based Evaluation for Theorem Proving with Large Language Models
Figure 3 for Psychometric-Based Evaluation for Theorem Proving with Large Language Models
Figure 4 for Psychometric-Based Evaluation for Theorem Proving with Large Language Models
Viaarxiv icon