Picture for Zhenwei Dai

Zhenwei Dai

How Do Latent Reasoning Methods Perform Under Weak and Strong Supervision?

Add code
Feb 25, 2026
Viaarxiv icon

How Far Are LLMs from Professional Poker Players? Revisiting Game-Theoretic Reasoning with Agentic Tool Use

Add code
Jan 31, 2026
Viaarxiv icon

Position: Agentic Evolution is the Path to Evolving LLMs

Add code
Jan 30, 2026
Viaarxiv icon

TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use

Add code
Oct 06, 2025
Figure 1 for TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
Figure 2 for TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
Figure 3 for TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
Figure 4 for TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
Viaarxiv icon

Bradley-Terry and Multi-Objective Reward Modeling Are Complementary

Add code
Jul 10, 2025
Viaarxiv icon

Cite Before You Speak: Enhancing Context-Response Grounding in E-commerce Conversational LLM-Agents

Add code
Mar 05, 2025
Viaarxiv icon

How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities

Add code
Feb 26, 2025
Viaarxiv icon

A General Framework to Enhance Fine-tuning-based LLM Unlearning

Add code
Feb 25, 2025
Figure 1 for A General Framework to Enhance Fine-tuning-based LLM Unlearning
Figure 2 for A General Framework to Enhance Fine-tuning-based LLM Unlearning
Figure 3 for A General Framework to Enhance Fine-tuning-based LLM Unlearning
Figure 4 for A General Framework to Enhance Fine-tuning-based LLM Unlearning
Viaarxiv icon

Stepwise Perplexity-Guided Refinement for Efficient Chain-of-Thought Reasoning in Large Language Models

Add code
Feb 18, 2025
Viaarxiv icon

SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains

Add code
Oct 23, 2024
Figure 1 for SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains
Figure 2 for SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains
Figure 3 for SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains
Figure 4 for SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains
Viaarxiv icon