Zhewei Yao

Learning to Hint for Reinforcement Learning

Apr 01, 2026

Learning to Self-Evolve

Mar 19, 2026

DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science

Feb 27, 2026

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Feb 11, 2026

ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback

Mar 25, 2025

CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation

Dec 19, 2024

Inference Scaling for Bridging Retrieval and Augmented Generation

Dec 14, 2024

SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation

Oct 04, 2024

STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning

Sep 10, 2024

AI and Memory Wall

Mar 21, 2024