Picture for Zirui Liu

Zirui Liu

PaperScout: An Autonomous Agent for Academic Paper Search with Process-Aware Sequence-Level Policy Optimization

Add code
Jan 15, 2026
Viaarxiv icon

EntroCoT: Enhancing Chain-of-Thought via Adaptive Entropy-Guided Segmentation

Add code
Jan 08, 2026
Viaarxiv icon

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Add code
Nov 18, 2025
Viaarxiv icon

Towards Stable and Structured Time Series Generation with Perturbation-Aware Flow Matching

Add code
Nov 18, 2025
Figure 1 for Towards Stable and Structured Time Series Generation with Perturbation-Aware Flow Matching
Figure 2 for Towards Stable and Structured Time Series Generation with Perturbation-Aware Flow Matching
Figure 3 for Towards Stable and Structured Time Series Generation with Perturbation-Aware Flow Matching
Figure 4 for Towards Stable and Structured Time Series Generation with Perturbation-Aware Flow Matching
Viaarxiv icon

MemWeaver: A Hierarchical Memory from Textual Interactive Behaviors for Personalized Generation

Add code
Oct 09, 2025
Figure 1 for MemWeaver: A Hierarchical Memory from Textual Interactive Behaviors for Personalized Generation
Figure 2 for MemWeaver: A Hierarchical Memory from Textual Interactive Behaviors for Personalized Generation
Figure 3 for MemWeaver: A Hierarchical Memory from Textual Interactive Behaviors for Personalized Generation
Figure 4 for MemWeaver: A Hierarchical Memory from Textual Interactive Behaviors for Personalized Generation
Viaarxiv icon

Systematic Evaluation of Optimization Techniques for Long-Context Language Models

Add code
Aug 01, 2025
Viaarxiv icon

Automating Expert-Level Medical Reasoning Evaluation of Large Language Models

Add code
Jul 10, 2025
Viaarxiv icon

Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning

Add code
Jun 11, 2025
Viaarxiv icon

Are LLMs Reliable Translators of Logical Reasoning Across Lexically Diversified Contexts?

Add code
Jun 05, 2025
Viaarxiv icon

SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA

Add code
May 29, 2025
Viaarxiv icon