Zisu Huang

Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation

Feb 03, 2026
Via arXiv

TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World Scenarios

Feb 02, 2026
Via arXiv

BatCoder: Self-Supervised Bidirectional Code-Documentation Learning via Back-Translation

Jan 30, 2026
Via arXiv

Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction

Jan 08, 2026
Via arXiv

CSSG: Measuring Code Similarity with Semantic Graphs

Jan 07, 2026
Via arXiv

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Jan 07, 2026
Via arXiv

Enhancing Model Privacy in Federated Learning with Random Masking and Quantization

Aug 27, 2025
Via arXiv

Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning

Jun 04, 2025
Via arXiv

RECAST: Strengthening LLMs' Complex Instruction Following with Constraint-Verifiable Data

May 25, 2025
Via arXiv

Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query Refinement

Jul 01, 2024
Via arXiv