Zhijiang Guo

When Silence Is Golden: Can LLMs Learn to Abstain in Temporal QA and Beyond?

Feb 04, 2026
Via arXiv

Accordion-Thinking: Self-Regulated Step Summaries for Efficient and Readable LLM Reasoning

Feb 03, 2026
Via arXiv

Merging Beyond: Streaming LLM Updates via Activation-Guided Rotations

Feb 03, 2026
Via arXiv

ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall

Oct 09, 2025
Via arXiv

When Inverse Data Outperforms: Exploring the Pitfalls of Mixed Data in Multi-Stage Fine-Tuning

Sep 16, 2025
Via arXiv

ClimateViz: A Benchmark for Statistical Reasoning and Fact Verification on Scientific Charts

Jun 11, 2025
Via arXiv

TreeReview: A Dynamic Tree of Questions Framework for Deep and Efficient LLM-based Scientific Peer Review

Jun 09, 2025
Via arXiv

TreeRPO: Tree Relative Policy Optimization

Jun 05, 2025
Via arXiv

SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving

May 29, 2025
Via arXiv

AVerImaTeC: A Dataset for Automatic Verification of Image-Text Claims with Evidence from the Web

May 23, 2025
Via arXiv