Jiafeng Guo

SAW: Stage-Aware Dynamic Weighting for Multi-Objective Reinforcement Learning in Large Language Models

Jun 05, 2026
Via arXiv

Code-on-Graph: Iterative Programmatic Reasoning via Large Language Models on Knowledge Graphs

Jun 02, 2026
Via arXiv

Can LLM Rerankers Predict Their Own Ranking Performance?

Jun 02, 2026
Via arXiv

EGAD: Entropy-Guided Adaptive Distillation for Token-Level Knowledge Transfer

May 03, 2026
Via arXiv

Detoxification for LLM: From Dataset Itself

Apr 21, 2026
Via arXiv

AdversarialCoT: Single-Document Retrieval Poisoning for LLM Reasoning

Apr 14, 2026
Via arXiv

Beyond Relevance: Utility-Centric Retrieval in the LLM Era

Apr 10, 2026
Via arXiv

Towards Knowledgeable Deep Research: Framework and Benchmark

Apr 09, 2026
Via arXiv

Data, Not Model: Explaining Bias toward LLM Texts in Neural Retrievers

Apr 07, 2026
Via arXiv

HighlightBench: Benchmarking Markup-Driven Table Reasoning in Scientific Documents

Mar 25, 2026
Via arXiv