Picture for Baibei Ji

Baibei Ji

MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

Add code
Jan 24, 2026
Viaarxiv icon

$\texttt{MemoryRewardBench}$: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

Add code
Jan 17, 2026
Viaarxiv icon

LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling

Add code
Oct 08, 2025
Viaarxiv icon

L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding?

Add code
Oct 03, 2024
Figure 1 for L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding?
Figure 2 for L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding?
Figure 3 for L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding?
Figure 4 for L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding?
Viaarxiv icon