Picture for Gengrui Zhang

Gengrui Zhang

Learning Compact Representations of LLM Abilities via Item Response Theory

Add code
Oct 01, 2025
Viaarxiv icon

Cache-Aware Reinforcement Learning in Large-Scale Recommender Systems

Add code
Apr 23, 2024
Viaarxiv icon

UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution

Add code
Jan 12, 2024
Viaarxiv icon