Picture for Jiandong Gao

Jiandong Gao

Learning the Mechanism of Catastrophic Forgetting: A Perspective from Gradient Similarity

Add code
Jan 29, 2026
Viaarxiv icon

CoScale-RL: Efficient Post-Training by Co-Scaling Data and Computation

Add code
Jan 21, 2026
Viaarxiv icon

Towards Revealing the Effectiveness of Small-Scale Fine-tuning in R1-style Reinforcement Learning

Add code
May 23, 2025
Figure 1 for Towards Revealing the Effectiveness of Small-Scale Fine-tuning in R1-style Reinforcement Learning
Figure 2 for Towards Revealing the Effectiveness of Small-Scale Fine-tuning in R1-style Reinforcement Learning
Figure 3 for Towards Revealing the Effectiveness of Small-Scale Fine-tuning in R1-style Reinforcement Learning
Figure 4 for Towards Revealing the Effectiveness of Small-Scale Fine-tuning in R1-style Reinforcement Learning
Viaarxiv icon

Dynamic feature selection in medical predictive monitoring by reinforcement learning

Add code
May 30, 2024
Viaarxiv icon