Picture for Lewei He

Lewei He

Enhancing Code LLMs with Reinforcement Learning in Code Generation

Add code
Dec 29, 2024
Viaarxiv icon

FALCON: Feedback-driven Adaptive Long/short-term memory reinforced Coding Optimization system

Add code
Oct 28, 2024
Viaarxiv icon

Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles

Add code
Sep 26, 2024
Figure 1 for Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles
Figure 2 for Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles
Figure 3 for Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles
Figure 4 for Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles
Viaarxiv icon