Picture for Renye Yan

Renye Yan

AdaMemento: Adaptive Memory-Assisted Policy Optimization for Reinforcement Learning

Add code
Oct 06, 2024
Viaarxiv icon

CopyLens: Dynamically Flagging Copyrighted Sub-Dataset Contributions to LLM Outputs

Add code
Oct 06, 2024
Figure 1 for CopyLens: Dynamically Flagging Copyrighted Sub-Dataset Contributions to LLM Outputs
Figure 2 for CopyLens: Dynamically Flagging Copyrighted Sub-Dataset Contributions to LLM Outputs
Figure 3 for CopyLens: Dynamically Flagging Copyrighted Sub-Dataset Contributions to LLM Outputs
Figure 4 for CopyLens: Dynamically Flagging Copyrighted Sub-Dataset Contributions to LLM Outputs
Viaarxiv icon

The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective

Add code
Aug 19, 2024
Viaarxiv icon

Transductive Off-policy Proximal Policy Optimization

Add code
Jun 06, 2024
Viaarxiv icon

Reflective Policy Optimization

Add code
Jun 06, 2024
Viaarxiv icon