Carolyne Pelletier

Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs

Mar 19, 2025
Via arXiv

Subtle Misogyny Detection and Mitigation: An Expert-Annotated Dataset

Nov 15, 2023
Via arXiv