Picture for Vikramjeet Das

Vikramjeet Das

Sample Efficient Reinforcement Learning from Human Feedback via Active Exploration

Add code
Dec 01, 2023
Viaarxiv icon

Kernelized Offline Contextual Dueling Bandits

Add code
Jul 21, 2023
Viaarxiv icon