Picture for Samrat Phatale

Samrat Phatale

Improve Mathematical Reasoning in Language Models by Automated Process Supervision

Add code
Jun 05, 2024
Viaarxiv icon

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Add code
Mar 15, 2024
Viaarxiv icon

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Add code
Sep 01, 2023
Figure 1 for RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Figure 2 for RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Figure 3 for RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Figure 4 for RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Viaarxiv icon

Conversational Recommendation as Retrieval: A Simple, Strong Baseline

Add code
May 23, 2023
Viaarxiv icon

Prose for a Painting

Add code
Oct 08, 2019
Figure 1 for Prose for a Painting
Figure 2 for Prose for a Painting
Figure 3 for Prose for a Painting
Figure 4 for Prose for a Painting
Viaarxiv icon