Picture for Pranjal Aggarwal

Pranjal Aggarwal

RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

Add code
Apr 12, 2024
Viaarxiv icon

GEO: Generative Engine Optimization

Add code
Nov 16, 2023
Viaarxiv icon

AutoMix: Automatically Mixing Language Models

Add code
Oct 19, 2023
Viaarxiv icon

Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs

Add code
May 19, 2023
Viaarxiv icon

SemSup-XC: Semantic Supervision for Zero and Few-shot Extreme Classification

Add code
Jan 26, 2023
Viaarxiv icon