Picture for Karthik Narasimhan

Karthik Narasimhan

Can Language Models Solve Olympiad Programming?

Add code
Apr 16, 2024
Viaarxiv icon

RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

Add code
Apr 12, 2024
Viaarxiv icon

Language-Guided World Models: A Model-Based Approach to AI Control

Add code
Jan 24, 2024
Viaarxiv icon

QualEval: Qualitative Evaluation for Model Improvement

Add code
Nov 06, 2023
Viaarxiv icon

Progressively Efficient Learning

Add code
Oct 13, 2023
Figure 1 for Progressively Efficient Learning
Figure 2 for Progressively Efficient Learning
Figure 3 for Progressively Efficient Learning
Figure 4 for Progressively Efficient Learning
Viaarxiv icon

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

Add code
Oct 10, 2023
Viaarxiv icon

FireAct: Toward Language Agent Fine-tuning

Add code
Oct 09, 2023
Viaarxiv icon

Cognitive Architectures for Language Agents

Add code
Sep 05, 2023
Viaarxiv icon

Scaling Laws for Imitation Learning in NetHack

Add code
Jul 18, 2023
Figure 1 for Scaling Laws for Imitation Learning in NetHack
Figure 2 for Scaling Laws for Imitation Learning in NetHack
Figure 3 for Scaling Laws for Imitation Learning in NetHack
Figure 4 for Scaling Laws for Imitation Learning in NetHack
Viaarxiv icon

COLLIE: Systematic Construction of Constrained Text Generation Tasks

Add code
Jul 17, 2023
Figure 1 for COLLIE: Systematic Construction of Constrained Text Generation Tasks
Figure 2 for COLLIE: Systematic Construction of Constrained Text Generation Tasks
Figure 3 for COLLIE: Systematic Construction of Constrained Text Generation Tasks
Figure 4 for COLLIE: Systematic Construction of Constrained Text Generation Tasks
Viaarxiv icon