Picture for Karthik Narasimhan

Karthik Narasimhan

Princeton University

LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks

Add code
Oct 16, 2024
Figure 1 for LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks
Figure 2 for LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks
Figure 3 for LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks
Figure 4 for LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks
Viaarxiv icon

An Annotated Dataset of Errors in Premodern Greek and Baselines for Detecting Them

Add code
Oct 14, 2024
Viaarxiv icon

EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges

Add code
Sep 24, 2024
Figure 1 for EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges
Figure 2 for EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges
Figure 3 for EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges
Figure 4 for EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges
Viaarxiv icon

LLMs are Superior Feedback Providers: Bootstrapping Reasoning for Lie Detection with Self-Generated Feedback

Add code
Aug 25, 2024
Viaarxiv icon

ShieldGemma: Generative AI Content Moderation Based on Gemma

Add code
Jul 31, 2024
Figure 1 for ShieldGemma: Generative AI Content Moderation Based on Gemma
Figure 2 for ShieldGemma: Generative AI Content Moderation Based on Gemma
Figure 3 for ShieldGemma: Generative AI Content Moderation Based on Gemma
Figure 4 for ShieldGemma: Generative AI Content Moderation Based on Gemma
Viaarxiv icon

PersonaGym: Evaluating Persona Agents and LLMs

Add code
Jul 29, 2024
Viaarxiv icon

$τ$-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains

Add code
Jun 17, 2024
Viaarxiv icon

Can Language Models Solve Olympiad Programming?

Add code
Apr 16, 2024
Viaarxiv icon

RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

Add code
Apr 12, 2024
Viaarxiv icon

Language-Guided World Models: A Model-Based Approach to AI Control

Add code
Jan 24, 2024
Figure 1 for Language-Guided World Models: A Model-Based Approach to AI Control
Figure 2 for Language-Guided World Models: A Model-Based Approach to AI Control
Figure 3 for Language-Guided World Models: A Model-Based Approach to AI Control
Figure 4 for Language-Guided World Models: A Model-Based Approach to AI Control
Viaarxiv icon