Archiki Prasad

Multi-Attribute Steering of Language Models via Targeted Intervention

Feb 18, 2025
Via arXiv

Learning to Generate Unit Tests for Automated Debugging

Feb 03, 2025
Via arXiv

Self-Consistency Preference Optimization

Nov 06, 2024
Via arXiv

LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits

Oct 02, 2024
Via arXiv

MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning

Sep 18, 2024
Via arXiv

AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge

Sep 11, 2024
Via arXiv

System-1.x: Learning to Balance Fast and Slow Planning with Language Models

Jul 19, 2024
Via arXiv

Soft Self-Consistency Improves Language Model Agents

Feb 20, 2024
Via arXiv

ReGAL: Refactoring Programs to Discover Generalizable Abstractions

Jan 29, 2024
Via arXiv

ADaPT: As-Needed Decomposition and Planning with Language Models

Nov 08, 2023
Via arXiv