Pradeep Dasigi

Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback

Oct 24, 2024
Via arXiv

Scalable Data Ablation Approximations for Language Models through Modular Training and Merging

Oct 21, 2024
Via arXiv

Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging

Oct 16, 2024
Via arXiv

OLMo: Accelerating the Science of Language Models

Feb 07, 2024
Via arXiv

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Nov 20, 2023
Via arXiv

Evaluating In-Context Learning of Libraries for Code Generation

Nov 16, 2023
Via arXiv

TRAM: Bridging Trust Regions and Sharpness Aware Minimization

Oct 05, 2023
Via arXiv

How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources

Jun 07, 2023
Via arXiv

Inference-time Re-ranker Relevance Feedback for Neural Information Retrieval

May 19, 2023
Via arXiv

LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization

Jan 30, 2023
Via arXiv