Pradeep Dasigi

2 OLMo 2 Furious

Dec 31, 2024

HREF: Human Response-Guided Evaluation of Instruction Following in Language Models

Dec 20, 2024

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Nov 22, 2024

Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback

Oct 24, 2024

Scalable Data Ablation Approximations for Language Models through Modular Training and Merging

Oct 21, 2024

Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging

Oct 16, 2024

OLMo: Accelerating the Science of Language Models

Feb 07, 2024

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Nov 20, 2023

Evaluating In-Context Learning of Libraries for Code Generation

Nov 16, 2023

TRAM: Bridging Trust Regions and Sharpness Aware Minimization

Oct 05, 2023