Picture for Prateek Yadav

Prateek Yadav

Glider: Global and Local Instruction-Driven Expert Router

Add code
Oct 09, 2024
Viaarxiv icon

What Matters for Model Merging at Scale?

Add code
Oct 04, 2024
Viaarxiv icon

A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning

Add code
Aug 13, 2024
Figure 1 for A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning
Viaarxiv icon

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Add code
Jun 26, 2024
Viaarxiv icon

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Add code
Mar 30, 2024
Figure 1 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Figure 2 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Figure 3 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Figure 4 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Viaarxiv icon

ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization

Add code
Nov 22, 2023
Viaarxiv icon

D2 Pruning: Message Passing for Balancing Diversity and Difficulty in Data Pruning

Add code
Oct 11, 2023
Viaarxiv icon

Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy

Add code
Oct 02, 2023
Viaarxiv icon

Exploring Continual Learning for Code Generation Models

Add code
Jul 05, 2023
Viaarxiv icon

Resolving Interference When Merging Models

Add code
Jun 02, 2023
Viaarxiv icon