Picture for Nikhil Sardana

Nikhil Sardana

Sparse Upcycling: Inference Inefficient Finetuning

Add code
Nov 13, 2024
Viaarxiv icon

MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining

Add code
Jan 16, 2024
Viaarxiv icon

Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws

Add code
Dec 31, 2023
Figure 1 for Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
Figure 2 for Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
Figure 3 for Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
Figure 4 for Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
Viaarxiv icon

Autonomous Reinforcement Learning: Formalism and Benchmarking

Add code
Dec 17, 2021
Figure 1 for Autonomous Reinforcement Learning: Formalism and Benchmarking
Figure 2 for Autonomous Reinforcement Learning: Formalism and Benchmarking
Figure 3 for Autonomous Reinforcement Learning: Formalism and Benchmarking
Figure 4 for Autonomous Reinforcement Learning: Formalism and Benchmarking
Viaarxiv icon

Bayesian Meta-Learning Through Variational Gaussian Processes

Add code
Oct 21, 2021
Figure 1 for Bayesian Meta-Learning Through Variational Gaussian Processes
Figure 2 for Bayesian Meta-Learning Through Variational Gaussian Processes
Figure 3 for Bayesian Meta-Learning Through Variational Gaussian Processes
Figure 4 for Bayesian Meta-Learning Through Variational Gaussian Processes
Viaarxiv icon