Gurpreet Gosal

Charles

Straight to Zero: Why Linearly Decaying the Learning Rate to Zero Works Best for LLMs

Feb 21, 2025

Bilingual Adaptation of Monolingual Foundation Models

Jul 13, 2024

Med42 -- Evaluating Fine-Tuning Strategies for Medical LLMs: Full-Parameter vs. Parameter-Efficient Approaches

Apr 23, 2024

Improving Resnet-9 Generalization Trained on Small Datasets

Sep 07, 2023

Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster

Apr 06, 2023