Shreyas Saxena

MediSwift: Efficient Sparse Pre-trained Biomedical Language Models

Mar 01, 2024
Via arXiv

Sparse Iso-FLOP Transformations for Maximizing Training Efficiency

Mar 25, 2023
Via arXiv

SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models

Mar 18, 2023
Via arXiv

Instance-Level Task Parameters: A Robust Multi-task Weighting Framework

Jun 11, 2021
Via arXiv

Training With Data Dependent Dynamic Learning Rates

May 27, 2021
Via arXiv

Dynamic curriculum learning via data parameters for noise robust keyword spotting

Feb 18, 2021
Via arXiv

Learning Soft Labels via Meta Learning

Sep 20, 2020
Via arXiv

Learning Unsupervised Visual Grounding Through Semantic Self-Supervision

Sep 05, 2018
Via arXiv

Convolutional Neural Fabrics

Jan 30, 2017
Via arXiv