Vithursan Thangarasa

Self-Data Distillation for Recovering Quality in Pruned Large Language Models

Oct 15, 2024
Via arXiv

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Apr 18, 2024
Via arXiv

MediSwift: Efficient Sparse Pre-trained Biomedical Language Models

Mar 01, 2024
Via arXiv

Sparse Iso-FLOP Transformations for Maximizing Training Efficiency

Mar 25, 2023
Via arXiv

SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models

Mar 18, 2023
Via arXiv

RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network

Jun 28, 2022
Via arXiv

Memory Efficient 3D U-Net with Reversible Mobile Inverted Bottlenecks for Brain Tumor Segmentation

Apr 21, 2021
Via arXiv

Enabling Continual Learning with Differentiable Hebbian Plasticity

Jun 30, 2020
Via arXiv

Self-Paced Learning with Adaptive Deep Visual Embeddings

Jul 24, 2018
Via arXiv