Picture for Aditya Krishna Menon

Aditya Krishna Menon

Data61/CSIRO and the Australian National University

A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs

Add code
Oct 24, 2024
Figure 1 for A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs
Figure 2 for A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs
Figure 3 for A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs
Figure 4 for A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs
Viaarxiv icon

Efficient Document Ranking with Learnable Late Interactions

Add code
Jun 25, 2024
Figure 1 for Efficient Document Ranking with Learnable Late Interactions
Figure 2 for Efficient Document Ranking with Learnable Late Interactions
Figure 3 for Efficient Document Ranking with Learnable Late Interactions
Figure 4 for Efficient Document Ranking with Learnable Late Interactions
Viaarxiv icon

Faster Cascades via Speculative Decoding

Add code
May 29, 2024
Viaarxiv icon

Cascade-Aware Training of Language Models

Add code
May 29, 2024
Viaarxiv icon

Language Model Cascades: Token-level uncertainty and beyond

Add code
Apr 15, 2024
Viaarxiv icon

Metric-aware LLM inference

Add code
Mar 07, 2024
Viaarxiv icon

DistillSpec: Improving Speculative Decoding via Knowledge Distillation

Add code
Oct 12, 2023
Viaarxiv icon

What do larger image classifiers memorise?

Add code
Oct 09, 2023
Viaarxiv icon

Think before you speak: Training Language Models With Pause Tokens

Add code
Oct 03, 2023
Viaarxiv icon

The importance of feature preprocessing for differentially private linear optimization

Add code
Jul 19, 2023
Figure 1 for The importance of feature preprocessing for differentially private linear optimization
Figure 2 for The importance of feature preprocessing for differentially private linear optimization
Viaarxiv icon