Picture for Ankit Singh Rawat

Ankit Singh Rawat

A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs

Add code
Oct 24, 2024
Figure 1 for A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs
Figure 2 for A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs
Figure 3 for A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs
Figure 4 for A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs
Viaarxiv icon

A Statistical Framework for Data-dependent Retrieval-Augmented Models

Add code
Aug 27, 2024
Viaarxiv icon

Analysis of Plan-based Retrieval for Grounded Text Generation

Add code
Aug 20, 2024
Viaarxiv icon

Fine-grained Analysis of In-context Linear Estimation: Data, Architecture, and Beyond

Add code
Jul 13, 2024
Viaarxiv icon

Efficient Document Ranking with Learnable Late Interactions

Add code
Jun 25, 2024
Figure 1 for Efficient Document Ranking with Learnable Late Interactions
Figure 2 for Efficient Document Ranking with Learnable Late Interactions
Figure 3 for Efficient Document Ranking with Learnable Late Interactions
Figure 4 for Efficient Document Ranking with Learnable Late Interactions
Viaarxiv icon

Faster Cascades via Speculative Decoding

Add code
May 29, 2024
Viaarxiv icon

Cascade-Aware Training of Language Models

Add code
May 29, 2024
Viaarxiv icon

Language Model Cascades: Token-level uncertainty and beyond

Add code
Apr 15, 2024
Viaarxiv icon

Mechanics of Next Token Prediction with Self-Attention

Add code
Mar 12, 2024
Figure 1 for Mechanics of Next Token Prediction with Self-Attention
Figure 2 for Mechanics of Next Token Prediction with Self-Attention
Figure 3 for Mechanics of Next Token Prediction with Self-Attention
Figure 4 for Mechanics of Next Token Prediction with Self-Attention
Viaarxiv icon

From Self-Attention to Markov Models: Unveiling the Dynamics of Generative Transformers

Add code
Feb 21, 2024
Figure 1 for From Self-Attention to Markov Models: Unveiling the Dynamics of Generative Transformers
Figure 2 for From Self-Attention to Markov Models: Unveiling the Dynamics of Generative Transformers
Figure 3 for From Self-Attention to Markov Models: Unveiling the Dynamics of Generative Transformers
Figure 4 for From Self-Attention to Markov Models: Unveiling the Dynamics of Generative Transformers
Viaarxiv icon