Dimitris Papailiopoulos

Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries

Dec 12, 2024
Via arXiv

Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition

Oct 08, 2024
Via arXiv

From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data

Jun 27, 2024
Via arXiv

CHAI: Clustered Head Attention for Efficient LLM Inference

Mar 12, 2024
Via arXiv

How Well Can Transformers Emulate In-context Newton's Method?

Mar 05, 2024
Via arXiv

Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks

Feb 06, 2024
Via arXiv

Looped Transformers are Better at Learning Learning Algorithms

Nov 21, 2023
Via arXiv

Predictive Pipelined Decoding: A Compute-Latency Trade-off for Exact LLM Decoding

Jul 12, 2023
Via arXiv

Mini-Batch Optimization of Contrastive Loss

Jul 12, 2023
Via arXiv

Teaching Arithmetic to Small Transformers

Jul 07, 2023
Via arXiv