Picture for Dimitris Papailiopoulos

Dimitris Papailiopoulos

Task Vectors in In-Context Learning: Emergence, Formation, and Benefit

Add code
Jan 16, 2025
Viaarxiv icon

Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries

Add code
Dec 12, 2024
Viaarxiv icon

Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition

Add code
Oct 08, 2024
Figure 1 for Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition
Figure 2 for Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition
Figure 3 for Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition
Figure 4 for Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition
Viaarxiv icon

From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data

Add code
Jun 27, 2024
Figure 1 for From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data
Figure 2 for From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data
Figure 3 for From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data
Figure 4 for From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data
Viaarxiv icon

CHAI: Clustered Head Attention for Efficient LLM Inference

Add code
Mar 12, 2024
Viaarxiv icon

How Well Can Transformers Emulate In-context Newton's Method?

Add code
Mar 05, 2024
Viaarxiv icon

Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks

Add code
Feb 06, 2024
Viaarxiv icon

Looped Transformers are Better at Learning Learning Algorithms

Add code
Nov 21, 2023
Viaarxiv icon

Mini-Batch Optimization of Contrastive Loss

Add code
Jul 12, 2023
Viaarxiv icon

Predictive Pipelined Decoding: A Compute-Latency Trade-off for Exact LLM Decoding

Add code
Jul 12, 2023
Viaarxiv icon