Dimitris Papailiopoulos

Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition

Oct 08, 2024

From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data

Jun 27, 2024

CHAI: Clustered Head Attention for Efficient LLM Inference

Mar 12, 2024

How Well Can Transformers Emulate In-context Newton's Method?

Mar 05, 2024

Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks

Feb 06, 2024

Looped Transformers are Better at Learning Learning Algorithms

Nov 21, 2023

Predictive Pipelined Decoding: A Compute-Latency Trade-off for Exact LLM Decoding

Jul 12, 2023

Mini-Batch Optimization of Contrastive Loss

Jul 12, 2023

Teaching Arithmetic to Small Transformers

Jul 07, 2023

Dissecting Chain-of-Thought: A Study on Compositional In-Context Learning of MLPs

May 30, 2023