Beren Millidge

University of Oxford

Mixture-of-PageRanks: Replacing Long-Context with Real-Time, Sparse GraphRAG

Dec 08, 2024

The Zamba2 Suite: Technical Report

Nov 22, 2024

Zyda-2: a 5 Trillion Token High-Quality Dataset

Nov 09, 2024

Exploring Action-Centric Representations Through the Lens of Rate-Distortion Theory

Sep 13, 2024

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Aug 09, 2024

Zyda: A 1.3T Dataset for Open Language Modeling

Jun 04, 2024

Toward Conversational Agents with Context and Time Sensitive Long-term Memory

May 29, 2024

Zamba: A Compact 7B SSM Hybrid Model

May 26, 2024

Associative Memories in the Feature Space

Feb 16, 2024

BlackMamba: Mixture of Experts for State-Space Models

Feb 01, 2024