Jonathan Pilault

The Zamba2 Suite: Technical Report

Nov 22, 2024
Via arXiv

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Aug 09, 2024
Via arXiv

Zyda: A 1.3T Dataset for Open Language Modeling

Jun 04, 2024
Via arXiv

Zamba: A Compact 7B SSM Hybrid Model

May 26, 2024
Via arXiv

Course Correcting Koopman Representations

Oct 23, 2023
Via arXiv

On Conditional and Compositional Language Model Differentiable Prompting

Jul 04, 2023
Via arXiv

Block-State Transformer

Jun 15, 2023
Via arXiv

JaxPruner: A concise library for sparsity research

May 02, 2023
Via arXiv

Interactive-Chain-Prompting: Ambiguity Resolution for Crosslingual Conditional Generation with Interaction

Jan 24, 2023
Via arXiv

Using Graph Algorithms to Pretrain Graph Completion Transformers

Oct 14, 2022
Via arXiv