Aran Komatsuzaki

ARB: Advanced Reasoning Benchmark for Large Language Models

Jul 28, 2023
Via arXiv

Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints

Dec 09, 2022
Via arXiv

LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs

Nov 03, 2021
Via arXiv

Current Limitations of Language Models: What You Need is Retrieval

Sep 15, 2020
Via arXiv

One Epoch Is All You Need

Jun 16, 2019
Via arXiv

Extractive Summary as Discrete Latent Variables

Nov 14, 2018
Via arXiv