Shawn Tan

Stick-breaking Attention

Oct 23, 2024
Via arXiv

Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler

Aug 23, 2024
Via arXiv

Scattered Mixture-of-Experts Implementation

Mar 13, 2024
Via arXiv

CattleEyeView: A Multi-task Top-down View Cattle Dataset for Smarter Precision Livestock Farming

Dec 14, 2023
Via arXiv

Sparse Universal Transformer

Oct 11, 2023
Via arXiv

ModuleFormer: Learning Modular Large Language Models From Uncurated Data

Jun 07, 2023
Via arXiv

Recursive Top-Down Production for Sentence Generation with Latent Trees

Oct 09, 2020
Via arXiv

Ordered Memory

Nov 03, 2019
Via arXiv

Icentia11K: An Unsupervised Representation Learning Dataset for Arrhythmia Subtype Discovery

Oct 21, 2019
Via arXiv

Investigating Biases in Textual Entailment Datasets

Jun 23, 2019
Via arXiv