A Mixture of $h-1$ Heads is Better than $h$ Heads

Add code
May 13, 2020
Figure 1 for A Mixture of $h-1$ Heads is Better than $h$ Heads
Figure 2 for A Mixture of $h-1$ Heads is Better than $h$ Heads
Figure 3 for A Mixture of $h-1$ Heads is Better than $h$ Heads
Figure 4 for A Mixture of $h-1$ Heads is Better than $h$ Heads

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: