Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers

Jan 01, 2021


