SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks

Add code
Oct 18, 2023
Figure 1 for SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks
Figure 2 for SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks
Figure 3 for SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks
Figure 4 for SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: