Picture for Xiaodan Song

Xiaodan Song

Token Dropping for Efficient BERT Pretraining

Add code
Mar 24, 2022
Figure 1 for Token Dropping for Efficient BERT Pretraining
Figure 2 for Token Dropping for Efficient BERT Pretraining
Figure 3 for Token Dropping for Efficient BERT Pretraining
Figure 4 for Token Dropping for Efficient BERT Pretraining
Viaarxiv icon

Auto-scaling Vision Transformers without Training

Add code
Feb 27, 2022
Figure 1 for Auto-scaling Vision Transformers without Training
Figure 2 for Auto-scaling Vision Transformers without Training
Figure 3 for Auto-scaling Vision Transformers without Training
Figure 4 for Auto-scaling Vision Transformers without Training
Viaarxiv icon

A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation

Add code
Dec 17, 2021
Figure 1 for A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation
Figure 2 for A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation
Figure 3 for A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation
Figure 4 for A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation
Viaarxiv icon

Speeding up Deep Model Training by Sharing Weights and Then Unsharing

Add code
Oct 08, 2021
Figure 1 for Speeding up Deep Model Training by Sharing Weights and Then Unsharing
Figure 2 for Speeding up Deep Model Training by Sharing Weights and Then Unsharing
Figure 3 for Speeding up Deep Model Training by Sharing Weights and Then Unsharing
Figure 4 for Speeding up Deep Model Training by Sharing Weights and Then Unsharing
Viaarxiv icon

Efficient Scale-Permuted Backbone with Learned Resource Distribution

Add code
Oct 22, 2020
Figure 1 for Efficient Scale-Permuted Backbone with Learned Resource Distribution
Figure 2 for Efficient Scale-Permuted Backbone with Learned Resource Distribution
Figure 3 for Efficient Scale-Permuted Backbone with Learned Resource Distribution
Figure 4 for Efficient Scale-Permuted Backbone with Learned Resource Distribution
Viaarxiv icon

Go Wide, Then Narrow: Efficient Training of Deep Thin Networks

Add code
Jul 01, 2020
Figure 1 for Go Wide, Then Narrow: Efficient Training of Deep Thin Networks
Figure 2 for Go Wide, Then Narrow: Efficient Training of Deep Thin Networks
Figure 3 for Go Wide, Then Narrow: Efficient Training of Deep Thin Networks
Figure 4 for Go Wide, Then Narrow: Efficient Training of Deep Thin Networks
Viaarxiv icon

MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices

Add code
Apr 14, 2020
Figure 1 for MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
Figure 2 for MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
Figure 3 for MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
Figure 4 for MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
Viaarxiv icon

BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models

Add code
Mar 24, 2020
Figure 1 for BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models
Figure 2 for BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models
Figure 3 for BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models
Figure 4 for BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models
Viaarxiv icon

SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization

Add code
Dec 10, 2019
Figure 1 for SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization
Figure 2 for SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization
Figure 3 for SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization
Figure 4 for SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization
Viaarxiv icon

High Resolution Medical Image Analysis with Spatial Partitioning

Add code
Sep 12, 2019
Figure 1 for High Resolution Medical Image Analysis with Spatial Partitioning
Figure 2 for High Resolution Medical Image Analysis with Spatial Partitioning
Figure 3 for High Resolution Medical Image Analysis with Spatial Partitioning
Viaarxiv icon