Picture for Shengnan Wang

Shengnan Wang

SIFM: A Foundation Model for Multi-granularity Arctic Sea Ice Forecasting

Add code
Oct 16, 2024
Viaarxiv icon

XL3M: A Training-free Framework for LLM Length Extension Based on Segment-wise Inference

Add code
May 28, 2024
Viaarxiv icon

BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences

Add code
Mar 14, 2024
Viaarxiv icon

Digital twin-assisted three-dimensional electrical capacitance tomography for multiphase flow imaging

Add code
Dec 22, 2023
Viaarxiv icon

Progressively Stacking 2.0: A Multi-stage Layerwise Training Method for BERT Training Speedup

Add code
Nov 27, 2020
Figure 1 for Progressively Stacking 2.0: A Multi-stage Layerwise Training Method for BERT Training Speedup
Figure 2 for Progressively Stacking 2.0: A Multi-stage Layerwise Training Method for BERT Training Speedup
Figure 3 for Progressively Stacking 2.0: A Multi-stage Layerwise Training Method for BERT Training Speedup
Figure 4 for Progressively Stacking 2.0: A Multi-stage Layerwise Training Method for BERT Training Speedup
Viaarxiv icon

CoRe: An Efficient Coarse-refined Training Framework for BERT

Add code
Nov 27, 2020
Figure 1 for CoRe: An Efficient Coarse-refined Training Framework for BERT
Figure 2 for CoRe: An Efficient Coarse-refined Training Framework for BERT
Figure 3 for CoRe: An Efficient Coarse-refined Training Framework for BERT
Figure 4 for CoRe: An Efficient Coarse-refined Training Framework for BERT
Viaarxiv icon