Picture for Gordon Euhyun Moon

Gordon Euhyun Moon

SPION: Layer-Wise Sparse Training of Transformer via Convolutional Flood Filling

Add code
Sep 22, 2023
Viaarxiv icon

Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences

Add code
Mar 07, 2022
Figure 1 for Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences
Figure 2 for Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences
Figure 3 for Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences
Figure 4 for Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences
Viaarxiv icon