Picture for Ran Guo

Ran Guo

UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning

Add code
Aug 26, 2025
Figure 1 for UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning
Figure 2 for UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning
Figure 3 for UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning
Figure 4 for UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning
Viaarxiv icon

Ultra-Sparse Memory Network

Add code
Nov 19, 2024
Viaarxiv icon

OneFlow: Redesign the Distributed Deep Learning Framework from Scratch

Add code
Oct 29, 2021
Figure 1 for OneFlow: Redesign the Distributed Deep Learning Framework from Scratch
Figure 2 for OneFlow: Redesign the Distributed Deep Learning Framework from Scratch
Figure 3 for OneFlow: Redesign the Distributed Deep Learning Framework from Scratch
Figure 4 for OneFlow: Redesign the Distributed Deep Learning Framework from Scratch
Viaarxiv icon