Picture for Roger Waleffe

Roger Waleffe

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Add code
Aug 21, 2025
Viaarxiv icon

Llama-Nemotron: Efficient Reasoning Models

Add code
May 02, 2025
Viaarxiv icon

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Add code
Apr 10, 2025
Viaarxiv icon

MLKV: Efficiently Scaling up Large Embedding Model Training with Disk-based Key-Value Storage

Add code
Apr 02, 2025
Viaarxiv icon

Armada: Memory-Efficient Distributed Training of Large-Scale Graph Neural Networks

Add code
Feb 25, 2025
Viaarxiv icon

GraphSnapShot: Graph Machine Learning Acceleration with Fast Storage and Retrieval

Add code
Jun 25, 2024
Figure 1 for GraphSnapShot: Graph Machine Learning Acceleration with Fast Storage and Retrieval
Figure 2 for GraphSnapShot: Graph Machine Learning Acceleration with Fast Storage and Retrieval
Figure 3 for GraphSnapShot: Graph Machine Learning Acceleration with Fast Storage and Retrieval
Figure 4 for GraphSnapShot: Graph Machine Learning Acceleration with Fast Storage and Retrieval
Viaarxiv icon

An Empirical Study of Mamba-based Language Models

Add code
Jun 12, 2024
Figure 1 for An Empirical Study of Mamba-based Language Models
Figure 2 for An Empirical Study of Mamba-based Language Models
Figure 3 for An Empirical Study of Mamba-based Language Models
Figure 4 for An Empirical Study of Mamba-based Language Models
Viaarxiv icon

Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models

Add code
Oct 15, 2023
Figure 1 for Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Figure 2 for Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Figure 3 for Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Figure 4 for Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Viaarxiv icon

Repeated Random Sampling for Minimizing the Time-to-Accuracy of Learning

Add code
May 28, 2023
Viaarxiv icon

Marius++: Large-Scale Training of Graph Neural Networks on a Single Machine

Add code
Feb 04, 2022
Figure 1 for Marius++: Large-Scale Training of Graph Neural Networks on a Single Machine
Figure 2 for Marius++: Large-Scale Training of Graph Neural Networks on a Single Machine
Figure 3 for Marius++: Large-Scale Training of Graph Neural Networks on a Single Machine
Figure 4 for Marius++: Large-Scale Training of Graph Neural Networks on a Single Machine
Viaarxiv icon