Picture for Rich Harang

Rich Harang

NVIDIA Nemotron 3: Efficient and Open Intelligence

Add code
Dec 24, 2025
Viaarxiv icon

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Add code
Dec 23, 2025
Viaarxiv icon

Magnificent Minified Models

Add code
Jun 16, 2023
Figure 1 for Magnificent Minified Models
Figure 2 for Magnificent Minified Models
Figure 3 for Magnificent Minified Models
Figure 4 for Magnificent Minified Models
Viaarxiv icon

Catastrophic Forgetting in the Context of Model Updates

Add code
Jun 16, 2023
Viaarxiv icon