Huizi Mao

NVIDIA Nemotron 3: Efficient and Open Intelligence

Dec 24, 2025

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Dec 23, 2025

BLASST: Dynamic BLocked Attention Sparsity via Softmax Thresholding

Dec 12, 2025

VILA: On Pre-training for Visual Language Models

Dec 14, 2023

BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

May 26, 2022

PatchNet -- Short-range Template Matching for Efficient Video Processing

Mar 10, 2021

A Delay Metric for Video Object Detection: What Average Precision Fails to Tell

Aug 18, 2019

CaTDet: Cascaded Tracked Detector for Efficient Object Detection from Video

Sep 30, 2018

Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training

Feb 05, 2018

Exploring the Regularity of Sparse Structure in Convolutional Neural Networks

Jun 05, 2017