Picture for David Wu

David Wu

NVIDIA

What Does the AI Doctor Value? Auditing Pluralism in the Clinical Ethics of Language Models

Add code
May 18, 2026
Viaarxiv icon

Scalable Training of Mixture-of-Experts Models with Megatron Core

Add code
Mar 10, 2026
Viaarxiv icon

MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core

Add code
Apr 21, 2025
Figure 1 for MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core
Figure 2 for MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core
Figure 3 for MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core
Figure 4 for MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core
Viaarxiv icon

Aligning LLMs with Domain Invariant Reward Models

Add code
Jan 01, 2025
Viaarxiv icon

Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning

Add code
Oct 31, 2024
Figure 1 for Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning
Figure 2 for Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning
Figure 3 for Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning
Figure 4 for Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning
Viaarxiv icon

DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction

Add code
Sep 16, 2024
Figure 1 for DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction
Figure 2 for DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction
Figure 3 for DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction
Figure 4 for DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction
Viaarxiv icon

The Virtues of Pessimism in Inverse Reinforcement Learning

Add code
Feb 08, 2024
Figure 1 for The Virtues of Pessimism in Inverse Reinforcement Learning
Figure 2 for The Virtues of Pessimism in Inverse Reinforcement Learning
Figure 3 for The Virtues of Pessimism in Inverse Reinforcement Learning
Figure 4 for The Virtues of Pessimism in Inverse Reinforcement Learning
Viaarxiv icon

Accelerating Inverse Reinforcement Learning with Expert Bootstrapping

Add code
Feb 04, 2024
Viaarxiv icon

The KiTS21 Challenge: Automatic segmentation of kidneys, renal tumors, and renal cysts in corticomedullary-phase CT

Add code
Jul 05, 2023
Figure 1 for The KiTS21 Challenge: Automatic segmentation of kidneys, renal tumors, and renal cysts in corticomedullary-phase CT
Figure 2 for The KiTS21 Challenge: Automatic segmentation of kidneys, renal tumors, and renal cysts in corticomedullary-phase CT
Figure 3 for The KiTS21 Challenge: Automatic segmentation of kidneys, renal tumors, and renal cysts in corticomedullary-phase CT
Figure 4 for The KiTS21 Challenge: Automatic segmentation of kidneys, renal tumors, and renal cysts in corticomedullary-phase CT
Viaarxiv icon

CryptOpt: Automatic Optimization of Straightline Code

Add code
May 31, 2023
Figure 1 for CryptOpt: Automatic Optimization of Straightline Code
Figure 2 for CryptOpt: Automatic Optimization of Straightline Code
Figure 3 for CryptOpt: Automatic Optimization of Straightline Code
Figure 4 for CryptOpt: Automatic Optimization of Straightline Code
Viaarxiv icon