Gerald Shen

HelpSteer2-Preference: Complementing Ratings with Preferences

Oct 02, 2024

Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image Diffusion Models

Sep 09, 2024

Nemotron-4 340B Technical Report

Jun 17, 2024

HelpSteer2: Open-source dataset for training top-performing reward models

Jun 12, 2024

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment

May 02, 2024