Picture for Makesh Narsimhan Sreedhar

Makesh Narsimhan Sreedhar

Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails

Add code
Jan 15, 2025
Viaarxiv icon

Unsupervised Extraction of Dialogue Policies from Conversations

Add code
Jun 21, 2024
Viaarxiv icon

Nemotron-4 340B Technical Report

Add code
Jun 17, 2024
Figure 1 for Nemotron-4 340B Technical Report
Figure 2 for Nemotron-4 340B Technical Report
Figure 3 for Nemotron-4 340B Technical Report
Figure 4 for Nemotron-4 340B Technical Report
Viaarxiv icon

HelpSteer2: Open-source dataset for training top-performing reward models

Add code
Jun 12, 2024
Viaarxiv icon

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Add code
Apr 04, 2024
Viaarxiv icon

HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM

Add code
Nov 16, 2023
Figure 1 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Figure 2 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Figure 3 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Figure 4 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Viaarxiv icon

Evolving Domain Adaptation of Pretrained Language Models for Text Classification

Add code
Nov 16, 2023
Viaarxiv icon

SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF

Add code
Oct 09, 2023
Viaarxiv icon

Prompt Learning for Domain Adaptation in Task-Oriented Dialogue

Add code
Nov 10, 2022
Viaarxiv icon

Local Byte Fusion for Neural Machine Translation

Add code
May 23, 2022
Figure 1 for Local Byte Fusion for Neural Machine Translation
Figure 2 for Local Byte Fusion for Neural Machine Translation
Figure 3 for Local Byte Fusion for Neural Machine Translation
Figure 4 for Local Byte Fusion for Neural Machine Translation
Viaarxiv icon