Picture for Makesh Narsimhan Sreedhar

Makesh Narsimhan Sreedhar

Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails

Add code
Jan 15, 2025
Figure 1 for Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails
Figure 2 for Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails
Figure 3 for Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails
Figure 4 for Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails
Viaarxiv icon

Unsupervised Extraction of Dialogue Policies from Conversations

Add code
Jun 21, 2024
Viaarxiv icon

Nemotron-4 340B Technical Report

Add code
Jun 17, 2024
Figure 1 for Nemotron-4 340B Technical Report
Figure 2 for Nemotron-4 340B Technical Report
Figure 3 for Nemotron-4 340B Technical Report
Figure 4 for Nemotron-4 340B Technical Report
Viaarxiv icon

HelpSteer2: Open-source dataset for training top-performing reward models

Add code
Jun 12, 2024
Viaarxiv icon

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Add code
Apr 04, 2024
Viaarxiv icon

HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM

Add code
Nov 16, 2023
Figure 1 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Figure 2 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Figure 3 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Figure 4 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Viaarxiv icon

Evolving Domain Adaptation of Pretrained Language Models for Text Classification

Add code
Nov 16, 2023
Viaarxiv icon

SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF

Add code
Oct 09, 2023
Viaarxiv icon

Prompt Learning for Domain Adaptation in Task-Oriented Dialogue

Add code
Nov 10, 2022
Figure 1 for Prompt Learning for Domain Adaptation in Task-Oriented Dialogue
Figure 2 for Prompt Learning for Domain Adaptation in Task-Oriented Dialogue
Figure 3 for Prompt Learning for Domain Adaptation in Task-Oriented Dialogue
Figure 4 for Prompt Learning for Domain Adaptation in Task-Oriented Dialogue
Viaarxiv icon

Local Byte Fusion for Neural Machine Translation

Add code
May 23, 2022
Figure 1 for Local Byte Fusion for Neural Machine Translation
Figure 2 for Local Byte Fusion for Neural Machine Translation
Figure 3 for Local Byte Fusion for Neural Machine Translation
Figure 4 for Local Byte Fusion for Neural Machine Translation
Viaarxiv icon