Makesh Narsimhan Sreedhar

Unsupervised Extraction of Dialogue Policies from Conversations

Jun 21, 2024

Nemotron-4 340B Technical Report

Jun 17, 2024

HelpSteer2: Open-source dataset for training top-performing reward models

Jun 12, 2024

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Apr 04, 2024

HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM

Nov 16, 2023

Evolving Domain Adaptation of Pretrained Language Models for Text Classification

Nov 16, 2023

SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF

Oct 09, 2023

Prompt Learning for Domain Adaptation in Task-Oriented Dialogue

Nov 10, 2022

Local Byte Fusion for Neural Machine Translation

May 23, 2022

Learning Improvised Chatbots from Adversarial Modifications of Natural Language Feedback

Oct 15, 2020