Abhinav Rastogi

Shammie

MoDE: Effective Multi-task Parameter Efficient Fine-Tuning with a Mixture of Dyadic Experts

Aug 02, 2024, via arXiv

Improve Mathematical Reasoning in Language Models by Automated Process Supervision

Jun 05, 2024, via arXiv

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Mar 15, 2024, via arXiv

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Sep 01, 2023, via arXiv

Conversational Recommendation as Retrieval: A Simple, Strong Baseline

May 23, 2023, via arXiv

AnyTOD: A Programmable Task-Oriented Dialog System

Dec 20, 2022, via arXiv

Speech Aware Dialog System Technology Challenge (DSTC11)

Dec 16, 2022, via arXiv

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Jun 10, 2022, via arXiv

Show, Don't Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue

Apr 08, 2022, via arXiv

A Unified Approach to Entity-Centric Context Tracking in Social Conversations

Jan 28, 2022, via arXiv