Siddharth Verma

Suppressing Pink Elephants with Direct Principle Feedback

Feb 13, 2024
Via arXiv

OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models

May 19, 2023
Via arXiv

Uniform Masking Prevails in Vision-Language Pretraining

Dec 10, 2022
Via arXiv

CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning

Apr 18, 2022
Via arXiv

Continual Learning of Control Primitives: Skill Discovery via Reset-Games

Nov 10, 2020
Via arXiv

Fast Online "Next Best Offers" using Deep Learning

May 31, 2019
Via arXiv