Picture for Runzhe Yang

Runzhe Yang

LLMs are Superior Feedback Providers: Bootstrapping Reasoning for Lie Detection with Self-Generated Feedback

Add code
Aug 25, 2024
Viaarxiv icon

COLLIE: Systematic Construction of Constrained Text Generation Tasks

Add code
Jul 17, 2023
Viaarxiv icon

DataMUX: Data Multiplexing for Neural Networks

Add code
Feb 18, 2022
Figure 1 for DataMUX: Data Multiplexing for Neural Networks
Figure 2 for DataMUX: Data Multiplexing for Neural Networks
Figure 3 for DataMUX: Data Multiplexing for Neural Networks
Figure 4 for DataMUX: Data Multiplexing for Neural Networks
Viaarxiv icon

Generating Strategic Dialogue for Negotiation with Theory of Mind

Add code
Oct 20, 2020
Figure 1 for Generating Strategic Dialogue for Negotiation with Theory of Mind
Figure 2 for Generating Strategic Dialogue for Negotiation with Theory of Mind
Figure 3 for Generating Strategic Dialogue for Negotiation with Theory of Mind
Figure 4 for Generating Strategic Dialogue for Negotiation with Theory of Mind
Viaarxiv icon

A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation

Add code
Aug 21, 2019
Figure 1 for A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation
Figure 2 for A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation
Figure 3 for A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation
Figure 4 for A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation
Viaarxiv icon

Imitation Refinement

Add code
May 07, 2018
Figure 1 for Imitation Refinement
Figure 2 for Imitation Refinement
Figure 3 for Imitation Refinement
Figure 4 for Imitation Refinement
Viaarxiv icon