Picture for Vishal Maini

Vishal Maini

Reducing Sentiment Bias in Language Models via Counterfactual Evaluation

Add code
Nov 08, 2019
Figure 1 for Reducing Sentiment Bias in Language Models via Counterfactual Evaluation
Figure 2 for Reducing Sentiment Bias in Language Models via Counterfactual Evaluation
Figure 3 for Reducing Sentiment Bias in Language Models via Counterfactual Evaluation
Figure 4 for Reducing Sentiment Bias in Language Models via Counterfactual Evaluation
Viaarxiv icon

Scalable agent alignment via reward modeling: a research direction

Add code
Nov 19, 2018
Figure 1 for Scalable agent alignment via reward modeling: a research direction
Figure 2 for Scalable agent alignment via reward modeling: a research direction
Figure 3 for Scalable agent alignment via reward modeling: a research direction
Figure 4 for Scalable agent alignment via reward modeling: a research direction
Viaarxiv icon