Chris Cremer

Averaging log-likelihoods in direct alignment

Jun 27, 2024

Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion

Jun 27, 2024

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

Feb 26, 2024

Inference Suboptimality in Variational Autoencoders

May 27, 2018

Reinterpreting Importance-Weighted Autoencoders

Aug 15, 2017