Picture for Sougata Chaudhuri

Sougata Chaudhuri

James

A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More

Add code
Jul 23, 2024
Figure 1 for A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More
Figure 2 for A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More
Figure 3 for A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More
Figure 4 for A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More
Viaarxiv icon

On Lipschitz Continuity and Smoothness of Loss Functions in Learning to Rank

Add code
Sep 13, 2016
Viaarxiv icon

Online Learning to Rank with Top-k Feedback

Add code
Aug 23, 2016
Figure 1 for Online Learning to Rank with Top-k Feedback
Figure 2 for Online Learning to Rank with Top-k Feedback
Figure 3 for Online Learning to Rank with Top-k Feedback
Figure 4 for Online Learning to Rank with Top-k Feedback
Viaarxiv icon

Phased Exploration with Greedy Exploitation in Stochastic Combinatorial Partial Monitoring Games

Add code
Aug 23, 2016
Viaarxiv icon

Perceptron like Algorithms for Online Learning to Rank

Add code
Aug 23, 2016
Figure 1 for Perceptron like Algorithms for Online Learning to Rank
Figure 2 for Perceptron like Algorithms for Online Learning to Rank
Viaarxiv icon

Online Ranking with Top-1 Feedback

Add code
Mar 06, 2016
Figure 1 for Online Ranking with Top-1 Feedback
Figure 2 for Online Ranking with Top-1 Feedback
Figure 3 for Online Ranking with Top-1 Feedback
Figure 4 for Online Ranking with Top-1 Feedback
Viaarxiv icon

Personalized Advertisement Recommendation: A Ranking Approach to Address the Ubiquitous Click Sparsity Problem

Add code
Mar 06, 2016
Figure 1 for Personalized Advertisement Recommendation: A Ranking Approach to Address the Ubiquitous Click Sparsity Problem
Figure 2 for Personalized Advertisement Recommendation: A Ranking Approach to Address the Ubiquitous Click Sparsity Problem
Figure 3 for Personalized Advertisement Recommendation: A Ranking Approach to Address the Ubiquitous Click Sparsity Problem
Figure 4 for Personalized Advertisement Recommendation: A Ranking Approach to Address the Ubiquitous Click Sparsity Problem
Viaarxiv icon

Generalization error bounds for learning to rank: Does the length of document lists matter?

Add code
Mar 06, 2016
Figure 1 for Generalization error bounds for learning to rank: Does the length of document lists matter?
Viaarxiv icon

Online Learning to Rank with Feedback at the Top

Add code
Mar 06, 2016
Figure 1 for Online Learning to Rank with Feedback at the Top
Figure 2 for Online Learning to Rank with Feedback at the Top
Viaarxiv icon

Handling Class Imbalance in Link Prediction using Learning to Rank Techniques

Add code
Feb 22, 2016
Figure 1 for Handling Class Imbalance in Link Prediction using Learning to Rank Techniques
Figure 2 for Handling Class Imbalance in Link Prediction using Learning to Rank Techniques
Figure 3 for Handling Class Imbalance in Link Prediction using Learning to Rank Techniques
Viaarxiv icon