Picture for Mohammad Saleh

Mohammad Saleh

Building Math Agents with Multi-Turn Iterative Preference Learning

Add code
Sep 04, 2024
Figure 1 for Building Math Agents with Multi-Turn Iterative Preference Learning
Figure 2 for Building Math Agents with Multi-Turn Iterative Preference Learning
Figure 3 for Building Math Agents with Multi-Turn Iterative Preference Learning
Figure 4 for Building Math Agents with Multi-Turn Iterative Preference Learning
Viaarxiv icon

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

LiPO: Listwise Preference Optimization through Learning-to-Rank

Add code
Feb 02, 2024
Figure 1 for LiPO: Listwise Preference Optimization through Learning-to-Rank
Figure 2 for LiPO: Listwise Preference Optimization through Learning-to-Rank
Figure 3 for LiPO: Listwise Preference Optimization through Learning-to-Rank
Figure 4 for LiPO: Listwise Preference Optimization through Learning-to-Rank
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Statistical Rejection Sampling Improves Preference Optimization

Add code
Sep 13, 2023
Viaarxiv icon

SLiC-HF: Sequence Likelihood Calibration with Human Feedback

Add code
May 17, 2023
Viaarxiv icon

Improving the Robustness of Summarization Models by Detecting and Removing Input Noise

Add code
Dec 20, 2022
Viaarxiv icon

Calibrating Sequence likelihood Improves Conditional Language Generation

Add code
Sep 30, 2022
Figure 1 for Calibrating Sequence likelihood Improves Conditional Language Generation
Figure 2 for Calibrating Sequence likelihood Improves Conditional Language Generation
Figure 3 for Calibrating Sequence likelihood Improves Conditional Language Generation
Figure 4 for Calibrating Sequence likelihood Improves Conditional Language Generation
Viaarxiv icon

Out-of-Distribution Detection and Selective Generation for Conditional Language Models

Add code
Sep 30, 2022
Figure 1 for Out-of-Distribution Detection and Selective Generation for Conditional Language Models
Figure 2 for Out-of-Distribution Detection and Selective Generation for Conditional Language Models
Figure 3 for Out-of-Distribution Detection and Selective Generation for Conditional Language Models
Figure 4 for Out-of-Distribution Detection and Selective Generation for Conditional Language Models
Viaarxiv icon

WebRED: Effective Pretraining And Finetuning For Relation Extraction On The Web

Add code
Feb 18, 2021
Figure 1 for WebRED: Effective Pretraining And Finetuning For Relation Extraction On The Web
Figure 2 for WebRED: Effective Pretraining And Finetuning For Relation Extraction On The Web
Figure 3 for WebRED: Effective Pretraining And Finetuning For Relation Extraction On The Web
Figure 4 for WebRED: Effective Pretraining And Finetuning For Relation Extraction On The Web
Viaarxiv icon