Peter J. Liu

Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability

Aug 14, 2024
Via arXiv

Scaling Exponents Across Parameterizations and Optimizers

Jul 08, 2024
Via arXiv

LiPO: Listwise Preference Optimization through Learning-to-Rank

Feb 02, 2024
Via arXiv

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Dec 22, 2023
Via arXiv

Self-Evaluation Improves Selective Generation in Large Language Models

Dec 14, 2023
Via arXiv

Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?"

Nov 15, 2023
Via arXiv

Improving Large Language Model Fine-tuning for Solving Math Problems

Oct 16, 2023
Via arXiv

Small-scale Proxies for Large-scale Transformer Training Instabilities

Sep 25, 2023
Via arXiv

Statistical Rejection Sampling Improves Preference Optimization

Sep 13, 2023
Via arXiv

SLiC-HF: Sequence Likelihood Calibration with Human Feedback

May 17, 2023
Via arXiv