Ben Adlam

Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability

Aug 14, 2024

Understanding Optimal Feature Transfer via a Fine-Grained Bias-Variance Analysis

Apr 18, 2024

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Dec 22, 2023

Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?"

Nov 15, 2023

Small-scale proxies for large-scale Transformer training instabilities

Sep 25, 2023

Kernel Regression with Infinite-Width Neural Networks on Millions of Examples

Mar 09, 2023

Ensembling over Classifiers: a Bias-Variance Perspective

Jun 21, 2022

Implicit Regularization or Implicit Conditioning? Exact Risk Trajectories of SGD in High Dimensions

Jun 15, 2022

Homogenization of SGD in high-dimensions: Exact dynamics and generalization properties

May 14, 2022

Understanding the bias-variance tradeoff of Bregman divergences

Feb 10, 2022