Yee Whye Teh

University College London

Enhancing Large Language Model Reasoning with Reward Models: An Analytical Survey

Oct 02, 2025
Via arXiv

Is Model Editing Built on Sand? Revealing Its Illusory Success and Fragile Foundation

Oct 01, 2025
Via arXiv

Rao-Blackwellised Reparameterisation Gradients

Jun 09, 2025
Via arXiv

Extending Epistemic Uncertainty Beyond Parameters Would Assist in Designing Reliable LLMs

Jun 09, 2025
Via arXiv

NoProp: Training Neural Networks without Back-propagation or Forward-propagation

Mar 31, 2025
Via arXiv

Prompting Strategies for Enabling Large Language Models to Infer Causation from Correlation

Dec 18, 2024
Via arXiv

Learning Loss Landscapes in Preference Optimization

Nov 10, 2024
Via arXiv

Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset

Nov 06, 2024
Via arXiv

L3Ms -- Lagrange Large Language Models

Oct 28, 2024
Via arXiv

SymDiff: Equivariant Diffusion via Stochastic Symmetrisation

Oct 08, 2024
Via arXiv