Picture for Boris Hanin

Boris Hanin

Hyperparameter Transfer with Mixture-of-Expert Layers

Add code
Jan 28, 2026
Viaarxiv icon

Implicit Bias of the JKO Scheme

Add code
Nov 18, 2025
Figure 1 for Implicit Bias of the JKO Scheme
Figure 2 for Implicit Bias of the JKO Scheme
Figure 3 for Implicit Bias of the JKO Scheme
Figure 4 for Implicit Bias of the JKO Scheme
Viaarxiv icon

Don't be lazy: CompleteP enables compute-efficient deep transformers

Add code
May 02, 2025
Viaarxiv icon

Deep Nets as Hamiltonians

Add code
Mar 31, 2025
Figure 1 for Deep Nets as Hamiltonians
Figure 2 for Deep Nets as Hamiltonians
Figure 3 for Deep Nets as Hamiltonians
Figure 4 for Deep Nets as Hamiltonians
Viaarxiv icon

Optimizing Model Selection for Compound AI Systems

Add code
Feb 20, 2025
Viaarxiv icon

Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization

Add code
Oct 11, 2024
Viaarxiv icon

Networks of Networks: Complexity Class Principles Applied to Compound AI Systems Design

Add code
Jul 23, 2024
Figure 1 for Networks of Networks: Complexity Class Principles Applied to Compound AI Systems Design
Figure 2 for Networks of Networks: Complexity Class Principles Applied to Compound AI Systems Design
Figure 3 for Networks of Networks: Complexity Class Principles Applied to Compound AI Systems Design
Figure 4 for Networks of Networks: Complexity Class Principles Applied to Compound AI Systems Design
Viaarxiv icon

Bayesian Inference with Deep Weakly Nonlinear Networks

Add code
May 26, 2024
Figure 1 for Bayesian Inference with Deep Weakly Nonlinear Networks
Figure 2 for Bayesian Inference with Deep Weakly Nonlinear Networks
Figure 3 for Bayesian Inference with Deep Weakly Nonlinear Networks
Viaarxiv icon

Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems

Add code
Mar 04, 2024
Figure 1 for Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems
Figure 2 for Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems
Figure 3 for Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems
Figure 4 for Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems
Viaarxiv icon

Principled Architecture-aware Scaling of Hyperparameters

Add code
Feb 27, 2024
Figure 1 for Principled Architecture-aware Scaling of Hyperparameters
Figure 2 for Principled Architecture-aware Scaling of Hyperparameters
Figure 3 for Principled Architecture-aware Scaling of Hyperparameters
Figure 4 for Principled Architecture-aware Scaling of Hyperparameters
Viaarxiv icon