Picture for Oliver Sieberling

Oliver Sieberling

MesaNet: Sequence Modeling by Locally Optimal Test-Time Training

Add code
Jun 05, 2025
Viaarxiv icon

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Add code
May 20, 2025
Viaarxiv icon

DarwinLM: Evolutionary Structured Pruning of Large Language Models

Add code
Feb 11, 2025
Figure 1 for DarwinLM: Evolutionary Structured Pruning of Large Language Models
Figure 2 for DarwinLM: Evolutionary Structured Pruning of Large Language Models
Figure 3 for DarwinLM: Evolutionary Structured Pruning of Large Language Models
Figure 4 for DarwinLM: Evolutionary Structured Pruning of Large Language Models
Viaarxiv icon

EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search

Add code
Oct 18, 2024
Figure 1 for EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search
Figure 2 for EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search
Figure 3 for EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search
Figure 4 for EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search
Viaarxiv icon

Plus Strategies are Exponentially Slower for Planted Optima of Random Height

Add code
Apr 15, 2024
Figure 1 for Plus Strategies are Exponentially Slower for Planted Optima of Random Height
Figure 2 for Plus Strategies are Exponentially Slower for Planted Optima of Random Height
Figure 3 for Plus Strategies are Exponentially Slower for Planted Optima of Random Height
Figure 4 for Plus Strategies are Exponentially Slower for Planted Optima of Random Height
Viaarxiv icon

Hardest Monotone Functions for Evolutionary Algorithms

Add code
Nov 13, 2023
Viaarxiv icon