Picture for Simon Weissmann

Simon Weissmann

An Approximate Ascent Approach To Prove Convergence of PPO

Add code
Feb 03, 2026
Viaarxiv icon

Adaptive Kernel Selection for Stein Variational Gradient Descent

Add code
Oct 02, 2025
Viaarxiv icon

Controlling the Flow: Stability and Convergence for Stochastic Gradient Descent with Decaying Regularization

Add code
May 16, 2025
Viaarxiv icon

Clustered KL-barycenter design for policy evaluation

Add code
Mar 04, 2025
Viaarxiv icon

Structure Matters: Dynamic Policy Gradient

Add code
Nov 07, 2024
Figure 1 for Structure Matters: Dynamic Policy Gradient
Figure 2 for Structure Matters: Dynamic Policy Gradient
Figure 3 for Structure Matters: Dynamic Policy Gradient
Figure 4 for Structure Matters: Dynamic Policy Gradient
Viaarxiv icon

Polyak's Heavy Ball Method Achieves Accelerated Local Rate of Convergence under Polyak-Lojasiewicz Inequality

Add code
Oct 22, 2024
Viaarxiv icon

Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods

Add code
Oct 04, 2023
Viaarxiv icon

Consistency analysis of bilevel data-driven learning in inverse problems

Add code
Jul 06, 2020
Figure 1 for Consistency analysis of bilevel data-driven learning in inverse problems
Figure 2 for Consistency analysis of bilevel data-driven learning in inverse problems
Figure 3 for Consistency analysis of bilevel data-driven learning in inverse problems
Figure 4 for Consistency analysis of bilevel data-driven learning in inverse problems
Viaarxiv icon