Picture for Soham Sane

Soham Sane

AM-PPO: (Advantage) Alpha-Modulation with Proximal Policy Optimization

Add code
May 21, 2025
Viaarxiv icon

AlphaGrad: Non-Linear Gradient Normalization Optimizer

Add code
Apr 22, 2025
Viaarxiv icon

Optimal Scaling Laws for Efficiency Gains in a Theoretical Transformer-Augmented Sectional MoE Framework

Add code
Mar 26, 2025
Figure 1 for Optimal Scaling Laws for Efficiency Gains in a Theoretical Transformer-Augmented Sectional MoE Framework
Figure 2 for Optimal Scaling Laws for Efficiency Gains in a Theoretical Transformer-Augmented Sectional MoE Framework
Figure 3 for Optimal Scaling Laws for Efficiency Gains in a Theoretical Transformer-Augmented Sectional MoE Framework
Figure 4 for Optimal Scaling Laws for Efficiency Gains in a Theoretical Transformer-Augmented Sectional MoE Framework
Viaarxiv icon

A NotSo Simple Way to Beat Simple Bench

Add code
Dec 12, 2024
Figure 1 for A NotSo Simple Way to Beat Simple Bench
Figure 2 for A NotSo Simple Way to Beat Simple Bench
Figure 3 for A NotSo Simple Way to Beat Simple Bench
Figure 4 for A NotSo Simple Way to Beat Simple Bench
Viaarxiv icon