Picture for Soummya Kar

Soummya Kar

Distributed Gradient Clustering: Convergence and the Effect of Initialization

Add code
Mar 20, 2026
Viaarxiv icon

Tight Long-Term Tail Decay of (Clipped) SGD in Non-Convex Optimization

Add code
Feb 05, 2026
Viaarxiv icon

Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts

Add code
Oct 06, 2025
Figure 1 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Figure 2 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Figure 3 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Figure 4 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Viaarxiv icon

Federated Multi-Objective Learning with Controlled Pareto Frontiers

Add code
Aug 07, 2025
Figure 1 for Federated Multi-Objective Learning with Controlled Pareto Frontiers
Figure 2 for Federated Multi-Objective Learning with Controlled Pareto Frontiers
Figure 3 for Federated Multi-Objective Learning with Controlled Pareto Frontiers
Figure 4 for Federated Multi-Objective Learning with Controlled Pareto Frontiers
Viaarxiv icon

Distributed gradient methods under heavy-tailed communication noise

Add code
May 30, 2025
Figure 1 for Distributed gradient methods under heavy-tailed communication noise
Figure 2 for Distributed gradient methods under heavy-tailed communication noise
Figure 3 for Distributed gradient methods under heavy-tailed communication noise
Figure 4 for Distributed gradient methods under heavy-tailed communication noise
Viaarxiv icon

Distributed Sign Momentum with Local Steps for Training Transformers

Add code
Nov 26, 2024
Viaarxiv icon

Large Deviations and Improved Mean-squared Error Rates of Nonlinear SGD: Heavy-tailed Noise and Power of Symmetry

Add code
Oct 21, 2024
Figure 1 for Large Deviations and Improved Mean-squared Error Rates of Nonlinear SGD: Heavy-tailed Noise and Power of Symmetry
Figure 2 for Large Deviations and Improved Mean-squared Error Rates of Nonlinear SGD: Heavy-tailed Noise and Power of Symmetry
Viaarxiv icon

Nonlinear Stochastic Gradient Descent and Heavy-tailed Noise: A Unified Framework and High-probability Guarantees

Add code
Oct 17, 2024
Figure 1 for Nonlinear Stochastic Gradient Descent and Heavy-tailed Noise: A Unified Framework and High-probability Guarantees
Figure 2 for Nonlinear Stochastic Gradient Descent and Heavy-tailed Noise: A Unified Framework and High-probability Guarantees
Figure 3 for Nonlinear Stochastic Gradient Descent and Heavy-tailed Noise: A Unified Framework and High-probability Guarantees
Figure 4 for Nonlinear Stochastic Gradient Descent and Heavy-tailed Noise: A Unified Framework and High-probability Guarantees
Viaarxiv icon

Computational Imaging for Long-Term Prediction of Solar Irradiance

Add code
Sep 18, 2024
Figure 1 for Computational Imaging for Long-Term Prediction of Solar Irradiance
Figure 2 for Computational Imaging for Long-Term Prediction of Solar Irradiance
Figure 3 for Computational Imaging for Long-Term Prediction of Solar Irradiance
Figure 4 for Computational Imaging for Long-Term Prediction of Solar Irradiance
Viaarxiv icon

Vehicle-to-Vehicle Charging: Model, Complexity, and Heuristics

Add code
Apr 12, 2024
Figure 1 for Vehicle-to-Vehicle Charging: Model, Complexity, and Heuristics
Figure 2 for Vehicle-to-Vehicle Charging: Model, Complexity, and Heuristics
Figure 3 for Vehicle-to-Vehicle Charging: Model, Complexity, and Heuristics
Figure 4 for Vehicle-to-Vehicle Charging: Model, Complexity, and Heuristics
Viaarxiv icon