Picture for Michael W. Mahoney

Michael W. Mahoney

UC Berkeley/LBNL/ICSI

Visualizing Loss Functions as Topological Landscape Profiles

Add code
Nov 19, 2024
Viaarxiv icon

Evaluating Loss Landscapes from a Topology Perspective

Add code
Nov 14, 2024
Viaarxiv icon

Squeezed Attention: Accelerating Long Context Length LLM Inference

Add code
Nov 14, 2024
Viaarxiv icon

How many classifiers do we need?

Add code
Nov 01, 2024
Viaarxiv icon

AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models

Add code
Oct 14, 2024
Figure 1 for AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
Figure 2 for AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
Figure 3 for AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
Figure 4 for AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
Viaarxiv icon

Elucidating the Design Choice of Probability Paths in Flow Matching for Forecasting

Add code
Oct 04, 2024
Viaarxiv icon

Mitigating Memorization In Language Models

Add code
Oct 03, 2024
Figure 1 for Mitigating Memorization In Language Models
Figure 2 for Mitigating Memorization In Language Models
Figure 3 for Mitigating Memorization In Language Models
Figure 4 for Mitigating Memorization In Language Models
Viaarxiv icon

Tuning Frequency Bias of State Space Models

Add code
Oct 02, 2024
Figure 1 for Tuning Frequency Bias of State Space Models
Figure 2 for Tuning Frequency Bias of State Space Models
Figure 3 for Tuning Frequency Bias of State Space Models
Figure 4 for Tuning Frequency Bias of State Space Models
Viaarxiv icon

Learning Physics for Unveiling Hidden Earthquake Ground Motions via Conditional Generative Modeling

Add code
Jul 21, 2024
Figure 1 for Learning Physics for Unveiling Hidden Earthquake Ground Motions via Conditional Generative Modeling
Figure 2 for Learning Physics for Unveiling Hidden Earthquake Ground Motions via Conditional Generative Modeling
Figure 3 for Learning Physics for Unveiling Hidden Earthquake Ground Motions via Conditional Generative Modeling
Figure 4 for Learning Physics for Unveiling Hidden Earthquake Ground Motions via Conditional Generative Modeling
Viaarxiv icon

Comparing and Contrasting Deep Learning Weather Prediction Backbones on Navier-Stokes and Atmospheric Dynamics

Add code
Jul 19, 2024
Viaarxiv icon