Picture for Ali Ramezani-Kebrya

Ali Ramezani-Kebrya

Aligning Attention with Human Rationales for Self-Explaining Hate Speech Detection

Add code
Nov 10, 2025
Viaarxiv icon

Layer-wise Quantization for Quantized Optimistic Dual Averaging

Add code
May 20, 2025
Figure 1 for Layer-wise Quantization for Quantized Optimistic Dual Averaging
Figure 2 for Layer-wise Quantization for Quantized Optimistic Dual Averaging
Figure 3 for Layer-wise Quantization for Quantized Optimistic Dual Averaging
Figure 4 for Layer-wise Quantization for Quantized Optimistic Dual Averaging
Viaarxiv icon

Addressing Label Shift in Distributed Learning via Entropy Regularization

Add code
Feb 04, 2025
Viaarxiv icon

Distributed Extra-gradient with Optimal Complexity and Communication Guarantees

Add code
Aug 17, 2023
Figure 1 for Distributed Extra-gradient with Optimal Complexity and Communication Guarantees
Figure 2 for Distributed Extra-gradient with Optimal Complexity and Communication Guarantees
Figure 3 for Distributed Extra-gradient with Optimal Complexity and Communication Guarantees
Figure 4 for Distributed Extra-gradient with Optimal Complexity and Communication Guarantees
Viaarxiv icon

Federated Learning under Covariate Shifts with Generalization Guarantees

Add code
Jun 08, 2023
Viaarxiv icon

MixTailor: Mixed Gradient Aggregation for Robust Learning Against Tailored Attacks

Add code
Jul 16, 2022
Figure 1 for MixTailor: Mixed Gradient Aggregation for Robust Learning Against Tailored Attacks
Figure 2 for MixTailor: Mixed Gradient Aggregation for Robust Learning Against Tailored Attacks
Figure 3 for MixTailor: Mixed Gradient Aggregation for Robust Learning Against Tailored Attacks
Figure 4 for MixTailor: Mixed Gradient Aggregation for Robust Learning Against Tailored Attacks
Viaarxiv icon

Subquadratic Overparameterization for Shallow Neural Networks

Add code
Nov 02, 2021
Figure 1 for Subquadratic Overparameterization for Shallow Neural Networks
Figure 2 for Subquadratic Overparameterization for Shallow Neural Networks
Viaarxiv icon

NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization

Add code
May 01, 2021
Figure 1 for NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization
Figure 2 for NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization
Figure 3 for NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization
Figure 4 for NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization
Viaarxiv icon

On the Generalization of Stochastic Gradient Descent with Momentum

Add code
Feb 26, 2021
Figure 1 for On the Generalization of Stochastic Gradient Descent with Momentum
Figure 2 for On the Generalization of Stochastic Gradient Descent with Momentum
Figure 3 for On the Generalization of Stochastic Gradient Descent with Momentum
Figure 4 for On the Generalization of Stochastic Gradient Descent with Momentum
Viaarxiv icon

Adaptive Gradient Quantization for Data-Parallel SGD

Add code
Oct 23, 2020
Figure 1 for Adaptive Gradient Quantization for Data-Parallel SGD
Figure 2 for Adaptive Gradient Quantization for Data-Parallel SGD
Figure 3 for Adaptive Gradient Quantization for Data-Parallel SGD
Figure 4 for Adaptive Gradient Quantization for Data-Parallel SGD
Viaarxiv icon