Kyriakos Axiotis

DeepCrossAttention: Supercharging Transformer Residual Connections

Feb 10, 2025
Via arXiv

SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial Optimization

Feb 27, 2024
Via arXiv

Data-Efficient Learning via Clustering-Based Sensitivity Sampling: Foundation Models and Beyond

Feb 27, 2024
Via arXiv

Greedy PIG: Adaptive Integrated Gradients

Nov 10, 2023
Via arXiv

Performance of $\ell_1$ Regularization for Sparse Convex Optimization

Jul 14, 2023
Via arXiv

Gradient Descent Converges Linearly for Logistic Regression on Separable Data

Jun 26, 2023
Via arXiv

Iterative Hard Thresholding with Adaptive Regularization: Sparser Solutions Without Sacrificing Runtime

Apr 11, 2022
Via arXiv

Decomposable Submodular Function Minimization via Maximum Flow

Mar 05, 2021
Via arXiv

Local Search Algorithms for Rank-Constrained Convex Optimization

Jan 15, 2021
Via arXiv

Sparse Convex Optimization via Adaptively Regularized Hard Thresholding

Jun 25, 2020
Via arXiv