Picture for Jingyang Li

Jingyang Li

MiniMax-01: Scaling Foundation Models with Lightning Attention

Add code
Jan 14, 2025
Viaarxiv icon

Memory-Efficient 4-bit Preconditioned Stochastic Optimization

Add code
Dec 14, 2024
Viaarxiv icon

Federated PCA and Estimation for Spiked Covariance Matrices: Optimal Rates and Efficient Algorithm

Add code
Nov 23, 2024
Figure 1 for Federated PCA and Estimation for Spiked Covariance Matrices: Optimal Rates and Efficient Algorithm
Figure 2 for Federated PCA and Estimation for Spiked Covariance Matrices: Optimal Rates and Efficient Algorithm
Figure 3 for Federated PCA and Estimation for Spiked Covariance Matrices: Optimal Rates and Efficient Algorithm
Viaarxiv icon

Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning

Add code
Oct 15, 2024
Figure 1 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Figure 2 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Figure 3 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Figure 4 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Viaarxiv icon

Online Policy Learning and Inference by Matrix Completion

Add code
Apr 26, 2024
Viaarxiv icon

Towards More Faithful Natural Language Explanation Using Multi-Level Contrastive Learning in VQA

Add code
Dec 21, 2023
Viaarxiv icon

Nonconvex Stochastic Bregman Proximal Gradient Method with Application to Deep Learning

Add code
Jun 29, 2023
Figure 1 for Nonconvex Stochastic Bregman Proximal Gradient Method with Application to Deep Learning
Figure 2 for Nonconvex Stochastic Bregman Proximal Gradient Method with Application to Deep Learning
Figure 3 for Nonconvex Stochastic Bregman Proximal Gradient Method with Application to Deep Learning
Figure 4 for Nonconvex Stochastic Bregman Proximal Gradient Method with Application to Deep Learning
Viaarxiv icon

Online Tensor Learning: Computational and Statistical Trade-offs, Adaptivity and Optimal Regret

Add code
Jun 06, 2023
Viaarxiv icon

Computationally Efficient and Statistically Optimal Robust High-Dimensional Linear Regression

Add code
May 10, 2023
Viaarxiv icon

Towards Generalized Open Information Extraction

Add code
Nov 29, 2022
Viaarxiv icon