Picture for Xingyu Xie

Xingyu Xie

Optimization Hyper-parameter Laws for Large Language Models

Add code
Sep 07, 2024
Viaarxiv icon

LoCo: Low-Bit Communication Adaptor for Large-scale Model Training

Add code
Jul 05, 2024
Viaarxiv icon

On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization

Add code
May 26, 2024
Viaarxiv icon

Task-Robust Pre-Training for Worst-Case Downstream Adaptation

Add code
Jul 05, 2023
Viaarxiv icon

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Add code
Sep 01, 2022
Figure 1 for Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Figure 2 for Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Figure 3 for Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Figure 4 for Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Viaarxiv icon

Global Convergence of Over-parameterized Deep Equilibrium Models

Add code
May 27, 2022
Figure 1 for Global Convergence of Over-parameterized Deep Equilibrium Models
Viaarxiv icon

High Quality Segmentation for Ultra High-resolution Images

Add code
Dec 26, 2021
Figure 1 for High Quality Segmentation for Ultra High-resolution Images
Figure 2 for High Quality Segmentation for Ultra High-resolution Images
Figure 3 for High Quality Segmentation for Ultra High-resolution Images
Figure 4 for High Quality Segmentation for Ultra High-resolution Images
Viaarxiv icon

Optimization Induced Equilibrium Networks

Add code
Jun 07, 2021
Figure 1 for Optimization Induced Equilibrium Networks
Figure 2 for Optimization Induced Equilibrium Networks
Figure 3 for Optimization Induced Equilibrium Networks
Viaarxiv icon

Maximum-and-Concatenation Networks

Add code
Jul 09, 2020
Figure 1 for Maximum-and-Concatenation Networks
Figure 2 for Maximum-and-Concatenation Networks
Figure 3 for Maximum-and-Concatenation Networks
Figure 4 for Maximum-and-Concatenation Networks
Viaarxiv icon

Differentiable Linearized ADMM

Add code
May 15, 2019
Figure 1 for Differentiable Linearized ADMM
Figure 2 for Differentiable Linearized ADMM
Figure 3 for Differentiable Linearized ADMM
Figure 4 for Differentiable Linearized ADMM
Viaarxiv icon