Guoqiang Zhang

AttentionX: Exploiting Consensus Discrepancy In Attention from A Distributed Optimization Perspective

Sep 09, 2024

On Exact Bit-level Reversible Transformers Without Changing Architectures

Jul 12, 2024

A Distributed Algorithm for Personal Sound Zones Systems

Nov 21, 2023

Exact Diffusion Inversion via Bi-directional Integration Approximation

Jul 21, 2023

On Accelerating Diffusion-Based Sampling Process via Improved Integration Approximation

Apr 25, 2023

Lookahead Diffusion Probabilistic Models for Refining Mean Estimation

Apr 22, 2023

On Suppressing Range of Adaptive Stepsizes of Adam to Improve Generalisation Performance

Feb 02, 2023

On Exploiting Layerwise Gradient Statistics for Effective Training of Deep Neural Networks

Apr 06, 2022

Extending AdamW by Leveraging Its Second Moment and Magnitude

Dec 09, 2021

Coarsening Optimization for Differentiable Programming

Oct 05, 2021