Picture for Guodong Zhang

Guodong Zhang

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation

Add code
Feb 20, 2023
Viaarxiv icon

Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers

Add code
Mar 15, 2022
Figure 1 for Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers
Figure 2 for Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers
Figure 3 for Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers
Figure 4 for Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers
Viaarxiv icon

On the Application of Data-Driven Deep Neural Networks in Linear and Nonlinear Structural Dynamics

Add code
Nov 03, 2021
Figure 1 for On the Application of Data-Driven Deep Neural Networks in Linear and Nonlinear Structural Dynamics
Figure 2 for On the Application of Data-Driven Deep Neural Networks in Linear and Nonlinear Structural Dynamics
Figure 3 for On the Application of Data-Driven Deep Neural Networks in Linear and Nonlinear Structural Dynamics
Figure 4 for On the Application of Data-Driven Deep Neural Networks in Linear and Nonlinear Structural Dynamics
Viaarxiv icon

Learning to Give Checkable Answers with Prover-Verifier Games

Add code
Aug 27, 2021
Figure 1 for Learning to Give Checkable Answers with Prover-Verifier Games
Figure 2 for Learning to Give Checkable Answers with Prover-Verifier Games
Figure 3 for Learning to Give Checkable Answers with Prover-Verifier Games
Figure 4 for Learning to Give Checkable Answers with Prover-Verifier Games
Viaarxiv icon

Differentiable Annealed Importance Sampling and the Perils of Gradient Noise

Add code
Jul 21, 2021
Figure 1 for Differentiable Annealed Importance Sampling and the Perils of Gradient Noise
Figure 2 for Differentiable Annealed Importance Sampling and the Perils of Gradient Noise
Figure 3 for Differentiable Annealed Importance Sampling and the Perils of Gradient Noise
Figure 4 for Differentiable Annealed Importance Sampling and the Perils of Gradient Noise
Viaarxiv icon

A Central Limit Theorem, Loss Aversion and Multi-Armed Bandits

Add code
Jun 10, 2021
Viaarxiv icon

Don't Fix What ain't Broke: Near-optimal Local Convergence of Alternating Gradient Descent-Ascent for Minimax Optimization

Add code
Feb 18, 2021
Figure 1 for Don't Fix What ain't Broke: Near-optimal Local Convergence of Alternating Gradient Descent-Ascent for Minimax Optimization
Figure 2 for Don't Fix What ain't Broke: Near-optimal Local Convergence of Alternating Gradient Descent-Ascent for Minimax Optimization
Figure 3 for Don't Fix What ain't Broke: Near-optimal Local Convergence of Alternating Gradient Descent-Ascent for Minimax Optimization
Figure 4 for Don't Fix What ain't Broke: Near-optimal Local Convergence of Alternating Gradient Descent-Ascent for Minimax Optimization
Viaarxiv icon

A Unified Analysis of First-Order Methods for Smooth Games via Integral Quadratic Constraints

Add code
Oct 02, 2020
Figure 1 for A Unified Analysis of First-Order Methods for Smooth Games via Integral Quadratic Constraints
Figure 2 for A Unified Analysis of First-Order Methods for Smooth Games via Integral Quadratic Constraints
Figure 3 for A Unified Analysis of First-Order Methods for Smooth Games via Integral Quadratic Constraints
Figure 4 for A Unified Analysis of First-Order Methods for Smooth Games via Integral Quadratic Constraints
Viaarxiv icon

On the Suboptimality of Negative Momentum for Minimax Optimization

Add code
Aug 17, 2020
Figure 1 for On the Suboptimality of Negative Momentum for Minimax Optimization
Figure 2 for On the Suboptimality of Negative Momentum for Minimax Optimization
Figure 3 for On the Suboptimality of Negative Momentum for Minimax Optimization
Viaarxiv icon