Picture for Wenbo Gong

Wenbo Gong

SWAN: SGD with Normalization and Whitening Enables Stateless LLM Training

Add code
Dec 23, 2024
Figure 1 for SWAN: SGD with Normalization and Whitening Enables Stateless LLM Training
Figure 2 for SWAN: SGD with Normalization and Whitening Enables Stateless LLM Training
Figure 3 for SWAN: SGD with Normalization and Whitening Enables Stateless LLM Training
Figure 4 for SWAN: SGD with Normalization and Whitening Enables Stateless LLM Training
Viaarxiv icon

SWAN: Preprocessing SGD Enables Adam-Level Performance On LLM Training With Significant Memory Reduction

Add code
Dec 17, 2024
Figure 1 for SWAN: Preprocessing SGD Enables Adam-Level Performance On LLM Training With Significant Memory Reduction
Figure 2 for SWAN: Preprocessing SGD Enables Adam-Level Performance On LLM Training With Significant Memory Reduction
Figure 3 for SWAN: Preprocessing SGD Enables Adam-Level Performance On LLM Training With Significant Memory Reduction
Figure 4 for SWAN: Preprocessing SGD Enables Adam-Level Performance On LLM Training With Significant Memory Reduction
Viaarxiv icon

The Essential Role of Causality in Foundation World Models for Embodied AI

Add code
Feb 06, 2024
Figure 1 for The Essential Role of Causality in Foundation World Models for Embodied AI
Figure 2 for The Essential Role of Causality in Foundation World Models for Embodied AI
Viaarxiv icon

Neural Structure Learning with Stochastic Differential Equations

Add code
Nov 06, 2023
Figure 1 for Neural Structure Learning with Stochastic Differential Equations
Figure 2 for Neural Structure Learning with Stochastic Differential Equations
Figure 3 for Neural Structure Learning with Stochastic Differential Equations
Figure 4 for Neural Structure Learning with Stochastic Differential Equations
Viaarxiv icon

BayesDAG: Gradient-Based Posterior Sampling for Causal Discovery

Add code
Jul 26, 2023
Viaarxiv icon

Understanding Causality with Large Language Models: Feasibility and Opportunities

Add code
Apr 11, 2023
Figure 1 for Understanding Causality with Large Language Models: Feasibility and Opportunities
Figure 2 for Understanding Causality with Large Language Models: Feasibility and Opportunities
Figure 3 for Understanding Causality with Large Language Models: Feasibility and Opportunities
Figure 4 for Understanding Causality with Large Language Models: Feasibility and Opportunities
Viaarxiv icon

Rhino: Deep Causal Temporal Relationship Learning With History-dependent Noise

Add code
Oct 26, 2022
Figure 1 for Rhino: Deep Causal Temporal Relationship Learning With History-dependent Noise
Figure 2 for Rhino: Deep Causal Temporal Relationship Learning With History-dependent Noise
Figure 3 for Rhino: Deep Causal Temporal Relationship Learning With History-dependent Noise
Figure 4 for Rhino: Deep Causal Temporal Relationship Learning With History-dependent Noise
Viaarxiv icon

NeurIPS Competition Instructions and Guide: Causal Insights for Learning Paths in Education

Add code
Aug 31, 2022
Figure 1 for NeurIPS Competition Instructions and Guide: Causal Insights for Learning Paths in Education
Figure 2 for NeurIPS Competition Instructions and Guide: Causal Insights for Learning Paths in Education
Figure 3 for NeurIPS Competition Instructions and Guide: Causal Insights for Learning Paths in Education
Viaarxiv icon

Deep End-to-end Causal Inference

Add code
Feb 04, 2022
Figure 1 for Deep End-to-end Causal Inference
Figure 2 for Deep End-to-end Causal Inference
Figure 3 for Deep End-to-end Causal Inference
Figure 4 for Deep End-to-end Causal Inference
Viaarxiv icon

Interpreting diffusion score matching using normalizing flow

Add code
Jul 21, 2021
Figure 1 for Interpreting diffusion score matching using normalizing flow
Figure 2 for Interpreting diffusion score matching using normalizing flow
Viaarxiv icon