Simon Du

Decoding-Time Language Model Alignment with Multiple Objectives

Jun 27, 2024

JoMA: Demystifying Multilayer Transformers via JOint Dynamics of MLP and Attention

Oct 03, 2023

Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer

May 25, 2023

Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path

May 22, 2022

AdaLoss: A computationally-efficient and provably convergent adaptive gradient method

Sep 17, 2021

Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization

Mar 12, 2021

Discrete-Continuous Mixtures in Probabilistic Programming: Generalized Semantics and Inference Algorithms

Jun 08, 2018

Stochastic Zeroth-order Optimization in High Dimensions

Feb 26, 2018