Picture for Ruibin Xiong

Ruibin Xiong

From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning

Add code
Nov 06, 2024
Figure 1 for From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning
Figure 2 for From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning
Figure 3 for From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning
Figure 4 for From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning
Viaarxiv icon

When Does Group Invariant Learning Survive Spurious Correlations?

Add code
Jun 29, 2022
Figure 1 for When Does Group Invariant Learning Survive Spurious Correlations?
Figure 2 for When Does Group Invariant Learning Survive Spurious Correlations?
Figure 3 for When Does Group Invariant Learning Survive Spurious Correlations?
Figure 4 for When Does Group Invariant Learning Survive Spurious Correlations?
Viaarxiv icon

Uncertainty Calibration for Ensemble-Based Debiasing Methods

Add code
Nov 07, 2021
Figure 1 for Uncertainty Calibration for Ensemble-Based Debiasing Methods
Figure 2 for Uncertainty Calibration for Ensemble-Based Debiasing Methods
Figure 3 for Uncertainty Calibration for Ensemble-Based Debiasing Methods
Figure 4 for Uncertainty Calibration for Ensemble-Based Debiasing Methods
Viaarxiv icon

On Layer Normalization in the Transformer Architecture

Add code
Feb 12, 2020
Figure 1 for On Layer Normalization in the Transformer Architecture
Figure 2 for On Layer Normalization in the Transformer Architecture
Figure 3 for On Layer Normalization in the Transformer Architecture
Figure 4 for On Layer Normalization in the Transformer Architecture
Viaarxiv icon