Picture for Yuguang Yue

Yuguang Yue

MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning

Add code
May 19, 2024
Viaarxiv icon

Learning to Rank For Push Notifications Using Pairwise Expected Regret

Add code
Jan 19, 2022
Viaarxiv icon

Implicit Distributional Reinforcement Learning

Add code
Jul 13, 2020
Figure 1 for Implicit Distributional Reinforcement Learning
Figure 2 for Implicit Distributional Reinforcement Learning
Figure 3 for Implicit Distributional Reinforcement Learning
Figure 4 for Implicit Distributional Reinforcement Learning
Viaarxiv icon

Discrete Action On-Policy Learning with Action-Value Critic

Add code
Feb 21, 2020
Figure 1 for Discrete Action On-Policy Learning with Action-Value Critic
Figure 2 for Discrete Action On-Policy Learning with Action-Value Critic
Figure 3 for Discrete Action On-Policy Learning with Action-Value Critic
Figure 4 for Discrete Action On-Policy Learning with Action-Value Critic
Viaarxiv icon

Semi-supervised Learning using Adversarial Training with Good and Bad Samples

Add code
Oct 18, 2019
Figure 1 for Semi-supervised Learning using Adversarial Training with Good and Bad Samples
Figure 2 for Semi-supervised Learning using Adversarial Training with Good and Bad Samples
Figure 3 for Semi-supervised Learning using Adversarial Training with Good and Bad Samples
Figure 4 for Semi-supervised Learning using Adversarial Training with Good and Bad Samples
Viaarxiv icon

A Unified Framework for Tuning Hyperparameters in Clustering Problems

Add code
Oct 17, 2019
Figure 1 for A Unified Framework for Tuning Hyperparameters in Clustering Problems
Figure 2 for A Unified Framework for Tuning Hyperparameters in Clustering Problems
Figure 3 for A Unified Framework for Tuning Hyperparameters in Clustering Problems
Figure 4 for A Unified Framework for Tuning Hyperparameters in Clustering Problems
Viaarxiv icon

ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables

Add code
May 04, 2019
Figure 1 for ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables
Figure 2 for ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables
Figure 3 for ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables
Figure 4 for ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables
Viaarxiv icon