Picture for Yuguang Yue

Yuguang Yue

MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning

Add code
May 19, 2024
Figure 1 for MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning
Figure 2 for MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning
Figure 3 for MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning
Figure 4 for MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning
Viaarxiv icon

Learning to Rank For Push Notifications Using Pairwise Expected Regret

Add code
Jan 19, 2022
Figure 1 for Learning to Rank For Push Notifications Using Pairwise Expected Regret
Figure 2 for Learning to Rank For Push Notifications Using Pairwise Expected Regret
Viaarxiv icon

Implicit Distributional Reinforcement Learning

Add code
Jul 13, 2020
Figure 1 for Implicit Distributional Reinforcement Learning
Figure 2 for Implicit Distributional Reinforcement Learning
Figure 3 for Implicit Distributional Reinforcement Learning
Figure 4 for Implicit Distributional Reinforcement Learning
Viaarxiv icon

Discrete Action On-Policy Learning with Action-Value Critic

Add code
Feb 21, 2020
Figure 1 for Discrete Action On-Policy Learning with Action-Value Critic
Figure 2 for Discrete Action On-Policy Learning with Action-Value Critic
Figure 3 for Discrete Action On-Policy Learning with Action-Value Critic
Figure 4 for Discrete Action On-Policy Learning with Action-Value Critic
Viaarxiv icon

Semi-supervised Learning using Adversarial Training with Good and Bad Samples

Add code
Oct 18, 2019
Figure 1 for Semi-supervised Learning using Adversarial Training with Good and Bad Samples
Figure 2 for Semi-supervised Learning using Adversarial Training with Good and Bad Samples
Figure 3 for Semi-supervised Learning using Adversarial Training with Good and Bad Samples
Figure 4 for Semi-supervised Learning using Adversarial Training with Good and Bad Samples
Viaarxiv icon

A Unified Framework for Tuning Hyperparameters in Clustering Problems

Add code
Oct 17, 2019
Figure 1 for A Unified Framework for Tuning Hyperparameters in Clustering Problems
Figure 2 for A Unified Framework for Tuning Hyperparameters in Clustering Problems
Figure 3 for A Unified Framework for Tuning Hyperparameters in Clustering Problems
Figure 4 for A Unified Framework for Tuning Hyperparameters in Clustering Problems
Viaarxiv icon

ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables

Add code
May 04, 2019
Figure 1 for ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables
Figure 2 for ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables
Figure 3 for ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables
Figure 4 for ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables
Viaarxiv icon