Picture for Michael Luo

Michael Luo

SimpleStrat: Diversifying Language Model Generation with Stratification

Add code
Oct 11, 2024
Figure 1 for SimpleStrat: Diversifying Language Model Generation with Stratification
Figure 2 for SimpleStrat: Diversifying Language Model Generation with Stratification
Figure 3 for SimpleStrat: Diversifying Language Model Generation with Stratification
Figure 4 for SimpleStrat: Diversifying Language Model Generation with Stratification
Viaarxiv icon

Stylus: Automatic Adapter Selection for Diffusion Models

Add code
Apr 29, 2024
Figure 1 for Stylus: Automatic Adapter Selection for Diffusion Models
Figure 2 for Stylus: Automatic Adapter Selection for Diffusion Models
Figure 3 for Stylus: Automatic Adapter Selection for Diffusion Models
Figure 4 for Stylus: Automatic Adapter Selection for Diffusion Models
Viaarxiv icon

Balsa: Learning a Query Optimizer Without Expert Demonstrations

Add code
Jan 05, 2022
Figure 1 for Balsa: Learning a Query Optimizer Without Expert Demonstrations
Figure 2 for Balsa: Learning a Query Optimizer Without Expert Demonstrations
Figure 3 for Balsa: Learning a Query Optimizer Without Expert Demonstrations
Figure 4 for Balsa: Learning a Query Optimizer Without Expert Demonstrations
Viaarxiv icon

MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance

Add code
Dec 07, 2021
Figure 1 for MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance
Figure 2 for MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance
Figure 3 for MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance
Figure 4 for MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance
Viaarxiv icon

Discovering Non-monotonic Autoregressive Orderings with Variational Inference

Add code
Oct 27, 2021
Figure 1 for Discovering Non-monotonic Autoregressive Orderings with Variational Inference
Figure 2 for Discovering Non-monotonic Autoregressive Orderings with Variational Inference
Figure 3 for Discovering Non-monotonic Autoregressive Orderings with Variational Inference
Figure 4 for Discovering Non-monotonic Autoregressive Orderings with Variational Inference
Viaarxiv icon

Accelerating Quadratic Optimization with Reinforcement Learning

Add code
Jul 22, 2021
Figure 1 for Accelerating Quadratic Optimization with Reinforcement Learning
Figure 2 for Accelerating Quadratic Optimization with Reinforcement Learning
Figure 3 for Accelerating Quadratic Optimization with Reinforcement Learning
Figure 4 for Accelerating Quadratic Optimization with Reinforcement Learning
Viaarxiv icon

LazyDAgger: Reducing Context Switching in Interactive Imitation Learning

Add code
Mar 31, 2021
Figure 1 for LazyDAgger: Reducing Context Switching in Interactive Imitation Learning
Figure 2 for LazyDAgger: Reducing Context Switching in Interactive Imitation Learning
Figure 3 for LazyDAgger: Reducing Context Switching in Interactive Imitation Learning
Figure 4 for LazyDAgger: Reducing Context Switching in Interactive Imitation Learning
Viaarxiv icon

Distributed Reinforcement Learning is a Dataflow Problem

Add code
Dec 03, 2020
Figure 1 for Distributed Reinforcement Learning is a Dataflow Problem
Figure 2 for Distributed Reinforcement Learning is a Dataflow Problem
Figure 3 for Distributed Reinforcement Learning is a Dataflow Problem
Figure 4 for Distributed Reinforcement Learning is a Dataflow Problem
Viaarxiv icon

Connecting Context-specific Adaptation in Humans to Meta-learning

Add code
Dec 01, 2020
Figure 1 for Connecting Context-specific Adaptation in Humans to Meta-learning
Figure 2 for Connecting Context-specific Adaptation in Humans to Meta-learning
Figure 3 for Connecting Context-specific Adaptation in Humans to Meta-learning
Figure 4 for Connecting Context-specific Adaptation in Humans to Meta-learning
Viaarxiv icon

Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones

Add code
Oct 29, 2020
Figure 1 for Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones
Figure 2 for Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones
Figure 3 for Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones
Figure 4 for Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones
Viaarxiv icon