Viraj Mehta

Group Robust Preference Optimization in Reward-free RLHF

May 30, 2024
Via arXiv

Sample Efficient Reinforcement Learning from Human Feedback via Active Exploration

Dec 01, 2023
Via arXiv

Kernelized Offline Contextual Dueling Bandits

Jul 21, 2023
Via arXiv

Near-optimal Policy Identification in Active Reinforcement Learning

Dec 19, 2022
Via arXiv

Exploration via Planning for Information about the Optimal Trajectory

Oct 06, 2022
Via arXiv

BATS: Best Action Trajectory Stitching

Apr 26, 2022
Via arXiv

Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias

Dec 13, 2021
Via arXiv

An Experimental Design Perspective on Model-Based Reinforcement Learning

Dec 09, 2021
Via arXiv

Representational aspects of depth and conditioning in normalizing flows

Oct 02, 2020
Via arXiv

Neural Dynamical Systems: Balancing Structure and Flexibility in Physical Prediction

Jun 23, 2020
Via arXiv