Picture for Erdem Bıyık

Erdem Bıyık

NaVILA: Legged Robot Vision-Language-Action Model for Navigation

Add code
Dec 05, 2024
Viaarxiv icon

Accurate and Data-Efficient Toxicity Prediction when Annotators Disagree

Add code
Oct 16, 2024
Viaarxiv icon

Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions

Add code
Oct 15, 2024
Figure 1 for Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Figure 2 for Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Figure 3 for Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Figure 4 for Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Viaarxiv icon

Trajectory Improvement and Reward Learning from Comparative Language Feedback

Add code
Oct 08, 2024
Figure 1 for Trajectory Improvement and Reward Learning from Comparative Language Feedback
Figure 2 for Trajectory Improvement and Reward Learning from Comparative Language Feedback
Figure 3 for Trajectory Improvement and Reward Learning from Comparative Language Feedback
Figure 4 for Trajectory Improvement and Reward Learning from Comparative Language Feedback
Viaarxiv icon

Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain Stimulation

Add code
Jun 10, 2024
Figure 1 for Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain Stimulation
Figure 2 for Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain Stimulation
Figure 3 for Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain Stimulation
Figure 4 for Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain Stimulation
Viaarxiv icon

ViSaRL: Visual Reinforcement Learning Guided by Human Saliency

Add code
Mar 16, 2024
Figure 1 for ViSaRL: Visual Reinforcement Learning Guided by Human Saliency
Figure 2 for ViSaRL: Visual Reinforcement Learning Guided by Human Saliency
Figure 3 for ViSaRL: Visual Reinforcement Learning Guided by Human Saliency
Figure 4 for ViSaRL: Visual Reinforcement Learning Guided by Human Saliency
Viaarxiv icon

A Generalized Acquisition Function for Preference-based Reward Learning

Add code
Mar 09, 2024
Viaarxiv icon

DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning

Add code
Feb 25, 2024
Viaarxiv icon

Batch Active Learning of Reward Functions from Human Preferences

Add code
Feb 24, 2024
Viaarxiv icon

RoboCLIP: One Demonstration is Enough to Learn Robot Policies

Add code
Oct 11, 2023
Figure 1 for RoboCLIP: One Demonstration is Enough to Learn Robot Policies
Figure 2 for RoboCLIP: One Demonstration is Enough to Learn Robot Policies
Figure 3 for RoboCLIP: One Demonstration is Enough to Learn Robot Policies
Figure 4 for RoboCLIP: One Demonstration is Enough to Learn Robot Policies
Viaarxiv icon