Amrit Singh Bedi

Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction

Nov 01, 2024
Via arXiv

EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering

Oct 26, 2024
Via arXiv

On The Global Convergence Of Online RLHF With Neural Parametrization

Oct 21, 2024
Via arXiv

On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning

Oct 05, 2024
Via arXiv

AIME: AI System Optimization via Multiple LLM Evaluators

Oct 04, 2024
Via arXiv

Auction-Based Regulation for Artificial Intelligence

Oct 02, 2024
Via arXiv

CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk

Aug 16, 2024
Via arXiv

TrustNavGPT: Modeling Uncertainty to Improve Trustworthiness of Audio-Guided LLM-Based Robot Navigation

Aug 03, 2024
Via arXiv

Embodied Question Answering via Multi-LLM Systems

Jun 18, 2024
Via arXiv

DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning

Jun 16, 2024
Via arXiv