Mudit Verma

Robust Planning with Compound LLM Architectures: An LLM-Modulo Approach

Nov 20, 2024

Robust Planning with LLM-Modulo Framework: Case Study in Travel Planning

May 31, 2024

On the Brittle Foundations of ReAct Prompting for Agentic Large Language Models

May 22, 2024

Hindsight PRIORs for Reward Learning from Human Preferences

Apr 12, 2024

LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks

Feb 06, 2024

Theory of Mind abilities of Large Language Models in Human-Robot Interaction: An Illusion?

Jan 17, 2024

Benchmarking Multi-Agent Preference-based Reinforcement Learning for Human-AI Teaming

Dec 21, 2023

Methods and Mechanisms for Interactive Novelty Handling in Adversarial Environments

Mar 06, 2023

Data Driven Reward Initialization for Preference based Reinforcement Learning

Feb 17, 2023

A State Augmentation based approach to Reinforcement Learning from Human Preferences

Feb 17, 2023