Picture for Siddhant Bhambri

Siddhant Bhambri

Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation

Add code
May 20, 2025
Viaarxiv icon

Who is Helping Whom? Analyzing Inter-dependencies to Evaluate Cooperation in Human-AI Teaming

Add code
Feb 10, 2025
Figure 1 for Who is Helping Whom? Analyzing Inter-dependencies to Evaluate Cooperation in Human-AI Teaming
Figure 2 for Who is Helping Whom? Analyzing Inter-dependencies to Evaluate Cooperation in Human-AI Teaming
Figure 3 for Who is Helping Whom? Analyzing Inter-dependencies to Evaluate Cooperation in Human-AI Teaming
Figure 4 for Who is Helping Whom? Analyzing Inter-dependencies to Evaluate Cooperation in Human-AI Teaming
Viaarxiv icon

Robust Planning with LLM-Modulo Framework: Case Study in Travel Planning

Add code
May 31, 2024
Viaarxiv icon

Efficient Reinforcement Learning via Large Language Model-based Search

Add code
May 24, 2024
Figure 1 for Efficient Reinforcement Learning via Large Language Model-based Search
Figure 2 for Efficient Reinforcement Learning via Large Language Model-based Search
Figure 3 for Efficient Reinforcement Learning via Large Language Model-based Search
Figure 4 for Efficient Reinforcement Learning via Large Language Model-based Search
Viaarxiv icon

On the Brittle Foundations of ReAct Prompting for Agentic Large Language Models

Add code
May 22, 2024
Figure 1 for On the Brittle Foundations of ReAct Prompting for Agentic Large Language Models
Figure 2 for On the Brittle Foundations of ReAct Prompting for Agentic Large Language Models
Figure 3 for On the Brittle Foundations of ReAct Prompting for Agentic Large Language Models
Figure 4 for On the Brittle Foundations of ReAct Prompting for Agentic Large Language Models
Viaarxiv icon

LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks

Add code
Feb 06, 2024
Figure 1 for LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks
Figure 2 for LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks
Figure 3 for LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks
Viaarxiv icon

Theory of Mind abilities of Large Language Models in Human-Robot Interaction : An Illusion?

Add code
Jan 17, 2024
Viaarxiv icon

Benchmarking Multi-Agent Preference-based Reinforcement Learning for Human-AI Teaming

Add code
Dec 21, 2023
Viaarxiv icon

Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning

Add code
Feb 17, 2023
Figure 1 for Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning
Figure 2 for Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning
Viaarxiv icon

Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach

Add code
Nov 29, 2022
Figure 1 for Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach
Figure 2 for Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach
Figure 3 for Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach
Figure 4 for Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach
Viaarxiv icon