Picture for Sahil Badyal

Sahil Badyal

Deconstructing Instruction-Following: A New Benchmark for Granular Evaluation of Large Language Model Instruction Compliance Abilities

Add code
Jan 26, 2026
Viaarxiv icon

Enhancing LLM Instruction Following: An Evaluation-Driven Multi-Agentic Workflow for Prompt Instructions Optimization

Add code
Jan 06, 2026
Viaarxiv icon

Multiagent Rollout and Policy Iteration for POMDP with Application to Multi-Robot Repair Problems

Add code
Nov 09, 2020
Figure 1 for Multiagent Rollout and Policy Iteration for POMDP with Application to Multi-Robot Repair Problems
Figure 2 for Multiagent Rollout and Policy Iteration for POMDP with Application to Multi-Robot Repair Problems
Figure 3 for Multiagent Rollout and Policy Iteration for POMDP with Application to Multi-Robot Repair Problems
Figure 4 for Multiagent Rollout and Policy Iteration for POMDP with Application to Multi-Robot Repair Problems
Viaarxiv icon

Reinforcement Learning for POMDP: Partitioned Rollout and Policy Iteration with Application to Autonomous Sequential Repair Problems

Add code
Feb 11, 2020
Figure 1 for Reinforcement Learning for POMDP: Partitioned Rollout and Policy Iteration with Application to Autonomous Sequential Repair Problems
Figure 2 for Reinforcement Learning for POMDP: Partitioned Rollout and Policy Iteration with Application to Autonomous Sequential Repair Problems
Figure 3 for Reinforcement Learning for POMDP: Partitioned Rollout and Policy Iteration with Application to Autonomous Sequential Repair Problems
Figure 4 for Reinforcement Learning for POMDP: Partitioned Rollout and Policy Iteration with Application to Autonomous Sequential Repair Problems
Viaarxiv icon