Sapana Chaudhary

VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks

Nov 06, 2025

Teaching Large Language Models to Reason through Learning and Forgetting

Apr 15, 2025

Risk-Averse Finetuning of Large Language Models

Jan 12, 2025

AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents

Oct 17, 2024

Pedagogical Alignment of Large Language Models

Feb 07, 2024

Dynamic Regret Analysis of Safe Distributed Online Optimization for Convex and Non-convex Problems

Feb 23, 2023

Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments

Sep 26, 2022

Safe Online Convex Optimization with Unknown Linear Safety Constraints

Nov 14, 2021

Smooth Imitation Learning via Smooth Costs and Smooth Policies

Nov 03, 2021