Picture for Sapana Chaudhary

Sapana Chaudhary

VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks

Add code
Nov 06, 2025
Figure 1 for VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks
Figure 2 for VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks
Figure 3 for VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks
Figure 4 for VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks
Viaarxiv icon

Teaching Large Language Models to Reason through Learning and Forgetting

Add code
Apr 15, 2025
Viaarxiv icon

Risk-Averse Finetuning of Large Language Models

Add code
Jan 12, 2025
Figure 1 for Risk-Averse Finetuning of Large Language Models
Figure 2 for Risk-Averse Finetuning of Large Language Models
Figure 3 for Risk-Averse Finetuning of Large Language Models
Figure 4 for Risk-Averse Finetuning of Large Language Models
Viaarxiv icon

AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents

Add code
Oct 17, 2024
Figure 1 for AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents
Figure 2 for AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents
Figure 3 for AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents
Figure 4 for AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents
Viaarxiv icon

Pedagogical Alignment of Large Language Models

Add code
Feb 07, 2024
Viaarxiv icon

Dynamic Regret Analysis of Safe Distributed Online Optimization for Convex and Non-convex Problems

Add code
Feb 23, 2023
Viaarxiv icon

Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments

Add code
Sep 26, 2022
Figure 1 for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments
Figure 2 for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments
Figure 3 for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments
Figure 4 for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments
Viaarxiv icon

Safe Online Convex Optimization with Unknown Linear Safety Constraints

Add code
Nov 14, 2021
Figure 1 for Safe Online Convex Optimization with Unknown Linear Safety Constraints
Figure 2 for Safe Online Convex Optimization with Unknown Linear Safety Constraints
Figure 3 for Safe Online Convex Optimization with Unknown Linear Safety Constraints
Figure 4 for Safe Online Convex Optimization with Unknown Linear Safety Constraints
Viaarxiv icon

Smooth Imitation Learning via Smooth Costs and Smooth Policies

Add code
Nov 03, 2021
Figure 1 for Smooth Imitation Learning via Smooth Costs and Smooth Policies
Figure 2 for Smooth Imitation Learning via Smooth Costs and Smooth Policies
Figure 3 for Smooth Imitation Learning via Smooth Costs and Smooth Policies
Figure 4 for Smooth Imitation Learning via Smooth Costs and Smooth Policies
Viaarxiv icon