Picture for Sobhan Miryoosefi

Sobhan Miryoosefi

On the Inductive Bias of Stacking Towards Improving Reasoning

Add code
Sep 27, 2024
Viaarxiv icon

Landscape-Aware Growing: The Power of a Little LAG

Add code
Jun 04, 2024
Figure 1 for Landscape-Aware Growing: The Power of a Little LAG
Figure 2 for Landscape-Aware Growing: The Power of a Little LAG
Figure 3 for Landscape-Aware Growing: The Power of a Little LAG
Figure 4 for Landscape-Aware Growing: The Power of a Little LAG
Viaarxiv icon

Efficient Stagewise Pretraining via Progressive Subnetworks

Add code
Feb 08, 2024
Viaarxiv icon

ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

Add code
Dec 15, 2023
Viaarxiv icon

Provable Reinforcement Learning with a Short-Term Memory

Add code
Feb 08, 2022
Figure 1 for Provable Reinforcement Learning with a Short-Term Memory
Figure 2 for Provable Reinforcement Learning with a Short-Term Memory
Viaarxiv icon

A Simple Reward-free Approach to Constrained Reinforcement Learning

Add code
Jul 12, 2021
Figure 1 for A Simple Reward-free Approach to Constrained Reinforcement Learning
Viaarxiv icon

Bellman Eluder Dimension: New Rich Classes of RL Problems, and Sample-Efficient Algorithms

Add code
Feb 05, 2021
Figure 1 for Bellman Eluder Dimension: New Rich Classes of RL Problems, and Sample-Efficient Algorithms
Viaarxiv icon

Constrained episodic reinforcement learning in concave-convex and knapsack settings

Add code
Jun 09, 2020
Figure 1 for Constrained episodic reinforcement learning in concave-convex and knapsack settings
Figure 2 for Constrained episodic reinforcement learning in concave-convex and knapsack settings
Viaarxiv icon

Reinforcement Learning with Convex Constraints

Add code
Jun 21, 2019
Figure 1 for Reinforcement Learning with Convex Constraints
Figure 2 for Reinforcement Learning with Convex Constraints
Viaarxiv icon