Mehul Damani

The Surprising Effectiveness of Test-Time Training for Abstract Reasoning

Nov 11, 2024

Learning How Hard to Think: Input-Adaptive Allocation of LM Computation

Oct 07, 2024

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Jul 27, 2023

Distributed Reinforcement Learning for Robot Teams: A Review

Apr 07, 2022

Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World

Mar 30, 2021

PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Lifelong

Oct 16, 2020