Picture for Mehul Damani

Mehul Damani

The Surprising Effectiveness of Test-Time Training for Abstract Reasoning

Add code
Nov 11, 2024
Figure 1 for The Surprising Effectiveness of Test-Time Training for Abstract Reasoning
Figure 2 for The Surprising Effectiveness of Test-Time Training for Abstract Reasoning
Figure 3 for The Surprising Effectiveness of Test-Time Training for Abstract Reasoning
Figure 4 for The Surprising Effectiveness of Test-Time Training for Abstract Reasoning
Viaarxiv icon

Learning How Hard to Think: Input-Adaptive Allocation of LM Computation

Add code
Oct 07, 2024
Viaarxiv icon

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Add code
Jul 27, 2023
Figure 1 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 2 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 3 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 4 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Viaarxiv icon

Distributed Reinforcement Learning for Robot Teams: A Review

Add code
Apr 07, 2022
Figure 1 for Distributed Reinforcement Learning for Robot Teams: A Review
Figure 2 for Distributed Reinforcement Learning for Robot Teams: A Review
Figure 3 for Distributed Reinforcement Learning for Robot Teams: A Review
Figure 4 for Distributed Reinforcement Learning for Robot Teams: A Review
Viaarxiv icon

Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World

Add code
Mar 30, 2021
Figure 1 for Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World
Figure 2 for Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World
Figure 3 for Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World
Figure 4 for Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World
Viaarxiv icon

PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Lifelong

Add code
Oct 16, 2020
Figure 1 for PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Lifelong
Figure 2 for PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Lifelong
Figure 3 for PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Lifelong
Figure 4 for PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Lifelong
Viaarxiv icon