Yiming Lu

Systematic Analysis of LLM Contributions to Planning: Solver, Verifier, Heuristic

Dec 12, 2024

STRUX: An LLM for Decision-Making with Structured Explanations

Oct 16, 2024

DeFine: Enhancing LLM Decision-Making with Factor Profiles and Analogical Reasoning

Oct 02, 2024

Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL

Feb 14, 2022

SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition

Nov 17, 2021

Towards robust and domain agnostic reinforcement learning competitions

Jun 07, 2021