Xidong Feng

Efficient Reinforcement Learning with Large Language Model Priors

Oct 10, 2024
Via arXiv

Natural Language Reinforcement Learning

Feb 14, 2024
Via arXiv

Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

Feb 05, 2024
Via arXiv

Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning

Dec 22, 2023
Via arXiv

Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training

Sep 29, 2023
Via arXiv

ChessGPT: Bridging Policy Learning and Language Modeling

Jun 15, 2023
Via arXiv

Contextual Transformer for Offline Meta Reinforcement Learning

Nov 15, 2022
Via arXiv

TorchOpt: An Efficient Library for Differentiable Optimization

Nov 13, 2022
Via arXiv

Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL

Aug 02, 2022
Via arXiv

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning

Jun 17, 2022
Via arXiv