Picture for Zhizhou Ren

Zhizhou Ren

Full-Atom Peptide Design based on Multi-modal Flow Matching

Add code
Jun 02, 2024
Figure 1 for Full-Atom Peptide Design based on Multi-modal Flow Matching
Figure 2 for Full-Atom Peptide Design based on Multi-modal Flow Matching
Figure 3 for Full-Atom Peptide Design based on Multi-modal Flow Matching
Figure 4 for Full-Atom Peptide Design based on Multi-modal Flow Matching
Viaarxiv icon

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Add code
May 23, 2024
Viaarxiv icon

Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation

Add code
Nov 20, 2022
Figure 1 for Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation
Figure 2 for Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation
Figure 3 for Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation
Viaarxiv icon

Self-Organized Polynomial-Time Coordination Graphs

Add code
Dec 07, 2021
Figure 1 for Self-Organized Polynomial-Time Coordination Graphs
Figure 2 for Self-Organized Polynomial-Time Coordination Graphs
Figure 3 for Self-Organized Polynomial-Time Coordination Graphs
Figure 4 for Self-Organized Polynomial-Time Coordination Graphs
Viaarxiv icon

Learning Long-Term Reward Redistribution via Randomized Return Decomposition

Add code
Nov 26, 2021
Figure 1 for Learning Long-Term Reward Redistribution via Randomized Return Decomposition
Figure 2 for Learning Long-Term Reward Redistribution via Randomized Return Decomposition
Figure 3 for Learning Long-Term Reward Redistribution via Randomized Return Decomposition
Figure 4 for Learning Long-Term Reward Redistribution via Randomized Return Decomposition
Viaarxiv icon

On the Estimation Bias in Double Q-Learning

Add code
Sep 29, 2021
Figure 1 for On the Estimation Bias in Double Q-Learning
Figure 2 for On the Estimation Bias in Double Q-Learning
Figure 3 for On the Estimation Bias in Double Q-Learning
Figure 4 for On the Estimation Bias in Double Q-Learning
Viaarxiv icon

Off-Policy Reinforcement Learning with Delayed Rewards

Add code
Jun 22, 2021
Figure 1 for Off-Policy Reinforcement Learning with Delayed Rewards
Figure 2 for Off-Policy Reinforcement Learning with Delayed Rewards
Figure 3 for Off-Policy Reinforcement Learning with Delayed Rewards
Figure 4 for Off-Policy Reinforcement Learning with Delayed Rewards
Viaarxiv icon

Generalizable Episodic Memory for Deep Reinforcement Learning

Add code
Mar 11, 2021
Figure 1 for Generalizable Episodic Memory for Deep Reinforcement Learning
Figure 2 for Generalizable Episodic Memory for Deep Reinforcement Learning
Figure 3 for Generalizable Episodic Memory for Deep Reinforcement Learning
Figure 4 for Generalizable Episodic Memory for Deep Reinforcement Learning
Viaarxiv icon

QPLEX: Duplex Dueling Multi-Agent Q-Learning

Add code
Aug 03, 2020
Figure 1 for QPLEX: Duplex Dueling Multi-Agent Q-Learning
Figure 2 for QPLEX: Duplex Dueling Multi-Agent Q-Learning
Figure 3 for QPLEX: Duplex Dueling Multi-Agent Q-Learning
Figure 4 for QPLEX: Duplex Dueling Multi-Agent Q-Learning
Viaarxiv icon

Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning

Add code
Jun 23, 2020
Figure 1 for Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning
Figure 2 for Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning
Figure 3 for Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning
Viaarxiv icon