Tianwei Ni

Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers

Mar 29, 2024

Do Transformer World Models Give Better Policy Gradients?

Feb 11, 2024

Bridging State and History Representations: Understanding Self-Predictive RL

Jan 17, 2024

When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment

Jul 31, 2023

Towards Disturbance-Free Visual Mobile Manipulation

Dec 17, 2021

Recurrent Model-Free RL is a Strong Baseline for Many POMDPs

Oct 11, 2021

Adaptive Agent Architecture for Real-time Human-Agent Teaming

Mar 07, 2021

f-IRL: Inverse Reinforcement Learning via State Marginal Matching

Nov 09, 2020

Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient

Jul 03, 2020

Phase Collaborative Network for Multi-Phase Medical Imaging Segmentation

Dec 06, 2018