Picture for Tianwei Ni

Tianwei Ni

Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers

Add code
Mar 29, 2024
Figure 1 for Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers
Figure 2 for Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers
Figure 3 for Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers
Figure 4 for Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers
Viaarxiv icon

Do Transformer World Models Give Better Policy Gradients?

Add code
Feb 11, 2024
Figure 1 for Do Transformer World Models Give Better Policy Gradients?
Figure 2 for Do Transformer World Models Give Better Policy Gradients?
Figure 3 for Do Transformer World Models Give Better Policy Gradients?
Figure 4 for Do Transformer World Models Give Better Policy Gradients?
Viaarxiv icon

Bridging State and History Representations: Understanding Self-Predictive RL

Add code
Jan 17, 2024
Viaarxiv icon

When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment

Add code
Jul 31, 2023
Viaarxiv icon

Towards Disturbance-Free Visual Mobile Manipulation

Add code
Dec 17, 2021
Figure 1 for Towards Disturbance-Free Visual Mobile Manipulation
Figure 2 for Towards Disturbance-Free Visual Mobile Manipulation
Figure 3 for Towards Disturbance-Free Visual Mobile Manipulation
Figure 4 for Towards Disturbance-Free Visual Mobile Manipulation
Viaarxiv icon

Recurrent Model-Free RL is a Strong Baseline for Many POMDPs

Add code
Oct 11, 2021
Figure 1 for Recurrent Model-Free RL is a Strong Baseline for Many POMDPs
Figure 2 for Recurrent Model-Free RL is a Strong Baseline for Many POMDPs
Figure 3 for Recurrent Model-Free RL is a Strong Baseline for Many POMDPs
Figure 4 for Recurrent Model-Free RL is a Strong Baseline for Many POMDPs
Viaarxiv icon

Adaptive Agent Architecture for Real-time Human-Agent Teaming

Add code
Mar 07, 2021
Figure 1 for Adaptive Agent Architecture for Real-time Human-Agent Teaming
Figure 2 for Adaptive Agent Architecture for Real-time Human-Agent Teaming
Figure 3 for Adaptive Agent Architecture for Real-time Human-Agent Teaming
Figure 4 for Adaptive Agent Architecture for Real-time Human-Agent Teaming
Viaarxiv icon

f-IRL: Inverse Reinforcement Learning via State Marginal Matching

Add code
Nov 09, 2020
Figure 1 for f-IRL: Inverse Reinforcement Learning via State Marginal Matching
Figure 2 for f-IRL: Inverse Reinforcement Learning via State Marginal Matching
Figure 3 for f-IRL: Inverse Reinforcement Learning via State Marginal Matching
Figure 4 for f-IRL: Inverse Reinforcement Learning via State Marginal Matching
Viaarxiv icon

Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient

Add code
Jul 03, 2020
Figure 1 for Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient
Figure 2 for Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient
Figure 3 for Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient
Figure 4 for Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient
Viaarxiv icon

Phase Collaborative Network for Multi-Phase Medical Imaging Segmentation

Add code
Dec 06, 2018
Figure 1 for Phase Collaborative Network for Multi-Phase Medical Imaging Segmentation
Figure 2 for Phase Collaborative Network for Multi-Phase Medical Imaging Segmentation
Figure 3 for Phase Collaborative Network for Multi-Phase Medical Imaging Segmentation
Figure 4 for Phase Collaborative Network for Multi-Phase Medical Imaging Segmentation
Viaarxiv icon