Zhanhong Jiang

FAWAC: Feasibility Informed Advantage Weighted Regression for Persistent Safety in Offline Reinforcement Learning

Dec 12, 2024
Via arXiv

Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning

Dec 11, 2024
Via arXiv

DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning Models

Apr 11, 2024
Via arXiv

Neural PDE Solvers for Irregular Domains

Nov 07, 2022
Via arXiv

Distributed Online Non-convex Optimization with Composite Regret

Sep 21, 2022
Via arXiv

Asynchronous Training Schemes in Distributed Learning with Time Delay

Aug 28, 2022
Via arXiv

MDPGT: Momentum-based Decentralized Policy Gradient Tracking

Dec 06, 2021
Via arXiv

Cross-Gradient Aggregation for Decentralized Learning from Non-IID data

Mar 02, 2021
Via arXiv

Decentralized Deep Learning using Momentum-Accelerated Consensus

Oct 21, 2020
Via arXiv

Spatiotemporal Attention for Multivariate Time Series Prediction and Interpretation

Aug 11, 2020
Via arXiv