Picture for Zhanhong Jiang

Zhanhong Jiang

FUSE: First-Order and Second-Order Unified SynthEsis in Stochastic Optimization

Add code
Mar 06, 2025
Viaarxiv icon

Enhancing PPO with Trajectory-Aware Hybrid Policies

Add code
Feb 21, 2025
Figure 1 for Enhancing PPO with Trajectory-Aware Hybrid Policies
Figure 2 for Enhancing PPO with Trajectory-Aware Hybrid Policies
Figure 3 for Enhancing PPO with Trajectory-Aware Hybrid Policies
Figure 4 for Enhancing PPO with Trajectory-Aware Hybrid Policies
Viaarxiv icon

RLS3: RL-Based Synthetic Sample Selection to Enhance Spatial Reasoning in Vision-Language Models for Indoor Autonomous Perception

Add code
Jan 31, 2025
Viaarxiv icon

STITCH: Surface reconstrucTion using Implicit neural representations with Topology Constraints and persistent Homology

Add code
Dec 24, 2024
Figure 1 for STITCH: Surface reconstrucTion using Implicit neural representations with Topology Constraints and persistent Homology
Figure 2 for STITCH: Surface reconstrucTion using Implicit neural representations with Topology Constraints and persistent Homology
Figure 3 for STITCH: Surface reconstrucTion using Implicit neural representations with Topology Constraints and persistent Homology
Figure 4 for STITCH: Surface reconstrucTion using Implicit neural representations with Topology Constraints and persistent Homology
Viaarxiv icon

FAWAC: Feasibility Informed Advantage Weighted Regression for Persistent Safety in Offline Reinforcement Learning

Add code
Dec 12, 2024
Viaarxiv icon

Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning

Add code
Dec 11, 2024
Viaarxiv icon

DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning Models

Add code
Apr 11, 2024
Figure 1 for DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning Models
Figure 2 for DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning Models
Figure 3 for DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning Models
Figure 4 for DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning Models
Viaarxiv icon

Neural PDE Solvers for Irregular Domains

Add code
Nov 07, 2022
Viaarxiv icon

Distributed Online Non-convex Optimization with Composite Regret

Add code
Sep 21, 2022
Figure 1 for Distributed Online Non-convex Optimization with Composite Regret
Viaarxiv icon

Asynchronous Training Schemes in Distributed Learning with Time Delay

Add code
Aug 28, 2022
Figure 1 for Asynchronous Training Schemes in Distributed Learning with Time Delay
Figure 2 for Asynchronous Training Schemes in Distributed Learning with Time Delay
Figure 3 for Asynchronous Training Schemes in Distributed Learning with Time Delay
Figure 4 for Asynchronous Training Schemes in Distributed Learning with Time Delay
Viaarxiv icon