Hanjing Wang

Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis

Mar 11, 2026
Via arXiv

ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning

Mar 12, 2025
Via arXiv

Boost, Disentangle, and Customize: A Robust System2-to-System1 Pipeline for Code Generation

Feb 18, 2025
Via arXiv

Epistemic Uncertainty Quantification For Pre-trained Neural Network

Apr 15, 2024
Via arXiv

GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models

Oct 08, 2023
Via arXiv

Body Knowledge and Uncertainty Modeling for Monocular 3D Human Body Reconstruction

Aug 01, 2023
Via arXiv

Large Sequence Models for Sequential Decision-Making: A Survey

Jun 24, 2023
Via arXiv

Gradient-based Uncertainty Attribution for Explainable Bayesian Deep Learning

Apr 10, 2023
Via arXiv

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning

Jun 05, 2021
Via arXiv

AdaWISH: Faster Discrete Integration via Adaptive Quantiles

Oct 13, 2019
Via arXiv