Guoqiang Wu

Efficient Hierarchical Implicit Flow Q-learning for Offline Goal-conditioned Reinforcement Learning

Apr 10, 2026

Value-Guidance MeanFlow for Offline Multi-Agent Reinforcement Learning

Apr 09, 2026

Equivariant Efficient Joint Discrete and Continuous MeanFlow for Molecular Graph Generation

Apr 09, 2026

DPR: Diffusion Preference-based Reward for Offline Reinforcement Learning

Mar 03, 2025

Sharper Concentration Inequalities for Multi-Graph Dependent Variables

Feb 25, 2025

A Theory for Conditional Generative Modeling on Multiple Data Sources

Feb 20, 2025

Towards Macro-AUC oriented Imbalanced Multi-Label Continual Learning

Dec 24, 2024

IPL: Leveraging Multimodal Large Language Models for Intelligent Product Listing

Oct 22, 2024

Learning to Race in Extreme Turning Scene with Active Exploration and Gaussian Process Regression-based MPC

Oct 08, 2024

On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability

May 27, 2024