Xinbing Wang

Controlling Underestimation Bias in Constrained Reinforcement Learning for Safe Exploration

Jan 17, 2026
Via arXiv

Extreme Value Policy Optimization for Safe Reinforcement Learning

Jan 17, 2026
Via arXiv

Improving LLM Reasoning with Homophily-aware Structural and Semantic Text-Attributed Graph Compression

Jan 13, 2026
Via arXiv

Pretraining Language Models to Ponder in Continuous Space

May 27, 2025
Via arXiv

Bayesian Cross-Modal Alignment Learning for Few-Shot Out-of-Distribution Generalization

Apr 22, 2025
Via arXiv

CHAINSFORMER: Numerical Reasoning on Knowledge Graphs from a Chain Perspective

Apr 19, 2025
Via arXiv

InfoBound: A Provable Information-Bounds Inspired Framework for Both OoD Generalization and OoD Detection

Apr 13, 2025
Via arXiv

Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration

Feb 17, 2025
Via arXiv

Less is More: Masking Elements in Image Condition Features Avoids Content Leakages in Style Transfer Diffusion Models

Feb 11, 2025
Via arXiv

KaLM: Knowledge-aligned Autoregressive Language Modeling via Dual-view Knowledge Graph Contrastive Learning

Dec 06, 2024
Via arXiv