Picture for Hongteng Xu

Hongteng Xu

GD$^2$PO: Mitigating Multi-Reward Conflicts via Group-Dynamic reward-Decoupled Policy Optimization

Add code
Jun 15, 2026
Viaarxiv icon

Towards Effective Experiential Learning: Dual Guidance for Utilization and Internalization

Add code
Mar 25, 2026
Viaarxiv icon

HypeMed: Enhancing Medication Recommendations with Hypergraph-Based Patient Relationships

Add code
Mar 19, 2026
Viaarxiv icon

SWE-World: Building Software Engineering Agents in Docker-Free Environments

Add code
Feb 03, 2026
Viaarxiv icon

ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought

Add code
Jan 30, 2026
Viaarxiv icon

TheoremForge: Scaling up Formal Data Synthesis with Low-Budget Agentic Workflow

Add code
Jan 24, 2026
Viaarxiv icon

MoORE: SVD-based Model MoE-ization for Conflict- and Oblivion-Resistant Multi-Task Adaptation

Add code
Jun 17, 2025
Viaarxiv icon

PolyConf: Unlocking Polymer Conformation Generation through Hierarchical Generative Models

Add code
Apr 11, 2025
Viaarxiv icon

Learning Structure-enhanced Temporal Point Processes with Gromov-Wasserstein Regularization

Add code
Mar 29, 2025
Figure 1 for Learning Structure-enhanced Temporal Point Processes with Gromov-Wasserstein Regularization
Figure 2 for Learning Structure-enhanced Temporal Point Processes with Gromov-Wasserstein Regularization
Figure 3 for Learning Structure-enhanced Temporal Point Processes with Gromov-Wasserstein Regularization
Figure 4 for Learning Structure-enhanced Temporal Point Processes with Gromov-Wasserstein Regularization
Viaarxiv icon

ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation

Add code
Feb 20, 2025
Viaarxiv icon