Picture for Bo Wang

Bo Wang

Tencent, WeChat Pay

BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning

Add code
Mar 05, 2026
Viaarxiv icon

Not All Candidates are Created Equal: A Heterogeneity-Aware Approach to Pre-ranking in Recommender Systems

Add code
Mar 04, 2026
Viaarxiv icon

A Novel Reconfigurable Dexterous Hand Based on Triple-Symmetric Bricard Parallel Mechanism

Add code
Mar 01, 2026
Viaarxiv icon

Generative Recommendation for Large-Scale Advertising

Add code
Feb 26, 2026
Viaarxiv icon

An LLM-Enabled Frequency-Aware Flow Diffusion Model for Natural-Language-Guided Power System Scenario Generation

Add code
Feb 23, 2026
Viaarxiv icon

dnaHNet: A Scalable and Hierarchical Foundation Model for Genomic Sequence Learning

Add code
Feb 14, 2026
Viaarxiv icon

Diffusion-Pretrained Dense and Contextual Embeddings

Add code
Feb 13, 2026
Viaarxiv icon

Discovering Semantic Latent Structures in Psychological Scales: A Response-Free Pathway to Efficient Simplification

Add code
Feb 13, 2026
Viaarxiv icon

EchoJEPA: A Latent Predictive Foundation Model for Echocardiography

Add code
Feb 02, 2026
Viaarxiv icon

SetPO: Set-Level Policy Optimization for Diversity-Preserving LLM Reasoning

Add code
Feb 01, 2026
Viaarxiv icon