
Xiangxin Zhou

Rethinking the Trust Region in LLM Reinforcement Learning

Feb 04, 2026
Via arXiv

Defeating the Training-Inference Mismatch via FP16

Oct 30, 2025
Via arXiv

Variational Reasoning for Language Models

Sep 26, 2025
Via arXiv

ChemBOMAS: Accelerated BO in Chemistry with LLM-Enhanced Multi-Agent System

Sep 10, 2025
Via arXiv

OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use

Aug 06, 2025
Via arXiv

Reinforcing General Reasoning without Verifiers

May 27, 2025
Via arXiv

Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling

May 27, 2025
Via arXiv

An All-Atom Generative Model for Designing Protein Complexes

Apr 17, 2025
Via arXiv

Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows

Mar 06, 2025
Via arXiv

UniMatch: Universal Matching from Atom to Task for Few-Shot Drug Discovery

Feb 18, 2025
Via arXiv