Picture for Ruizhe Chen

Ruizhe Chen

CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making

Add code
Jun 15, 2025
Viaarxiv icon

Med-U1: Incentivizing Unified Medical Reasoning in LLMs via Large-scale Reinforcement Learning

Add code
Jun 14, 2025
Viaarxiv icon

BiasFilter: An Inference-Time Debiasing Framework for Large Language Models

Add code
May 28, 2025
Viaarxiv icon

FRN: Fractal-Based Recursive Spectral Reconstruction Network

Add code
May 21, 2025
Viaarxiv icon

BiasGuard: A Reasoning-enhanced Bias Detection Tool For Large Language Models

Add code
Apr 30, 2025
Viaarxiv icon

FairSteer: Inference Time Debiasing for LLMs with Dynamic Activation Steering

Add code
Apr 20, 2025
Viaarxiv icon

An All-Atom Generative Model for Designing Protein Complexes

Add code
Apr 17, 2025
Viaarxiv icon

Persona-judge: Personalized Alignment of Large Language Models via Token-level Self-judgment

Add code
Apr 17, 2025
Viaarxiv icon

MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning

Add code
Apr 14, 2025
Viaarxiv icon

DiffPO: Diffusion-styled Preference Optimization for Efficient Inference-Time Alignment of Large Language Models

Add code
Mar 06, 2025
Viaarxiv icon