Picture for Weinan Zhang

Weinan Zhang

Information-Theoretic Reward Decomposition for Generalizable RLHF

Add code
Apr 08, 2025
Viaarxiv icon

AdvKT: An Adversarial Multi-Step Training Framework for Knowledge Tracing

Add code
Apr 07, 2025
Viaarxiv icon

AgentNet: Decentralized Evolutionary Coordination for LLM-based Multi-Agent Systems

Add code
Apr 01, 2025
Viaarxiv icon

Sell It Before You Make It: Revolutionizing E-Commerce with Personalized AI-Generated Items

Add code
Mar 28, 2025
Viaarxiv icon

ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning

Add code
Mar 12, 2025
Viaarxiv icon

Adding Alignment Control to Language Models

Add code
Mar 07, 2025
Viaarxiv icon

PALo: Learning Posture-Aware Locomotion for Quadruped Robots

Add code
Mar 06, 2025
Viaarxiv icon

Humanoid Whole-Body Locomotion on Narrow Terrain via Dynamic Balance and Reinforcement Learning

Add code
Feb 24, 2025
Viaarxiv icon

ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning

Add code
Feb 22, 2025
Viaarxiv icon

Retrieval-Augmented Process Reward Model for Generalizable Mathematical Reasoning

Add code
Feb 20, 2025
Viaarxiv icon