Chen Zhu

Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization

Jan 31, 2025
Via arXiv

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Jan 18, 2025
Via arXiv

Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment

Jan 16, 2025
Via arXiv

Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following

Oct 21, 2024
Via arXiv

Preference Optimization with Multi-Sample Comparisons

Oct 16, 2024
Via arXiv

DISCO: A Hierarchical Disentangled Cognitive Diagnosis Framework for Interpretable Job Recommendation

Oct 10, 2024
Via arXiv

The Perfect Blend: Redefining RLHF with Mixture of Judges

Sep 30, 2024
Via arXiv

Swarm Intelligence in Geo-Localization: A Multi-Agent Large Vision-Language Model Collaborative Framework

Aug 21, 2024
Via arXiv

Generative Organizational Behavior Simulation using Large Language Model based Autonomous Agents: A Holacracy Perspective

Aug 05, 2024
Via arXiv

Contrastive Factor Analysis

Aug 01, 2024
Via arXiv