Debing Zhang

Scalable Oversight for Superhuman AI via Recursive Self-Critiquing

Feb 07, 2025

SedarEval: Automated Evaluation using Self-Adaptive Rubrics

Jan 26, 2025

RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?

Jan 20, 2025

Diff-Instruct*: Towards Human-Preferred One-step Text-to-image Generative Models

Oct 28, 2024

Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?

Oct 08, 2024

CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning

Oct 03, 2024

Untie the Knots: An Efficient Data Augmentation Strategy for Long-Context Pre-Training in Language Models

Sep 07, 2024

Critic-CoT: Boosting the reasoning abilities of large language model via Chain-of-thoughts Critic

Aug 29, 2024

SLR: Learning Quadruped Locomotion without Privileged Information

Jun 07, 2024

MPI-Flow: Learning Realistic Optical Flow with Multiplane Images

Sep 13, 2023