Picture for MohammadHossein Rezaei

MohammadHossein Rezaei

Rubric-Guided Self-Distillation: Post-Training Without Rubric Verifiers

Add code
Jun 10, 2026
Viaarxiv icon

Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR

Add code
May 19, 2026
Viaarxiv icon

Reward Hacking in Rubric-Based Reinforcement Learning

Add code
May 12, 2026
Viaarxiv icon

Commonsense Knowledge with Negation: A Resource to Enhance Negation Understanding

Add code
Apr 21, 2026
Viaarxiv icon

Online Rubrics Elicitation from Pairwise Comparisons

Add code
Oct 08, 2025
Figure 1 for Online Rubrics Elicitation from Pairwise Comparisons
Figure 2 for Online Rubrics Elicitation from Pairwise Comparisons
Figure 3 for Online Rubrics Elicitation from Pairwise Comparisons
Figure 4 for Online Rubrics Elicitation from Pairwise Comparisons
Viaarxiv icon

EgoNormia: Benchmarking Physical Social Norm Understanding

Add code
Feb 27, 2025
Viaarxiv icon

Making Language Models Robust Against Negation

Add code
Feb 11, 2025
Figure 1 for Making Language Models Robust Against Negation
Figure 2 for Making Language Models Robust Against Negation
Figure 3 for Making Language Models Robust Against Negation
Figure 4 for Making Language Models Robust Against Negation
Viaarxiv icon

Paraphrasing in Affirmative Terms Improves Negation Understanding

Add code
Jun 11, 2024
Figure 1 for Paraphrasing in Affirmative Terms Improves Negation Understanding
Figure 2 for Paraphrasing in Affirmative Terms Improves Negation Understanding
Figure 3 for Paraphrasing in Affirmative Terms Improves Negation Understanding
Figure 4 for Paraphrasing in Affirmative Terms Improves Negation Understanding
Viaarxiv icon

Interpreting Indirect Answers to Yes-No Questions in Multiple Languages

Add code
Oct 20, 2023
Figure 1 for Interpreting Indirect Answers to Yes-No Questions in Multiple Languages
Figure 2 for Interpreting Indirect Answers to Yes-No Questions in Multiple Languages
Figure 3 for Interpreting Indirect Answers to Yes-No Questions in Multiple Languages
Figure 4 for Interpreting Indirect Answers to Yes-No Questions in Multiple Languages
Viaarxiv icon