Yunyi Shen

Reviving The Classics: Active Reward Modeling in Large Language Model Alignment

Feb 04, 2025
Via arXiv

Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs

Feb 04, 2025
Via arXiv

Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and Alternatives

Nov 07, 2024
Via arXiv

Multi-marginal Schrödinger Bridges with Iterative Reference

Aug 12, 2024
Via arXiv

Consistent Validation for Predictive Methods in Spatial Settings

Feb 05, 2024
Via arXiv