Picture for Adam Fisch

Adam Fisch

Shammie

Don't lie to your friends: Learning what you know from collaborative self-play

Add code
Mar 18, 2025
Viaarxiv icon

Mitigating Preference Hacking in Policy Optimization with Pessimism

Add code
Mar 10, 2025
Viaarxiv icon

Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA

Add code
Oct 28, 2024
Figure 1 for Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Figure 2 for Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Figure 3 for Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Figure 4 for Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Viaarxiv icon

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning

Add code
Oct 10, 2024
Figure 1 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 2 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 3 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 4 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Viaarxiv icon

Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation

Add code
Jun 06, 2024
Figure 1 for Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation
Figure 2 for Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation
Viaarxiv icon

Block Transformer: Global-to-Local Language Modeling for Fast Inference

Add code
Jun 04, 2024
Figure 1 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Figure 2 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Figure 3 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Figure 4 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Viaarxiv icon

Robust Preference Optimization through Reward Model Distillation

Add code
May 29, 2024
Viaarxiv icon

Bayesian Prediction-Powered Inference

Add code
May 09, 2024
Viaarxiv icon

Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking

Add code
Dec 21, 2023
Figure 1 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Figure 2 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Figure 3 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Figure 4 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Viaarxiv icon

Risk-Controlling Model Selection via Guided Bayesian Optimization

Add code
Dec 04, 2023
Viaarxiv icon