Picture for Adam Fisch

Adam Fisch

Shammie

Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA

Add code
Oct 28, 2024
Viaarxiv icon

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning

Add code
Oct 10, 2024
Figure 1 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 2 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 3 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 4 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Viaarxiv icon

Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation

Add code
Jun 06, 2024
Viaarxiv icon

Block Transformer: Global-to-Local Language Modeling for Fast Inference

Add code
Jun 04, 2024
Figure 1 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Figure 2 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Figure 3 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Figure 4 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Viaarxiv icon

Robust Preference Optimization through Reward Model Distillation

Add code
May 29, 2024
Viaarxiv icon

Bayesian Prediction-Powered Inference

Add code
May 09, 2024
Viaarxiv icon

Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking

Add code
Dec 21, 2023
Figure 1 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Figure 2 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Figure 3 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Figure 4 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Viaarxiv icon

Risk-Controlling Model Selection via Guided Bayesian Optimization

Add code
Dec 04, 2023
Viaarxiv icon

Towards Robust and Efficient Continual Language Learning

Add code
Jul 11, 2023
Viaarxiv icon

Conformal Language Modeling

Add code
Jun 16, 2023
Viaarxiv icon