Picture for Adam Fisch

Adam Fisch

Shammie

Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA

Add code
Oct 28, 2024
Figure 1 for Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Figure 2 for Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Figure 3 for Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Figure 4 for Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Viaarxiv icon

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning

Add code
Oct 10, 2024
Figure 1 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 2 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 3 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 4 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Viaarxiv icon

Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation

Add code
Jun 06, 2024
Figure 1 for Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation
Figure 2 for Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation
Viaarxiv icon

Block Transformer: Global-to-Local Language Modeling for Fast Inference

Add code
Jun 04, 2024
Figure 1 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Figure 2 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Figure 3 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Figure 4 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Viaarxiv icon

Robust Preference Optimization through Reward Model Distillation

Add code
May 29, 2024
Viaarxiv icon

Bayesian Prediction-Powered Inference

Add code
May 09, 2024
Viaarxiv icon

Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking

Add code
Dec 21, 2023
Figure 1 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Figure 2 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Figure 3 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Figure 4 for Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking
Viaarxiv icon

Risk-Controlling Model Selection via Guided Bayesian Optimization

Add code
Dec 04, 2023
Viaarxiv icon

Towards Robust and Efficient Continual Language Learning

Add code
Jul 11, 2023
Figure 1 for Towards Robust and Efficient Continual Language Learning
Figure 2 for Towards Robust and Efficient Continual Language Learning
Figure 3 for Towards Robust and Efficient Continual Language Learning
Figure 4 for Towards Robust and Efficient Continual Language Learning
Viaarxiv icon

Conformal Language Modeling

Add code
Jun 16, 2023
Viaarxiv icon