Moshe Berchansky

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Aug 05, 2024
Via arXiv

Distributed Speculative Inference of Large Language Models

May 23, 2024
Via arXiv

Accelerating Speculative Decoding using Dynamic Speculation Length

May 07, 2024
Via arXiv

CoTAR: Chain-of-Thought Attribution Reasoning with Multi-level Granularity

Apr 16, 2024
Via arXiv

Optimizing Retrieval-augmented Reader Models via Token Elimination

Oct 20, 2023
Via arXiv

How to Train BERT with an Academic Budget

Apr 15, 2021
Via arXiv