Gal Shachaf

Jamba-1.5: Hybrid Transformer-Mamba Models at Scale

Aug 22, 2024
Via arXiv

MRKL Systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning

May 01, 2022
Via arXiv

Learning to Retrieve Passages without Supervision

Dec 14, 2021
Via arXiv

A Theoretical Analysis of Fine-tuning with Linear Teachers

Jul 04, 2021
Via arXiv