Omer Antverg

Jamba-1.5: Hybrid Transformer-Mamba Models at Scale

Aug 22, 2024
Via arXiv

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Nov 09, 2022
Via arXiv

IDANI: Inference-time Domain Adaptation via Neuron-level Interventions

Jun 01, 2022
Via arXiv

On the Pitfalls of Analyzing Individual Neurons in Language Models

Oct 14, 2021
Via arXiv