Shahrad Mohammadzadeh

Hallucination Detox: Sensitive Neuron Dropout (SeND) for Large Language Model Training

Oct 20, 2024
Via arXiv

Scavenging Hyena: Distilling Transformers into Long Convolution Models

Jan 31, 2024
Via arXiv