Picture for Irina Rish

Irina Rish

RedPajama: an Open Dataset for Training Large Language Models

Add code
Nov 19, 2024
Viaarxiv icon

Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning

Add code
Nov 04, 2024
Viaarxiv icon

Context is Key: A Benchmark for Forecasting with Essential Textual Information

Add code
Oct 24, 2024
Viaarxiv icon

VFA: Vision Frequency Analysis of Foundation Models and Human

Add code
Sep 09, 2024
Viaarxiv icon

Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models

Add code
Jul 17, 2024
Viaarxiv icon

Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent

Add code
Jul 16, 2024
Viaarxiv icon

Towards Adversarially Robust Vision-Language Models: Insights from Design Choices and Prompt Formatting Techniques

Add code
Jul 15, 2024
Viaarxiv icon

Lost in Translation: The Algorithmic Gap Between LMs and the Brain

Add code
Jul 05, 2024
Viaarxiv icon

$μ$LO: Compute-Efficient Meta-Generalization of Learned Optimizers

Add code
May 31, 2024
Viaarxiv icon

Deep Generative Sampling in the Dual Divergence Space: A Data-efficient & Interpretative Approach for Generative AI

Add code
Apr 10, 2024
Viaarxiv icon