Picture for Ravid Shwartz-Ziv

Ravid Shwartz-Ziv

Does Representation Matter? Exploring Intermediate Layers in Large Language Models

Add code
Dec 12, 2024
Viaarxiv icon

Rate-In: Information-Driven Adaptive Dropout Rates for Improved Inference-Time Uncertainty Estimation

Add code
Dec 10, 2024
Viaarxiv icon

Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning

Add code
Nov 04, 2024
Viaarxiv icon

Learning to Compress: Local Rank and Information Compression in Deep Neural Networks

Add code
Oct 10, 2024
Viaarxiv icon

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Add code
Jun 27, 2024
Viaarxiv icon

Just How Flexible are Neural Networks in Practice?

Add code
Jun 17, 2024
Viaarxiv icon

Towards an Improved Understanding and Utilization of Maximum Manifold Capacity Representations

Add code
Jun 13, 2024
Viaarxiv icon

The Entropy Enigma: Success and Failure of Entropy Minimization

Add code
May 08, 2024
Viaarxiv icon

Simplifying Neural Network Training Under Class Imbalance

Add code
Dec 05, 2023
Viaarxiv icon

Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs

Add code
Sep 28, 2023
Viaarxiv icon