Picture for Tom Goldstein

Tom Goldstein

A Simple Baseline for Predicting Events with Auto-Regressive Tabular Transformers

Add code
Oct 14, 2024
Viaarxiv icon

Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization

Add code
Sep 27, 2024
Viaarxiv icon

Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?

Add code
Jul 24, 2024
Viaarxiv icon

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Add code
Jun 27, 2024
Viaarxiv icon

GenQA: Generating Millions of Instructions from a Handful of Prompts

Add code
Jun 14, 2024
Figure 1 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Figure 2 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Figure 3 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Figure 4 for GenQA: Generating Millions of Instructions from a Handful of Prompts
Viaarxiv icon

PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting

Add code
Jun 14, 2024
Figure 1 for PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting
Figure 2 for PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting
Figure 3 for PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting
Figure 4 for PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting
Viaarxiv icon

From Pixels to Prose: A Large Dataset of Dense Image Captions

Add code
Jun 14, 2024
Figure 1 for From Pixels to Prose: A Large Dataset of Dense Image Captions
Figure 2 for From Pixels to Prose: A Large Dataset of Dense Image Captions
Figure 3 for From Pixels to Prose: A Large Dataset of Dense Image Captions
Figure 4 for From Pixels to Prose: A Large Dataset of Dense Image Captions
Viaarxiv icon

Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs

Add code
Jun 14, 2024
Viaarxiv icon

OPTune: Efficient Online Preference Tuning

Add code
Jun 11, 2024
Figure 1 for OPTune: Efficient Online Preference Tuning
Figure 2 for OPTune: Efficient Online Preference Tuning
Figure 3 for OPTune: Efficient Online Preference Tuning
Figure 4 for OPTune: Efficient Online Preference Tuning
Viaarxiv icon

The CLRS-Text Algorithmic Reasoning Language Benchmark

Add code
Jun 06, 2024
Viaarxiv icon