Picture for Tom Goldstein

Tom Goldstein

When Can You Get Away with Low Memory Adam?

Add code
Mar 03, 2025
Viaarxiv icon

Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers

Add code
Feb 12, 2025
Viaarxiv icon

Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks

Add code
Feb 12, 2025
Viaarxiv icon

Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs

Add code
Feb 10, 2025
Figure 1 for Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs
Figure 2 for Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs
Figure 3 for Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs
Figure 4 for Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs
Viaarxiv icon

Gemstones: A Model Suite for Multi-Faceted Scaling Laws

Add code
Feb 07, 2025
Figure 1 for Gemstones: A Model Suite for Multi-Faceted Scaling Laws
Figure 2 for Gemstones: A Model Suite for Multi-Faceted Scaling Laws
Figure 3 for Gemstones: A Model Suite for Multi-Faceted Scaling Laws
Figure 4 for Gemstones: A Model Suite for Multi-Faceted Scaling Laws
Viaarxiv icon

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Add code
Feb 07, 2025
Viaarxiv icon

Transformers Boost the Performance of Decision Trees on Tabular Data across Sample Sizes

Add code
Feb 06, 2025
Viaarxiv icon

Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models

Add code
Dec 19, 2024
Figure 1 for Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models
Figure 2 for Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models
Figure 3 for Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models
Figure 4 for Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models
Viaarxiv icon

Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models

Add code
Dec 09, 2024
Figure 1 for Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models
Figure 2 for Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models
Figure 3 for Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models
Figure 4 for Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models
Viaarxiv icon

EditScout: Locating Forged Regions from Diffusion-based Edited Images with Multimodal LLM

Add code
Dec 05, 2024
Figure 1 for EditScout: Locating Forged Regions from Diffusion-based Edited Images with Multimodal LLM
Figure 2 for EditScout: Locating Forged Regions from Diffusion-based Edited Images with Multimodal LLM
Figure 3 for EditScout: Locating Forged Regions from Diffusion-based Edited Images with Multimodal LLM
Figure 4 for EditScout: Locating Forged Regions from Diffusion-based Edited Images with Multimodal LLM
Viaarxiv icon