Picture for Thomas Hartvigsen

Thomas Hartvigsen

Sparse Autoencoder Features for Classifications and Transferability

Add code
Feb 17, 2025
Viaarxiv icon

Lifelong Sequential Knowledge Editing without Model Degradation

Add code
Feb 03, 2025
Viaarxiv icon

Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Add code
Nov 26, 2024
Figure 1 for Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens
Figure 2 for Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens
Figure 3 for Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens
Figure 4 for Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens
Viaarxiv icon

BendVLM: Test-Time Debiasing of Vision-Language Embeddings

Add code
Nov 07, 2024
Figure 1 for BendVLM: Test-Time Debiasing of Vision-Language Embeddings
Figure 2 for BendVLM: Test-Time Debiasing of Vision-Language Embeddings
Figure 3 for BendVLM: Test-Time Debiasing of Vision-Language Embeddings
Figure 4 for BendVLM: Test-Time Debiasing of Vision-Language Embeddings
Viaarxiv icon

Identifying Implicit Social Biases in Vision-Language Models

Add code
Nov 01, 2024
Viaarxiv icon

Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing

Add code
Oct 23, 2024
Figure 1 for Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing
Figure 2 for Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing
Figure 3 for Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing
Figure 4 for Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing
Viaarxiv icon

Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes

Add code
Oct 22, 2024
Viaarxiv icon

Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation

Add code
Sep 30, 2024
Viaarxiv icon

FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging

Add code
Jul 11, 2024
Figure 1 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Figure 2 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Figure 3 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Figure 4 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Viaarxiv icon

Composable Interventions for Language Models

Add code
Jul 09, 2024
Figure 1 for Composable Interventions for Language Models
Figure 2 for Composable Interventions for Language Models
Figure 3 for Composable Interventions for Language Models
Figure 4 for Composable Interventions for Language Models
Viaarxiv icon