Picture for Virginia Smith

Virginia Smith

Research in Collaborative Learning Does Not Serve Cross-Silo Federated Learning in Practice

Add code
Oct 14, 2025
Viaarxiv icon

e3: Learning to Explore Enables Extrapolation of Test-Time Compute for LLMs

Add code
Jun 10, 2025
Viaarxiv icon

Membership Inference Attacks for Unseen Classes

Add code
Jun 06, 2025
Viaarxiv icon

Position: Mechanistic Interpretability Should Prioritize Feature Consistency in SAEs

Add code
May 26, 2025
Viaarxiv icon

SAEs $\textit{Can}$ Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs

Add code
Apr 11, 2025
Figure 1 for SAEs $\textit{Can}$ Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs
Figure 2 for SAEs $\textit{Can}$ Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs
Figure 3 for SAEs $\textit{Can}$ Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs
Figure 4 for SAEs $\textit{Can}$ Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs
Viaarxiv icon

Exact Unlearning of Finetuning Data via Model Merging at Scale

Add code
Apr 06, 2025
Viaarxiv icon

CoRAG: Collaborative Retrieval-Augmented Generation

Add code
Apr 02, 2025
Figure 1 for CoRAG: Collaborative Retrieval-Augmented Generation
Figure 2 for CoRAG: Collaborative Retrieval-Augmented Generation
Figure 3 for CoRAG: Collaborative Retrieval-Augmented Generation
Figure 4 for CoRAG: Collaborative Retrieval-Augmented Generation
Viaarxiv icon

NeurIPS 2023 Competition: Privacy Preserving Federated Learning Document VQA

Add code
Nov 06, 2024
Figure 1 for NeurIPS 2023 Competition: Privacy Preserving Federated Learning Document VQA
Figure 2 for NeurIPS 2023 Competition: Privacy Preserving Federated Learning Document VQA
Figure 3 for NeurIPS 2023 Competition: Privacy Preserving Federated Learning Document VQA
Figure 4 for NeurIPS 2023 Competition: Privacy Preserving Federated Learning Document VQA
Viaarxiv icon

Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models

Add code
Nov 01, 2024
Figure 1 for Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models
Figure 2 for Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models
Figure 3 for Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models
Figure 4 for Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models
Viaarxiv icon

Position: LLM Unlearning Benchmarks are Weak Measures of Progress

Add code
Oct 03, 2024
Figure 1 for Position: LLM Unlearning Benchmarks are Weak Measures of Progress
Figure 2 for Position: LLM Unlearning Benchmarks are Weak Measures of Progress
Figure 3 for Position: LLM Unlearning Benchmarks are Weak Measures of Progress
Figure 4 for Position: LLM Unlearning Benchmarks are Weak Measures of Progress
Viaarxiv icon