Picture for Eric Winsor

Eric Winsor

Fundamental Limitations in Defending LLM Finetuning APIs

Add code
Feb 20, 2025
Figure 1 for Fundamental Limitations in Defending LLM Finetuning APIs
Figure 2 for Fundamental Limitations in Defending LLM Finetuning APIs
Figure 3 for Fundamental Limitations in Defending LLM Finetuning APIs
Figure 4 for Fundamental Limitations in Defending LLM Finetuning APIs
Viaarxiv icon

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Add code
Oct 11, 2024
Figure 1 for AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents
Figure 2 for AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents
Figure 3 for AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents
Figure 4 for AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents
Viaarxiv icon

Look Before You Leap: A Universal Emergent Decomposition of Retrieval Tasks in Language Models

Add code
Dec 13, 2023
Figure 1 for Look Before You Leap: A Universal Emergent Decomposition of Retrieval Tasks in Language Models
Figure 2 for Look Before You Leap: A Universal Emergent Decomposition of Retrieval Tasks in Language Models
Figure 3 for Look Before You Leap: A Universal Emergent Decomposition of Retrieval Tasks in Language Models
Figure 4 for Look Before You Leap: A Universal Emergent Decomposition of Retrieval Tasks in Language Models
Viaarxiv icon

Interpreting Neural Networks through the Polytope Lens

Add code
Nov 22, 2022
Viaarxiv icon

Scatterbrain: Unifying Sparse and Low-rank Attention Approximation

Add code
Oct 28, 2021
Figure 1 for Scatterbrain: Unifying Sparse and Low-rank Attention Approximation
Figure 2 for Scatterbrain: Unifying Sparse and Low-rank Attention Approximation
Figure 3 for Scatterbrain: Unifying Sparse and Low-rank Attention Approximation
Figure 4 for Scatterbrain: Unifying Sparse and Low-rank Attention Approximation
Viaarxiv icon