Picture for Dennis Wei

Dennis Wei

Identifying Sub-networks in Neural Networks via Functionally Similar Representations

Add code
Oct 21, 2024
Figure 1 for Identifying Sub-networks in Neural Networks via Functionally Similar Representations
Figure 2 for Identifying Sub-networks in Neural Networks via Functionally Similar Representations
Figure 3 for Identifying Sub-networks in Neural Networks via Functionally Similar Representations
Figure 4 for Identifying Sub-networks in Neural Networks via Functionally Similar Representations
Viaarxiv icon

Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs

Add code
Jun 17, 2024
Viaarxiv icon

Interventional Causal Discovery in a Mixture of DAGs

Add code
Jun 12, 2024
Viaarxiv icon

Facilitating Human-LLM Collaboration through Factuality Scores and Source Attributions

Add code
May 30, 2024
Viaarxiv icon

Selective Explanations

Add code
May 29, 2024
Viaarxiv icon

The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers

Add code
Apr 03, 2024
Viaarxiv icon

Multi-Level Explanations for Generative Language Models

Add code
Mar 21, 2024
Figure 1 for Multi-Level Explanations for Generative Language Models
Figure 2 for Multi-Level Explanations for Generative Language Models
Figure 3 for Multi-Level Explanations for Generative Language Models
Figure 4 for Multi-Level Explanations for Generative Language Models
Viaarxiv icon

Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

Add code
Mar 09, 2024
Viaarxiv icon

Causal Bandits with General Causal Models and Interventions

Add code
Mar 01, 2024
Figure 1 for Causal Bandits with General Causal Models and Interventions
Figure 2 for Causal Bandits with General Causal Models and Interventions
Figure 3 for Causal Bandits with General Causal Models and Interventions
Figure 4 for Causal Bandits with General Causal Models and Interventions
Viaarxiv icon

Trust Regions for Explanations via Black-Box Probabilistic Certification

Add code
Feb 21, 2024
Viaarxiv icon