Picture for Eric Wong

Eric Wong

AR-Pro: Counterfactual Explanations for Anomaly Repair with Formal Properties

Add code
Oct 31, 2024
Figure 1 for AR-Pro: Counterfactual Explanations for Anomaly Repair with Formal Properties
Figure 2 for AR-Pro: Counterfactual Explanations for Anomaly Repair with Formal Properties
Figure 3 for AR-Pro: Counterfactual Explanations for Anomaly Repair with Formal Properties
Figure 4 for AR-Pro: Counterfactual Explanations for Anomaly Repair with Formal Properties
Viaarxiv icon

Dolphin: A Programmable Framework for Scalable Neurosymbolic Learning

Add code
Oct 04, 2024
Figure 1 for Dolphin: A Programmable Framework for Scalable Neurosymbolic Learning
Figure 2 for Dolphin: A Programmable Framework for Scalable Neurosymbolic Learning
Figure 3 for Dolphin: A Programmable Framework for Scalable Neurosymbolic Learning
Figure 4 for Dolphin: A Programmable Framework for Scalable Neurosymbolic Learning
Viaarxiv icon

The FIX Benchmark: Extracting Features Interpretable to eXperts

Add code
Sep 20, 2024
Figure 1 for The FIX Benchmark: Extracting Features Interpretable to eXperts
Figure 2 for The FIX Benchmark: Extracting Features Interpretable to eXperts
Figure 3 for The FIX Benchmark: Extracting Features Interpretable to eXperts
Figure 4 for The FIX Benchmark: Extracting Features Interpretable to eXperts
Viaarxiv icon

Towards Compositionality in Concept Learning

Add code
Jun 26, 2024
Viaarxiv icon

Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference

Add code
Jun 21, 2024
Viaarxiv icon

Avoiding Copyright Infringement via Machine Unlearning

Add code
Jun 16, 2024
Viaarxiv icon

Data-Efficient Learning with Neural Programs

Add code
Jun 10, 2024
Figure 1 for Data-Efficient Learning with Neural Programs
Figure 2 for Data-Efficient Learning with Neural Programs
Figure 3 for Data-Efficient Learning with Neural Programs
Figure 4 for Data-Efficient Learning with Neural Programs
Viaarxiv icon

DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation

Add code
Jun 02, 2024
Figure 1 for DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation
Figure 2 for DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation
Figure 3 for DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation
Figure 4 for DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation
Viaarxiv icon

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

Add code
Mar 28, 2024
Figure 1 for JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models
Figure 2 for JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models
Figure 3 for JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models
Figure 4 for JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models
Viaarxiv icon

Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing

Add code
Feb 28, 2024
Figure 1 for Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing
Figure 2 for Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing
Figure 3 for Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing
Figure 4 for Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing
Viaarxiv icon