Picture for Adel Bibi

Adel Bibi

Attacking Multimodal OS Agents with Malicious Image Patches

Add code
Mar 13, 2025
Viaarxiv icon

Shh, don't say that! Domain Certification in LLMs

Add code
Feb 26, 2025
Viaarxiv icon

On the Coexistence and Ensembling of Watermarks

Add code
Jan 29, 2025
Figure 1 for On the Coexistence and Ensembling of Watermarks
Figure 2 for On the Coexistence and Ensembling of Watermarks
Figure 3 for On the Coexistence and Ensembling of Watermarks
Figure 4 for On the Coexistence and Ensembling of Watermarks
Viaarxiv icon

Open Problems in Machine Unlearning for AI Safety

Add code
Jan 09, 2025
Viaarxiv icon

Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Unanswerable Questions and Ambiguous Prompts

Add code
Dec 13, 2024
Figure 1 for Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Unanswerable Questions and Ambiguous Prompts
Figure 2 for Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Unanswerable Questions and Ambiguous Prompts
Figure 3 for Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Unanswerable Questions and Ambiguous Prompts
Figure 4 for Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Unanswerable Questions and Ambiguous Prompts
Viaarxiv icon

Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models

Add code
Aug 27, 2024
Figure 1 for Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models
Figure 2 for Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models
Figure 3 for Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models
Figure 4 for Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models
Viaarxiv icon

FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging

Add code
Jul 11, 2024
Figure 1 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Figure 2 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Figure 3 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Figure 4 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Viaarxiv icon

Model Merging and Safety Alignment: One Bad Model Spoils the Bunch

Add code
Jun 20, 2024
Viaarxiv icon

Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models

Add code
Jun 12, 2024
Figure 1 for Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models
Figure 2 for Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models
Figure 3 for Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models
Figure 4 for Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models
Viaarxiv icon

Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation

Add code
Jun 07, 2024
Figure 1 for Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation
Figure 2 for Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation
Figure 3 for Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation
Figure 4 for Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation
Viaarxiv icon