Picture for Adel Bibi

Adel Bibi

Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Unanswerable Questions and Ambiguous Prompts

Add code
Dec 13, 2024
Viaarxiv icon

Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models

Add code
Aug 27, 2024
Viaarxiv icon

FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging

Add code
Jul 11, 2024
Figure 1 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Figure 2 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Figure 3 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Figure 4 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Viaarxiv icon

Model Merging and Safety Alignment: One Bad Model Spoils the Bunch

Add code
Jun 20, 2024
Viaarxiv icon

Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models

Add code
Jun 12, 2024
Figure 1 for Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models
Figure 2 for Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models
Figure 3 for Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models
Figure 4 for Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models
Viaarxiv icon

Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation

Add code
Jun 07, 2024
Figure 1 for Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation
Figure 2 for Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation
Figure 3 for Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation
Figure 4 for Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation
Viaarxiv icon

Universal In-Context Approximation By Prompting Fully Recurrent Models

Add code
Jun 03, 2024
Viaarxiv icon

Towards Certification of Uncertainty Calibration under Adversarial Attacks

Add code
May 22, 2024
Figure 1 for Towards Certification of Uncertainty Calibration under Adversarial Attacks
Figure 2 for Towards Certification of Uncertainty Calibration under Adversarial Attacks
Figure 3 for Towards Certification of Uncertainty Calibration under Adversarial Attacks
Figure 4 for Towards Certification of Uncertainty Calibration under Adversarial Attacks
Viaarxiv icon

Risks and Opportunities of Open-Source Generative AI

Add code
May 14, 2024
Viaarxiv icon

Near to Mid-term Risks and Opportunities of Open Source Generative AI

Add code
Apr 25, 2024
Figure 1 for Near to Mid-term Risks and Opportunities of Open Source Generative AI
Figure 2 for Near to Mid-term Risks and Opportunities of Open Source Generative AI
Figure 3 for Near to Mid-term Risks and Opportunities of Open Source Generative AI
Figure 4 for Near to Mid-term Risks and Opportunities of Open Source Generative AI
Viaarxiv icon