Picture for Fatih Ilhan

Fatih Ilhan

Bilkent University, DataBoss A.S

$H^3$Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs

Add code
Nov 26, 2024
Viaarxiv icon

LLM-TOPLA: Efficient LLM Ensemble by Maximising Diversity

Add code
Oct 04, 2024
Viaarxiv icon

Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey

Add code
Sep 26, 2024
Viaarxiv icon

Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation

Add code
Sep 04, 2024
Viaarxiv icon

Booster: Tackling Harmful Fine-tuing for Large Language Models via Attenuating Harmful Perturbation

Add code
Sep 03, 2024
Viaarxiv icon

Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning

Add code
May 28, 2024
Viaarxiv icon

Robust Few-Shot Ensemble Learning with Focal Diversity-Based Pruning

Add code
Apr 05, 2024
Viaarxiv icon

A Survey on Large Language Model-Based Game Agents

Add code
Apr 02, 2024
Viaarxiv icon

STDLens: Model Hijacking-Resilient Federated Learning for Object Detection

Add code
Mar 25, 2023
Viaarxiv icon

EENet: Learning to Early Exit for Adaptive Inference

Add code
Jan 15, 2023
Viaarxiv icon