Picture for Sihao Hu

Sihao Hu

Multi-Agent Reinforcement Learning with Focal Diversity Optimization

Add code
Feb 06, 2025
Viaarxiv icon

Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation

Add code
Jan 29, 2025
Viaarxiv icon

$H^3$Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs

Add code
Nov 26, 2024
Figure 1 for $H^3$Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs
Figure 2 for $H^3$Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs
Figure 3 for $H^3$Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs
Figure 4 for $H^3$Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs
Viaarxiv icon

Collaborative Contrastive Network for Click-Through Rate Prediction

Add code
Nov 18, 2024
Figure 1 for Collaborative Contrastive Network for Click-Through Rate Prediction
Figure 2 for Collaborative Contrastive Network for Click-Through Rate Prediction
Figure 3 for Collaborative Contrastive Network for Click-Through Rate Prediction
Figure 4 for Collaborative Contrastive Network for Click-Through Rate Prediction
Viaarxiv icon

LLM-TOPLA: Efficient LLM Ensemble by Maximising Diversity

Add code
Oct 04, 2024
Viaarxiv icon

Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey

Add code
Sep 26, 2024
Viaarxiv icon

Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation

Add code
Sep 04, 2024
Viaarxiv icon

Booster: Tackling Harmful Fine-tuing for Large Language Models via Attenuating Harmful Perturbation

Add code
Sep 03, 2024
Viaarxiv icon

Joint-Motion Mutual Learning for Pose Estimation in Videos

Add code
Aug 05, 2024
Viaarxiv icon

Personalized Privacy Protection Mask Against Unauthorized Facial Recognition

Add code
Jul 19, 2024
Viaarxiv icon