Picture for Zihan Guan

Zihan Guan

Alignment-Weighted DPO: A principled reasoning approach to improve safety alignment

Add code
Feb 24, 2026
Viaarxiv icon

Agentic Framework for Epidemiological Modeling

Add code
Jan 30, 2026
Viaarxiv icon

Enhanced Diagnostic Performance via Large-Resolution Inference Optimization for Pathology Foundation Models

Add code
Jan 17, 2026
Viaarxiv icon

Benign Samples Matter! Fine-tuning On Outlier Benign Samples Severely Breaks Safety

Add code
May 11, 2025
Viaarxiv icon

BalancEdit: Dynamically Balancing the Generality-Locality Trade-off in Multi-modal Model Editing

Add code
May 02, 2025
Viaarxiv icon

Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing

Add code
Oct 23, 2024
Figure 1 for Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing
Figure 2 for Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing
Figure 3 for Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing
Figure 4 for Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing
Viaarxiv icon

No Free Lunch: Retrieval-Augmented Generation Undermines Fairness in LLMs, Even for Vigilant Users

Add code
Oct 10, 2024
Figure 1 for No Free Lunch: Retrieval-Augmented Generation Undermines Fairness in LLMs, Even for Vigilant Users
Figure 2 for No Free Lunch: Retrieval-Augmented Generation Undermines Fairness in LLMs, Even for Vigilant Users
Figure 3 for No Free Lunch: Retrieval-Augmented Generation Undermines Fairness in LLMs, Even for Vigilant Users
Figure 4 for No Free Lunch: Retrieval-Augmented Generation Undermines Fairness in LLMs, Even for Vigilant Users
Viaarxiv icon

UFID: A Unified Framework for Input-level Backdoor Detection on Diffusion Models

Add code
Apr 01, 2024
Figure 1 for UFID: A Unified Framework for Input-level Backdoor Detection on Diffusion Models
Figure 2 for UFID: A Unified Framework for Input-level Backdoor Detection on Diffusion Models
Figure 3 for UFID: A Unified Framework for Input-level Backdoor Detection on Diffusion Models
Figure 4 for UFID: A Unified Framework for Input-level Backdoor Detection on Diffusion Models
Viaarxiv icon

Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation

Add code
Mar 28, 2024
Figure 1 for Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation
Figure 2 for Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation
Figure 3 for Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation
Figure 4 for Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation
Viaarxiv icon

XGBD: Explanation-Guided Graph Backdoor Detection

Add code
Aug 08, 2023
Figure 1 for XGBD: Explanation-Guided Graph Backdoor Detection
Figure 2 for XGBD: Explanation-Guided Graph Backdoor Detection
Figure 3 for XGBD: Explanation-Guided Graph Backdoor Detection
Figure 4 for XGBD: Explanation-Guided Graph Backdoor Detection
Viaarxiv icon