Furong Huang

Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset

Nov 05, 2024

Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayesian Theory

Nov 01, 2024

AdvBDGen: Adversarially Fortified Prompt-Specific Fuzzy Backdoor Generator Against LLM Alignment

Oct 15, 2024

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment

Oct 10, 2024

Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning

Oct 09, 2024

EnsemW2S: Can an Ensemble of LLMs be Leveraged to Obtain a Stronger LLM?

Oct 06, 2024

Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance

Oct 03, 2024

SAFLEX: Self-Adaptive Augmentation via Feature Label Extrapolation

Oct 03, 2024

Auction-Based Regulation for Artificial Intelligence

Oct 02, 2024

Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization

Sep 27, 2024