Picture for Sicheng Zhu

Sicheng Zhu

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment

Add code
Oct 10, 2024
Figure 1 for GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment
Figure 2 for GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment
Figure 3 for GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment
Figure 4 for GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment
Viaarxiv icon

Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models

Add code
Sep 01, 2024
Viaarxiv icon

Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?

Add code
Jul 24, 2024
Viaarxiv icon

Triple-domain Feature Learning with Frequency-aware Memory Enhancement for Moving Infrared Small Target Detection

Add code
Jun 11, 2024
Viaarxiv icon

Benchmarking the Robustness of Image Watermarks

Add code
Jan 22, 2024
Viaarxiv icon

AutoDAN: Automatic and Interpretable Adversarial Attacks on Large Language Models

Add code
Oct 23, 2023
Viaarxiv icon

More Context, Less Distraction: Visual Classification by Inferring and Conditioning on Contextual Attributes

Add code
Aug 02, 2023
Viaarxiv icon

On the Possibilities of AI-Generated Text Detection

Add code
Apr 10, 2023
Viaarxiv icon

Understanding the Generalization Benefit of Model Invariance from a Data Perspective

Add code
Nov 10, 2021
Figure 1 for Understanding the Generalization Benefit of Model Invariance from a Data Perspective
Figure 2 for Understanding the Generalization Benefit of Model Invariance from a Data Perspective
Figure 3 for Understanding the Generalization Benefit of Model Invariance from a Data Perspective
Figure 4 for Understanding the Generalization Benefit of Model Invariance from a Data Perspective
Viaarxiv icon

Learning Adversarially Robust Representations via Worst-Case Mutual Information Maximization

Add code
Feb 26, 2020
Figure 1 for Learning Adversarially Robust Representations via Worst-Case Mutual Information Maximization
Figure 2 for Learning Adversarially Robust Representations via Worst-Case Mutual Information Maximization
Figure 3 for Learning Adversarially Robust Representations via Worst-Case Mutual Information Maximization
Figure 4 for Learning Adversarially Robust Representations via Worst-Case Mutual Information Maximization
Viaarxiv icon