Wenjie Ruan

Trustworthy Text-to-Image Diffusion Models: A Timely and Focused Survey

Sep 26, 2024
Via arXiv

Boosting Adversarial Training via Fisher-Rao Norm-based Regularization

Mar 26, 2024
Via arXiv

Towards Fairness-Aware Adversarial Learning

Feb 27, 2024
Via arXiv

ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation

Feb 23, 2024
Via arXiv

Building Guardrails for Large Language Models

Feb 02, 2024
Via arXiv

ReRoGCRL: Representation-based Robustness in Goal-Conditioned Reinforcement Learning

Dec 19, 2023
Via arXiv

Reward Certification for Policy Smoothed Reinforcement Learning

Dec 12, 2023
Via arXiv

A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation

May 19, 2023
Via arXiv

Model-Agnostic Reachability Analysis on Deep Neural Networks

Apr 03, 2023
Via arXiv

RePreM: Representation Pre-training with Masked Model for Reinforcement Learning

Mar 03, 2023
Via arXiv