Picture for Ronghui Mu

Ronghui Mu

Invariant Correlation of Representation with Label

Add code
Jul 01, 2024
Viaarxiv icon

Safeguarding Large Language Models: A Survey

Add code
Jun 03, 2024
Viaarxiv icon

Towards Fairness-Aware Adversarial Learning

Add code
Feb 27, 2024
Viaarxiv icon

Building Guardrails for Large Language Models

Add code
Feb 02, 2024
Viaarxiv icon

Reward Certification for Policy Smoothed Reinforcement Learning

Add code
Dec 12, 2023
Viaarxiv icon

A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation

Add code
May 19, 2023
Viaarxiv icon

Randomized Adversarial Training via Taylor Expansion

Add code
Mar 19, 2023
Viaarxiv icon

Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning

Add code
Dec 22, 2022
Viaarxiv icon

3DVerifier: Efficient Robustness Verification for 3D Point Cloud Models

Add code
Jul 15, 2022
Figure 1 for 3DVerifier: Efficient Robustness Verification for 3D Point Cloud Models
Figure 2 for 3DVerifier: Efficient Robustness Verification for 3D Point Cloud Models
Figure 3 for 3DVerifier: Efficient Robustness Verification for 3D Point Cloud Models
Figure 4 for 3DVerifier: Efficient Robustness Verification for 3D Point Cloud Models
Viaarxiv icon

Sparse Adversarial Video Attacks with Spatial Transformations

Add code
Nov 10, 2021
Figure 1 for Sparse Adversarial Video Attacks with Spatial Transformations
Figure 2 for Sparse Adversarial Video Attacks with Spatial Transformations
Figure 3 for Sparse Adversarial Video Attacks with Spatial Transformations
Figure 4 for Sparse Adversarial Video Attacks with Spatial Transformations
Viaarxiv icon