Yahan Yang

MR. Guard: Multilingual Reasoning Guardrail using Curriculum Learning

Apr 21, 2025
Via arXiv

Safety Monitoring for Learning-Enabled Cyber-Physical Systems in Out-of-Distribution Scenarios

Apr 18, 2025
Via arXiv

Benchmarking LLM Guardrails in Handling Multilingual Toxicity

Oct 29, 2024
Via arXiv

Understanding Calibration for Multilingual Question Answering Models

Nov 15, 2023
Via arXiv

Using Semantic Information for Defining and Detecting OOD Inputs

Feb 21, 2023
Via arXiv

In and Out-of-Domain Text Adversarial Robustness via Label Smoothing

Dec 20, 2022
Via arXiv

Memory Classifiers: Two-stage Classification for Robustness in Machine Learning

Jun 10, 2022
Via arXiv