Picture for Charith Peris

Charith Peris

ARES: Adaptive Red-Teaming and End-to-End Repair of Policy-Reward System

Add code
Apr 20, 2026
Viaarxiv icon

Defenses Against Prompt Attacks Learn Surface Heuristics

Add code
Jan 12, 2026
Viaarxiv icon

Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation

Add code
May 27, 2025
Viaarxiv icon

Evaluating Differentially Private Synthetic Data Generation in High-Stakes Domains

Add code
Oct 10, 2024
Figure 1 for Evaluating Differentially Private Synthetic Data Generation in High-Stakes Domains
Figure 2 for Evaluating Differentially Private Synthetic Data Generation in High-Stakes Domains
Figure 3 for Evaluating Differentially Private Synthetic Data Generation in High-Stakes Domains
Figure 4 for Evaluating Differentially Private Synthetic Data Generation in High-Stakes Domains
Viaarxiv icon

Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification

Add code
Oct 07, 2024
Figure 1 for Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification
Figure 2 for Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification
Figure 3 for Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification
Figure 4 for Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification
Viaarxiv icon

Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Knowledge Graphs

Add code
Jul 31, 2024
Figure 1 for Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Knowledge Graphs
Figure 2 for Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Knowledge Graphs
Figure 3 for Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Knowledge Graphs
Figure 4 for Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Knowledge Graphs
Viaarxiv icon

Partial Federated Learning

Add code
Mar 03, 2024
Figure 1 for Partial Federated Learning
Figure 2 for Partial Federated Learning
Figure 3 for Partial Federated Learning
Figure 4 for Partial Federated Learning
Viaarxiv icon

On the steerability of large language models toward data-driven personas

Add code
Nov 08, 2023
Viaarxiv icon

Coordinated Replay Sample Selection for Continual Federated Learning

Add code
Oct 23, 2023
Figure 1 for Coordinated Replay Sample Selection for Continual Federated Learning
Figure 2 for Coordinated Replay Sample Selection for Continual Federated Learning
Figure 3 for Coordinated Replay Sample Selection for Continual Federated Learning
Figure 4 for Coordinated Replay Sample Selection for Continual Federated Learning
Viaarxiv icon

Holistic Survey of Privacy and Fairness in Machine Learning

Add code
Jul 28, 2023
Viaarxiv icon