Picture for Ram Shankar Siva Kumar

Ram Shankar Siva Kumar

Microsoft

PyRIT: A Framework for Security Risk Identification and Red Teaming in Generative AI System

Add code
Oct 01, 2024
Viaarxiv icon

Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle

Add code
Jul 18, 2024
Figure 1 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Figure 2 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Figure 3 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Figure 4 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Viaarxiv icon

The Human Factor in AI Red Teaming: Perspectives from Social and Collaborative Computing

Add code
Jul 10, 2024
Viaarxiv icon

Adversarial Machine Learning and Cybersecurity: Risks, Challenges, and Legal Implications

Add code
May 23, 2023
Viaarxiv icon

Adversarial for Good? How the Adversarial ML Community's Values Impede Socially Beneficial Uses of Attacks

Add code
Jul 11, 2021
Figure 1 for Adversarial for Good? How the Adversarial ML Community's Values Impede Socially Beneficial Uses of Attacks
Figure 2 for Adversarial for Good? How the Adversarial ML Community's Values Impede Socially Beneficial Uses of Attacks
Viaarxiv icon

Legal Risks of Adversarial Machine Learning Research

Add code
Jun 29, 2020
Figure 1 for Legal Risks of Adversarial Machine Learning Research
Figure 2 for Legal Risks of Adversarial Machine Learning Research
Figure 3 for Legal Risks of Adversarial Machine Learning Research
Viaarxiv icon

Politics of Adversarial Machine Learning

Add code
Feb 19, 2020
Viaarxiv icon

Adversarial Machine Learning -- Industry Perspectives

Add code
Feb 04, 2020
Figure 1 for Adversarial Machine Learning -- Industry Perspectives
Figure 2 for Adversarial Machine Learning -- Industry Perspectives
Figure 3 for Adversarial Machine Learning -- Industry Perspectives
Figure 4 for Adversarial Machine Learning -- Industry Perspectives
Viaarxiv icon

Failure Modes in Machine Learning Systems

Add code
Nov 25, 2019
Viaarxiv icon

Law and Adversarial Machine Learning

Add code
Oct 26, 2018
Viaarxiv icon