Picture for Raja Sekhar Rao Dheekonda

Raja Sekhar Rao Dheekonda

PyRIT: A Framework for Security Risk Identification and Red Teaming in Generative AI System

Add code
Oct 01, 2024
Viaarxiv icon

Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle

Add code
Jul 18, 2024
Figure 1 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Figure 2 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Figure 3 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Figure 4 for Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle
Viaarxiv icon