Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Edyta Bogucka

C3AI: Crafting and Evaluating Constitutions for Constitutional AI

Feb 21, 2025

Yara Kyrychenko, Ke Zhou, Edyta Bogucka, Daniele Quercia

Abstract:Constitutional AI (CAI) guides LLM behavior using constitutions, but identifying which principles are most effective for model alignment remains an open challenge. We introduce the C3AI framework (\textit{Crafting Constitutions for CAI models}), which serves two key functions: (1) selecting and structuring principles to form effective constitutions before fine-tuning; and (2) evaluating whether fine-tuned CAI models follow these principles in practice. By analyzing principles from AI and psychology, we found that positively framed, behavior-based principles align more closely with human preferences than negatively framed or trait-based principles. In a safety alignment use case, we applied a graph-based principle selection method to refine an existing CAI constitution, improving safety measures while maintaining strong general reasoning capabilities. Interestingly, fine-tuned CAI models performed well on negatively framed principles but struggled with positively framed ones, in contrast to our human alignment results. This highlights a potential gap between principle design and model adherence. Overall, C3AI provides a structured and scalable approach to both crafting and evaluating CAI constitutions.

* This has been accepted for the Web Conference 2025

Via

Access Paper or Ask Questions

Co-designing an AI Impact Assessment Report Template with AI Practitioners and AI Compliance Experts

Jul 24, 2024

Edyta Bogucka, Marios Constantinides, Sanja Šćepanović, Daniele Quercia

Figure 1 for Co-designing an AI Impact Assessment Report Template with AI Practitioners and AI Compliance Experts

Figure 2 for Co-designing an AI Impact Assessment Report Template with AI Practitioners and AI Compliance Experts

Figure 3 for Co-designing an AI Impact Assessment Report Template with AI Practitioners and AI Compliance Experts

Figure 4 for Co-designing an AI Impact Assessment Report Template with AI Practitioners and AI Compliance Experts

Abstract:In the evolving landscape of AI regulation, it is crucial for companies to conduct impact assessments and document their compliance through comprehensive reports. However, current reports lack grounding in regulations and often focus on specific aspects like privacy in relation to AI systems, without addressing the real-world uses of these systems. Moreover, there is no systematic effort to design and evaluate these reports with both AI practitioners and AI compliance experts. To address this gap, we conducted an iterative co-design process with 14 AI practitioners and 6 AI compliance experts and proposed a template for impact assessment reports grounded in the EU AI Act, NIST's AI Risk Management Framework, and ISO 42001 AI Management System. We evaluated the template by producing an impact assessment report for an AI-based meeting companion at a major tech company. A user study with 8 AI practitioners from the same company and 5 AI compliance experts from industry and academia revealed that our template effectively provides necessary information for impact assessments and documents the broad impacts of AI systems. Participants envisioned using the template not only at the pre-deployment stage for compliance but also as a tool to guide the design stage of AI uses.

* 11 pages, 7 figures

Via

Access Paper or Ask Questions