Picture for Matija Franklin

Matija Franklin

Multi-Agent Risks from Advanced AI

Add code
Feb 19, 2025
Viaarxiv icon

Model-Free RL Agents Demonstrate System 1-Like Intentionality

Add code
Jan 30, 2025
Viaarxiv icon

AI Governance through Markets

Add code
Jan 29, 2025
Viaarxiv icon

LMUnit: Fine-grained Evaluation with Natural Language Unit Tests

Add code
Dec 17, 2024
Viaarxiv icon

Beyond Preferences in AI Alignment

Add code
Aug 30, 2024
Figure 1 for Beyond Preferences in AI Alignment
Figure 2 for Beyond Preferences in AI Alignment
Figure 3 for Beyond Preferences in AI Alignment
Figure 4 for Beyond Preferences in AI Alignment
Viaarxiv icon

A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI

Add code
Apr 23, 2024
Figure 1 for A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI
Figure 2 for A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI
Figure 3 for A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI
Figure 4 for A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI
Viaarxiv icon

An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI

Add code
Nov 06, 2023
Figure 1 for An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI
Figure 2 for An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI
Figure 3 for An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI
Figure 4 for An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI
Viaarxiv icon

Strengthening the EU AI Act: Defining Key Terms on AI Manipulation

Add code
Aug 30, 2023
Viaarxiv icon

Concept Extrapolation: A Conceptual Primer

Add code
Jun 19, 2023
Viaarxiv icon

The Influence of Explainable Artificial Intelligence: Nudging Behaviour or Boosting Capability?

Add code
Oct 05, 2022
Viaarxiv icon