Abstract: AI agents are AI systems that can achieve complex goals autonomously. Assessing an agent's level of autonomy is crucial for understanding both its potential benefits and risks. Current assessments of autonomy often focus on specific risks and rely on run-time evaluations -- observations of agent actions during operation. We introduce a code-based assessment of autonomy that eliminates the need to run an AI agent on specific tasks, thereby reducing the costs and risks associated with run-time evaluations. Under this framework, the orchestration code used to run an AI agent is scored according to a taxonomy that assesses two attributes of autonomy: impact and oversight. We demonstrate this approach with the AutoGen framework and selected applications.
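
To make the code-inspection idea concrete, here is a minimal sketch in Python of a static scorer over AutoGen orchestration source. The human_input_mode parameter and the register_function helper are real AutoGen identifiers, but the score_autonomy function, its scoring weights, and the demo snippet are illustrative assumptions; the abstract does not specify the taxonomy's actual rubric.

    import ast

    def score_autonomy(source: str) -> dict:
        """Score orchestration code for oversight and impact without running it."""
        tree = ast.parse(source)
        oversight = 0  # higher = more human oversight of the agent
        impact = 0     # higher = more ability to act on the world
        for node in ast.walk(tree):
            # Oversight: how often a human is kept in the loop.
            if isinstance(node, ast.keyword) and node.arg == "human_input_mode":
                if isinstance(node.value, ast.Constant):
                    if node.value.value == "ALWAYS":
                        oversight += 2
                    elif node.value.value == "TERMINATE":
                        oversight += 1
                    # "NEVER" earns no oversight credit.
            # Impact: each registered tool widens what the agent can do.
            if isinstance(node, ast.Call):
                func = node.func
                name = getattr(func, "attr", getattr(func, "id", ""))
                if name == "register_function":
                    impact += 1
        return {"oversight": oversight, "impact": impact}

    if __name__ == "__main__":
        demo = (
            'agent = ConversableAgent("assistant", human_input_mode="NEVER")\n'
            'register_function(run_shell, caller=agent, executor=agent)\n'
        )
        print(score_autonomy(demo))  # -> {'oversight': 0, 'impact': 1}

Because the scorer only parses the orchestration source and never executes the agent, it has the property the abstract highlights: no run-time costs or risks are incurred during assessment.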
Abstract: Language-based AI systems are diffusing into society, bringing both positive and negative impacts. Mitigating the negative impacts depends on accurate impact assessments, drawn from an empirical evidence base that makes causal connections between AI usage and impacts. Interconnected post-deployment monitoring combines information about model integration and use, application use, and incidents and impacts. For example, inference-time monitoring of chain-of-thought reasoning can be combined with long-term monitoring of sectoral AI diffusion, impacts, and incidents. Drawing on information-sharing mechanisms in other industries, we highlight example data sources and specific data points that governments could collect to inform AI risk management.
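
As a rough illustration of how model-, application-, and incident-level data points could be linked in "interconnected" monitoring, the Python sketch below defines one possible record schema. The MonitoringRecord class and every field name are assumptions made for illustration; the abstract does not prescribe a data format.

    from dataclasses import dataclass, field
    from datetime import date

    @dataclass
    class MonitoringRecord:
        model_id: str          # which foundation model was integrated
        application: str       # the downstream application built on it
        sector: str            # deployment sector, for tracking diffusion
        observed_on: date      # end of the reporting period
        usage_volume: int      # e.g. inferences served in the period
        incidents: list[str] = field(default_factory=list)  # linked incident IDs

    record = MonitoringRecord(
        model_id="example-model-v1",
        application="customer-support-assistant",
        sector="retail",
        observed_on=date(2025, 1, 31),
        usage_volume=120_000,
        incidents=["INC-0042"],
    )

Keeping the incident identifiers on the same record as model and application fields is what would let analysts trace an impact back through the application to the underlying model, the causal connection the abstract calls for.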