Picture for Francesco Belardinelli

Francesco Belardinelli

Imperial College London

Multi-Agent Q-Learning Dynamics in Random Networks: Convergence due to Exploration and Sparsity

Add code
Mar 13, 2025
Viaarxiv icon

Probabilistic Shielding for Safe Reinforcement Learning

Add code
Mar 09, 2025
Viaarxiv icon

Explainable Reinforcement Learning for Formula One Race Strategy

Add code
Jan 07, 2025
Figure 1 for Explainable Reinforcement Learning for Formula One Race Strategy
Figure 2 for Explainable Reinforcement Learning for Formula One Race Strategy
Figure 3 for Explainable Reinforcement Learning for Formula One Race Strategy
Figure 4 for Explainable Reinforcement Learning for Formula One Race Strategy
Viaarxiv icon

Measuring Goal-Directedness

Add code
Dec 06, 2024
Figure 1 for Measuring Goal-Directedness
Figure 2 for Measuring Goal-Directedness
Figure 3 for Measuring Goal-Directedness
Figure 4 for Measuring Goal-Directedness
Viaarxiv icon

The Reasons that Agents Act: Intention and Instrumental Goals

Add code
Feb 15, 2024
Figure 1 for The Reasons that Agents Act: Intention and Instrumental Goals
Figure 2 for The Reasons that Agents Act: Intention and Instrumental Goals
Figure 3 for The Reasons that Agents Act: Intention and Instrumental Goals
Figure 4 for The Reasons that Agents Act: Intention and Instrumental Goals
Viaarxiv icon

Leveraging Approximate Model-based Shielding for Probabilistic Safety Guarantees in Continuous Environments

Add code
Feb 01, 2024
Viaarxiv icon

Stability of Multi-Agent Learning in Competitive Networks: Delaying the Onset of Chaos

Add code
Dec 19, 2023
Viaarxiv icon

Honesty Is the Best Policy: Defining and Mitigating AI Deception

Add code
Dec 03, 2023
Figure 1 for Honesty Is the Best Policy: Defining and Mitigating AI Deception
Figure 2 for Honesty Is the Best Policy: Defining and Mitigating AI Deception
Figure 3 for Honesty Is the Best Policy: Defining and Mitigating AI Deception
Figure 4 for Honesty Is the Best Policy: Defining and Mitigating AI Deception
Viaarxiv icon

3vLTL: A Tool to Generate Automata for Three-valued LTL

Add code
Nov 16, 2023
Figure 1 for 3vLTL: A Tool to Generate Automata for Three-valued LTL
Figure 2 for 3vLTL: A Tool to Generate Automata for Three-valued LTL
Figure 3 for 3vLTL: A Tool to Generate Automata for Three-valued LTL
Viaarxiv icon

Approximate Model-Based Shielding for Safe Reinforcement Learning

Add code
Jul 27, 2023
Viaarxiv icon