Doina Precup

McGill University, Mila - Quebec Artificial Intelligence Institute

Agency Is Frame-Dependent

Feb 06, 2025

Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning

Jan 29, 2025

Fairness in Reinforcement Learning with Bisimulation Metrics

Dec 22, 2024

MaestroMotif: Skill Design from Artificial Intelligence Feedback

Dec 11, 2024

Parseval Regularization for Continual Reinforcement Learning

Dec 10, 2024

Reaction-conditioned De Novo Enzyme Design with GENzyme

Nov 10, 2024

Soft Condorcet Optimization for Ranking of General Agents

Nov 04, 2024

Learning Successor Features the Simple Way

Oct 29, 2024

Identifying and Addressing Delusions for Target-Directed Decision-Making

Oct 10, 2024

Mitigating Downstream Model Risks via Model Provenance

Oct 03, 2024