Abstract: Rainbow Deep Q-Network (DQN) demonstrated that combining multiple independent enhancements can significantly boost a reinforcement learning (RL) agent's performance. In this paper, we present "Beyond The Rainbow" (BTR), a novel algorithm that integrates six improvements from across the RL literature into Rainbow DQN, establishing a new state-of-the-art for RL on a desktop PC, with a human-normalized interquartile mean (IQM) of 7.4 on Atari-60. Beyond Atari, we demonstrate BTR's ability to handle complex 3D games, successfully training agents to play Super Mario Galaxy, Mario Kart, and Mortal Kombat with minimal algorithmic changes. Because BTR is designed with computational efficiency in mind, agents can be trained on a desktop PC with 200 million Atari frames within 12 hours. Additionally, we conduct detailed ablation studies of each component, analyzing their performance and impact using numerous measures.
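For context, the human-normalized IQM quoted above normalizes each game's score against random and human baselines, then averages the middle 50% of the normalized results. A minimal sketch follows; the baseline values and game selection are illustrative, not taken from the paper:

```python
import numpy as np

def human_normalized(score, random_score, human_score):
    """Normalize a raw game score against random and human baselines."""
    return (score - random_score) / (human_score - random_score)

def interquartile_mean(values):
    """Mean of the middle 50% of values (discards top and bottom quartiles)."""
    values = np.sort(np.asarray(values))
    n = len(values)
    return values[n // 4 : n - n // 4].mean()

# Illustrative usage: one score per game, with made-up baseline tables.
scores          = {"Breakout": 420.0, "Pong": 20.5, "Seaquest": 42000.0}
random_baseline = {"Breakout": 1.7,   "Pong": -20.7, "Seaquest": 68.4}
human_baseline  = {"Breakout": 30.5,  "Pong": 14.6,  "Seaquest": 42054.7}

normalized = [
    human_normalized(scores[g], random_baseline[g], human_baseline[g])
    for g in scores
]
print(f"Human-normalized IQM: {interquartile_mean(normalized):.2f}")
```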
Abstract: Land cover classification (LCC), together with monitoring how land use changes over time, is an important process in climate change mitigation and adaptation. Existing approaches that use machine learning with Earth observation data for LCC rely on fully annotated and segmented datasets. Creating these datasets requires a large amount of effort, and a lack of suitable datasets has become an obstacle to scaling the use of LCC. In this study, we propose Scene-to-Patch models: an alternative LCC approach utilising Multiple Instance Learning (MIL) that requires only high-level scene labels. This enables much faster development of new datasets whilst still providing segmentation through patch-level predictions, ultimately increasing the accessibility of LCC for different scenarios. On the DeepGlobe-LCC dataset, our approach outperforms non-MIL baselines on both scene- and patch-level prediction. This work provides the foundation for expanding the use of LCC in climate change mitigation methods across technology, government, and academia.
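As a rough illustration of the Scene-to-Patch idea, the sketch below scores each patch of a scene independently and pools the patch predictions into a single scene-level label. It is a hypothetical PyTorch model assuming mean-pooling MIL aggregation and precomputed patch embeddings; the paper's actual architecture may differ:

```python
import torch
import torch.nn as nn

class SceneToPatch(nn.Module):
    """MIL model: per-patch class scores pooled into one scene prediction.

    Illustrative sketch only, not the paper's exact architecture.
    """
    def __init__(self, patch_dim, n_classes, hidden=128):
        super().__init__()
        # Shared patch classifier applied to every patch embedding.
        self.patch_head = nn.Sequential(
            nn.Linear(patch_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, n_classes),
        )

    def forward(self, patches):
        # patches: (n_patches, patch_dim) embeddings for one scene (bag).
        patch_logits = self.patch_head(patches)   # per-patch predictions
        scene_logits = patch_logits.mean(dim=0)   # MIL mean pooling
        return scene_logits, patch_logits

model = SceneToPatch(patch_dim=64, n_classes=7)  # e.g. 7 land cover classes
bag = torch.randn(16, 64)                        # 16 patches from one scene
scene_pred, patch_preds = model(bag)             # scene label + segmentation
```

Because the patch head produces a prediction per patch, the same model that is trained only on scene labels also yields the patch-level segmentation described in the abstract.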
Abstract: The challenge of language grounding is to fully understand natural language by grounding it in real-world referents. While AI techniques are available, the widespread adoption and effectiveness of such technologies for human-robot teams rely critically on user trust. This survey provides three contributions to the newly emerging field of trust in language grounding: a) an overview of language grounding research in terms of AI technologies, datasets, and user interfaces; b) six hypothesised trust factors relevant to language grounding, which are tested empirically on a human-robot cleaning team; and c) future research directions for trust in language grounding.
Abstract: We generalise the problem of reward modelling (RM) for reinforcement learning (RL) to handle non-Markovian rewards. Existing work assumes that human evaluators observe each step in a trajectory independently when providing feedback on agent behaviour. In this work, we remove this assumption, extending RM to include hidden state information that captures temporal dependencies in the human assessment of trajectories. We then show how RM can be approached as a multiple instance learning (MIL) problem, and develop new MIL models that are able to capture the temporal dependencies in labelled trajectories. We demonstrate on a range of RL tasks that our novel MIL models can reconstruct reward functions to a high level of accuracy, and that they provide interpretable learnt hidden information that can be used to train high-performing agent policies.
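One way to realise the hidden-state idea concretely is a recurrent reward model that carries state across a trajectory and emits a per-step reward, with the trajectory return acting as the bag-level MIL label. The sketch below is a hypothetical PyTorch version under those assumptions, not necessarily one of the paper's models:

```python
import torch
import torch.nn as nn

class RecurrentRewardModel(nn.Module):
    """Non-Markovian reward model: the reward at step t depends on the
    whole trajectory so far via an LSTM hidden state. Illustrative sketch."""
    def __init__(self, obs_dim, hidden=64):
        super().__init__()
        self.lstm = nn.LSTM(obs_dim, hidden, batch_first=True)
        self.reward_head = nn.Linear(hidden, 1)

    def forward(self, trajectory):
        # trajectory: (batch, timesteps, obs_dim)
        hidden_states, _ = self.lstm(trajectory)
        rewards = self.reward_head(hidden_states).squeeze(-1)  # (batch, T)
        # MIL view: the trajectory is a bag of steps (instances), and the
        # observed return (bag label) is the sum of per-step rewards.
        return rewards, rewards.sum(dim=1)

model = RecurrentRewardModel(obs_dim=8)
traj = torch.randn(4, 50, 8)            # 4 trajectories of 50 steps
step_rewards, predicted_return = model(traj)
```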
Abstract: In Multiple Instance Learning (MIL), models are trained using bags of instances, where only a single label is provided for each bag. A bag label is often determined by only a handful of key instances within the bag, making it difficult to interpret what information a classifier uses to make its decisions. In this work, we establish the key requirements for interpreting MIL models. We then develop several model-agnostic approaches that meet these requirements. Our methods are compared against existing inherently interpretable MIL models on several datasets, achieving an increase in interpretability accuracy of up to 30%. We also examine the ability of these methods to identify interactions between instances and to scale to larger datasets, improving their applicability to real-world problems.
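One simple model-agnostic probe in this spirit, offered purely as an illustration rather than one of the paper's actual methods, scores each instance by how much the bag-level prediction changes when that instance is removed:

```python
import numpy as np

def instance_importance(bag, predict_bag):
    """Occlusion-style importance: drop each instance in turn and measure
    the change in the bag-level prediction. Model-agnostic: `predict_bag`
    can wrap any MIL classifier mapping a bag to a scalar score."""
    base = predict_bag(bag)
    scores = []
    for i in range(len(bag)):
        reduced = np.delete(bag, i, axis=0)   # bag without instance i
        scores.append(base - predict_bag(reduced))
    return np.array(scores)

# Illustrative usage with a toy max-pooling MIL "classifier".
predict = lambda bag: bag.max(axis=0).mean()
bag = np.random.rand(10, 5)                   # 10 instances, 5 features
print(instance_importance(bag, predict))
```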