Auroop Ganguly

Assessing the Impact of Distribution Shift on Reinforcement Learning Performance

Feb 05, 2024

Ted Fujimoto, Joshua Suetterlein, Samrat Chatterjee, Auroop Ganguly

Abstract: Research in machine learning is making progress in fixing its own reproducibility crisis. Reinforcement learning (RL), in particular, faces its own set of unique challenges. Comparison of point estimates, and plots that show successful convergence to the optimal policy during training, may obfuscate overfitting or dependence on the experimental setup. Although researchers in RL have proposed reliability metrics that account for uncertainty to better understand each algorithm's strengths and weaknesses, the recommendations of past work do not assume the presence of out-of-distribution observations. We propose a set of evaluation methods that measure the robustness of RL algorithms under distribution shifts. The tools presented here argue for the need to account for performance over time while the agent is acting in its environment. In particular, we recommend time series analysis as a method of observational RL evaluation. We also show that the unique properties of RL and simulated dynamic environments allow us to make stronger assumptions to justify the measurement of causal impact in our evaluations. We then apply these tools to single-agent and multi-agent environments to show the impact of introducing distribution shifts during test time. We present this methodology as a first step toward rigorous RL evaluation in the presence of distribution shifts.

* Poster at the Workshop on Regulatable Machine Learning at the 37th Conference on Neural Information Processing Systems (RegML @ NeurIPS 2023)

Ad Hoc Teamwork in the Presence of Adversaries

Aug 09, 2022

Ted Fujimoto, Samrat Chatterjee, Auroop Ganguly

Abstract: Advances in ad hoc teamwork have the potential to create agents that collaborate robustly in real-world applications. Agents deployed in the real world, however, are vulnerable to adversaries with the intent to subvert them. There has been little research in ad hoc teamwork that assumes the presence of adversaries. We explain the importance of extending ad hoc teamwork to include the presence of adversaries and clarify why this problem is difficult. We then propose some directions for new research opportunities in ad hoc teamwork that lead to more robust multi-agent cyber-physical infrastructure systems.

* Blue Sky Ideas Acceptance at the New Frontiers in Adversarial Machine Learning Workshop @ ICML 2022

Progressively Growing Generative Adversarial Networks for High Resolution Semantic Segmentation of Satellite Images

Feb 12, 2019

Edward Collier, Kate Duffy, Sangram Ganguly, Geri Madanguit, Subodh Kalia, Gayaka Shreekant, Ramakrishna Nemani, Andrew Michaelis, Shuang Li, Auroop Ganguly (+1 more)

Abstract: Machine learning has proven to be useful in classification and segmentation of images. In this paper, we evaluate a training methodology for pixel-wise segmentation on high resolution satellite images using progressive growing of generative adversarial networks. We apply our model to segmenting building rooftops and compare these results to conventional methods for rooftop segmentation. We present our findings using the SpaceNet version 2 dataset. Progressive GAN training achieved a test accuracy of 93% compared to 89% for traditional GAN training.

* Accepted to and presented at DMESS 2018 as part of IEEE ICDM 2018

Theory-guided Data Science: A New Paradigm for Scientific Discovery from Data

Nov 13, 2017

Anuj Karpatne, Gowtham Atluri, James Faghmous, Michael Steinbach, Arindam Banerjee, Auroop Ganguly, Shashi Shekhar, Nagiza Samatova, Vipin Kumar

Abstract: Data science models, although successful in a number of commercial domains, have had limited applicability in scientific problems involving complex physical phenomena. Theory-guided data science (TGDS) is an emerging paradigm that aims to leverage the wealth of scientific knowledge for improving the effectiveness of data science models in enabling scientific discovery. The overarching vision of TGDS is to introduce scientific consistency as an essential component for learning generalizable models. Further, by producing scientifically interpretable models, TGDS aims to advance our scientific understanding by discovering novel domain insights. Indeed, the paradigm of TGDS has started to gain prominence in a number of scientific disciplines such as turbulence modeling, material discovery, quantum chemistry, bio-medical science, bio-marker discovery, climate science, and hydrology. In this paper, we formally conceptualize the paradigm of TGDS and present a taxonomy of research themes in TGDS. We describe several approaches for integrating domain knowledge in different research themes using illustrative examples from different disciplines. We also highlight some of the promising avenues of novel research for realizing the full potential of theory-guided data science.

* IEEE Transactions on Knowledge and Data Engineering, 29(10), pp. 2318-2331, 2017
