Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alberto Speranzon

Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings

Feb 13, 2024

Rushang Karia, Pulkit Verma, Alberto Speranzon, Siddharth Srivastava

Figure 1 for Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings

Figure 2 for Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings

Abstract:This paper introduces a new approach for continual planning and model learning in non-stationary stochastic environments expressed using relational representations. Such capabilities are essential for the deployment of sequential decision-making systems in the uncertain, constantly evolving real world. Working in such practical settings with unknown (and non-stationary) transition systems and changing tasks, the proposed framework models gaps in the agent's current state of knowledge and uses them to conduct focused, investigative explorations. Data collected using these explorations is used for learning generalizable probabilistic models for solving the current task despite continual changes in the environment dynamics. Empirical evaluations on several benchmark domains show that this approach significantly outperforms planning and RL baselines in terms of sample complexity in non-stationary settings. Theoretical results show that the system reverts to exhibit desirable convergence properties when stationarity holds.

* To appear at ICAPS-24

Via

Access Paper or Ask Questions

Indoor and Outdoor 3D Scene Graph Generation via Language-Enabled Spatial Ontologies

Dec 18, 2023

Jared Strader, Nathan Hughes, William Chen, Alberto Speranzon, Luca Carlone

Abstract:This paper proposes an approach to build 3D scene graphs in arbitrary (indoor and outdoor) environments. Such extension is challenging; the hierarchy of concepts that describe an outdoor environment is more complex than for indoors, and manually defining such hierarchy is time-consuming and does not scale. Furthermore, the lack of training data prevents the straightforward application of learning-based tools used in indoor settings. To address these challenges, we propose two novel extensions. First, we develop methods to build a spatial ontology defining concepts and relations relevant for indoor and outdoor robot operation. In particular, we use a Large Language Model (LLM) to build such an ontology, thus largely reducing the amount of manual effort required. Second, we leverage the spatial ontology for 3D scene graph construction using Logic Tensor Networks (LTN) to add logical rules, or axioms (e.g., "a beach contains sand"), which provide additional supervisory signals at training time thus reducing the need for labelled data, providing better predictions, and even allowing predicting concepts unseen at training time. We test our approach in a variety of datasets, including indoor, rural, and coastal environments, and show that it leads to a significant increase in the quality of the 3D scene graph generation with sparsely annotated data.

* 10 pages, 6 figures, submitted to Robotics and Automation Letters

Via

Access Paper or Ask Questions

A perspective on multi-agent communication for information fusion

Nov 09, 2019

Homagni Saha, Vijay Venkataraman, Alberto Speranzon, Soumik Sarkar

Figure 1 for A perspective on multi-agent communication for information fusion

Figure 2 for A perspective on multi-agent communication for information fusion

Figure 3 for A perspective on multi-agent communication for information fusion

Figure 4 for A perspective on multi-agent communication for information fusion

Abstract:Collaborative decision making in multi-agent systems typically requires a predefined communication protocol among agents. Usually, agent-level observations are locally processed and information is exchanged using the predefined protocol, enabling the team to perform more efficiently than each agent operating in isolation. In this work, we consider the situation where agents, with complementary sensing modalities must co-operate to achieve a common goal/task by learning an efficient communication protocol. We frame the problem within an actor-critic scheme, where the agents learn optimal policies in a centralized fashion, while taking action in a distributed manner. We provide an interpretation of the emergent communication between the agents. We observe that the information exchanged is not just an encoding of the raw sensor data but is, rather, a specific set of directive actions that depend on the overall task. Simulation results demonstrate the interpretability of the learnt communication in a variety of tasks.

* NeuRIPS 2019, Workshop on Visually Grounded Interaction and Language, Vancouver, CA

Via

Access Paper or Ask Questions

ECO: Egocentric Cognitive Mapping

Dec 02, 2018

Jayant Sharma, Zixing Wang, Alberto Speranzon, Vijay Venkataraman, Hyun Soo Park

Figure 1 for ECO: Egocentric Cognitive Mapping

Figure 2 for ECO: Egocentric Cognitive Mapping

Figure 3 for ECO: Egocentric Cognitive Mapping

Figure 4 for ECO: Egocentric Cognitive Mapping

Abstract:We present a new method to localize a camera within a previously unseen environment perceived from an egocentric point of view. Although this is, in general, an ill-posed problem, humans can effortlessly and efficiently determine their relative location and orientation and navigate into a previously unseen environments, e.g., finding a specific item in a new grocery store. To enable such a capability, we design a new egocentric representation, which we call ECO (Egocentric COgnitive map). ECO is biologically inspired, by the cognitive map that allows human navigation, and it encodes the surrounding visual semantics with respect to both distance and orientation. ECO possesses three main properties: (1) reconfigurability: complex semantics and geometry is captured via the synthesis of atomic visual representations (e.g., image patch); (2) robustness: the visual semantics are registered in a geometrically consistent way (e.g., aligning with respect to the gravity vector, frontalizing, and rescaling to canonical depth), thus enabling us to learn meaningful atomic representations; (3) adaptability: a domain adaptation framework is designed to generalize the learned representation without manual calibration. As a proof-of-concept, we use ECO to localize a camera within real-world scenes---various grocery stores---and demonstrate performance improvements when compared to existing semantic localization approaches.

Via

Access Paper or Ask Questions

Robust Belief Roadmap: Planning Under Intermittent Sensing

Sep 15, 2013

Shaunak D. Bopardikar, Brendan J. Englot, Alberto Speranzon

Figure 1 for Robust Belief Roadmap: Planning Under Intermittent Sensing

Figure 2 for Robust Belief Roadmap: Planning Under Intermittent Sensing

Figure 3 for Robust Belief Roadmap: Planning Under Intermittent Sensing

Figure 4 for Robust Belief Roadmap: Planning Under Intermittent Sensing

Abstract:In this paper, we extend the recent body of work on planning under uncertainty to include the fact that sensors may not provide any measurement owing to misdetection. This is caused either by adverse environmental conditions that prevent the sensors from making measurements or by the fundamental limitations of the sensors. Examples include RF-based ranging devices that intermittently do not receive the signal from beacons because of obstacles; the misdetection of features by a camera system in detrimental lighting conditions; a LIDAR sensor that is pointed at a glass-based material such as a window, etc. The main contribution of this paper is twofold. We first show that it is possible to obtain an analytical bound on the performance of a state estimator under sensor misdetection occurring stochastically over time in the environment. We then show how this bound can be used in a sample-based path planning algorithm to produce a path that trades off accuracy and robustness. Computational results demonstrate the benefit of the approach and comparisons are made with the state of the art in path planning under state uncertainty.

* 10 pages, 6 figures

Via

Access Paper or Ask Questions