Abstract: We introduce an open-source system called SIGMA (short for "Situated Interactive Guidance, Monitoring, and Assistance") as a platform for conducting research on task-assistive agents in mixed-reality scenarios. The system leverages the sensing and rendering affordances of a head-mounted mixed-reality device in conjunction with large language and vision models to guide users step by step through procedural tasks. We present the system's core capabilities, discuss its overall design and implementation, and outline directions for future research enabled by the system. SIGMA is easily extensible and provides a useful basis for future work at the intersection of mixed reality and AI. By open-sourcing an end-to-end implementation, we aim to lower the barrier to entry, accelerate research in this space, and chart a path towards community-driven end-to-end evaluation of large language, vision, and multimodal models in the context of real-world interactive applications.
Abstract: We introduce Platform for Situated Intelligence, an open-source framework created to support the rapid development and study of multimodal, integrative-AI systems. The framework provides infrastructure for sensing, fusing, and making inferences from temporal streams of data across different modalities, a set of tools that enable visualization and debugging, and an ecosystem of components that encapsulate a variety of perception and processing technologies. These assets jointly provide the means for rapidly constructing and refining multimodal, integrative-AI systems, while retaining the efficiency and performance characteristics required for deployment in open-world settings.