Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Anthony Francis

Logical Robotics

VADER: Visual Affordance Detection and Error Recovery for Multi Robot Human Collaboration

May 25, 2024

Michael Ahn, Montserrat Gonzalez Arenas, Matthew Bennice, Noah Brown, Christine Chan, Byron David, Anthony Francis, Gavin Gonzalez, Rainer Hessmer, Tomas Jackson(+15 more)

Figure 1 for VADER: Visual Affordance Detection and Error Recovery for Multi Robot Human Collaboration

Figure 2 for VADER: Visual Affordance Detection and Error Recovery for Multi Robot Human Collaboration

Figure 3 for VADER: Visual Affordance Detection and Error Recovery for Multi Robot Human Collaboration

Figure 4 for VADER: Visual Affordance Detection and Error Recovery for Multi Robot Human Collaboration

Abstract:Robots today can exploit the rich world knowledge of large language models to chain simple behavioral skills into long-horizon tasks. However, robots often get interrupted during long-horizon tasks due to primitive skill failures and dynamic environments. We propose VADER, a plan, execute, detect framework with seeking help as a new skill that enables robots to recover and complete long-horizon tasks with the help of humans or other robots. VADER leverages visual question answering (VQA) modules to detect visual affordances and recognize execution errors. It then generates prompts for a language model planner (LMP) which decides when to seek help from another robot or human to recover from errors in long-horizon task execution. We show the effectiveness of VADER with two long-horizon robotic tasks. Our pilot study showed that VADER is capable of performing complex long-horizon tasks by asking for help from another robot to clear a table. Our user study showed that VADER is capable of performing complex long-horizon tasks by asking for help from a human to clear a path. We gathered feedback from people (N=19) about the performance of the VADER performance vs. a robot that did not ask for help. https://google-vader.github.io/

* 9 pages, 4 figures

Via

Access Paper or Ask Questions

Principles and Guidelines for Evaluating Social Robot Navigation Algorithms

Jun 29, 2023

Anthony Francis, Claudia Perez-D'Arpino, Chengshu Li, Fei Xia, Alexandre Alahi, Rachid Alami, Aniket Bera, Abhijat Biswas, Joydeep Biswas, Rohan Chandra(+21 more)

Figure 1 for Principles and Guidelines for Evaluating Social Robot Navigation Algorithms

Figure 2 for Principles and Guidelines for Evaluating Social Robot Navigation Algorithms

Figure 3 for Principles and Guidelines for Evaluating Social Robot Navigation Algorithms

Figure 4 for Principles and Guidelines for Evaluating Social Robot Navigation Algorithms

Abstract:A major challenge to deploying robots widely is navigation in human-populated environments, commonly referred to as social robot navigation. While the field of social navigation has advanced tremendously in recent years, the fair evaluation of algorithms that tackle social navigation remains hard because it involves not just robotic agents moving in static environments but also dynamic human agents and their perceptions of the appropriateness of robot behavior. In contrast, clear, repeatable, and accessible benchmarks have accelerated progress in fields like computer vision, natural language processing and traditional robot navigation by enabling researchers to fairly compare algorithms, revealing limitations of existing solutions and illuminating promising new directions. We believe the same approach can benefit social navigation. In this paper, we pave the road towards common, widely accessible, and repeatable benchmarking criteria to evaluate social robot navigation. Our contributions include (a) a definition of a socially navigating robot as one that respects the principles of safety, comfort, legibility, politeness, social competency, agent understanding, proactivity, and responsiveness to context, (b) guidelines for the use of metrics, development of scenarios, benchmarks, datasets, and simulators to evaluate social navigation, and (c) a design of a social navigation metrics framework to make it easier to compare results from different simulators, robots and datasets.

* 43 pages, 11 figures, 6 tables

Via

Access Paper or Ask Questions

Retrospectives on the Embodied AI Workshop

Oct 17, 2022

Matt Deitke, Dhruv Batra, Yonatan Bisk, Tommaso Campari, Angel X. Chang, Devendra Singh Chaplot, Changan Chen, Claudia Pérez D'Arpino, Kiana Ehsani, Ali Farhadi(+29 more)

Figure 1 for Retrospectives on the Embodied AI Workshop

Figure 2 for Retrospectives on the Embodied AI Workshop

Figure 3 for Retrospectives on the Embodied AI Workshop

Figure 4 for Retrospectives on the Embodied AI Workshop

Abstract:We present a retrospective on the state of Embodied AI research. Our analysis focuses on 13 challenges presented at the Embodied AI Workshop at CVPR. These challenges are grouped into three themes: (1) visual navigation, (2) rearrangement, and (3) embodied vision-and-language. We discuss the dominant datasets within each theme, evaluation metrics for the challenges, and the performance of state-of-the-art models. We highlight commonalities between top approaches to the challenges and identify potential future directions for Embodied AI research.

Via

Access Paper or Ask Questions

Learning Model Predictive Controllers with Real-Time Attention for Real-World Navigation

Sep 24, 2022

Xuesu Xiao, Tingnan Zhang, Krzysztof Choromanski, Edward Lee, Anthony Francis, Jake Varley, Stephen Tu, Sumeet Singh, Peng Xu, Fei Xia(+7 more)

Figure 1 for Learning Model Predictive Controllers with Real-Time Attention for Real-World Navigation

Figure 2 for Learning Model Predictive Controllers with Real-Time Attention for Real-World Navigation

Figure 3 for Learning Model Predictive Controllers with Real-Time Attention for Real-World Navigation

Figure 4 for Learning Model Predictive Controllers with Real-Time Attention for Real-World Navigation

Abstract:Despite decades of research, existing navigation systems still face real-world challenges when deployed in the wild, e.g., in cluttered home environments or in human-occupied public spaces. To address this, we present a new class of implicit control policies combining the benefits of imitation learning with the robust handling of system constraints from Model Predictive Control (MPC). Our approach, called Performer-MPC, uses a learned cost function parameterized by vision context embeddings provided by Performers -- a low-rank implicit-attention Transformer. We jointly train the cost function and construct the controller relying on it, effectively solving end-to-end the corresponding bi-level optimization problem. We show that the resulting policy improves standard MPC performance by leveraging a few expert demonstrations of the desired navigation behavior in different challenging real-world scenarios. Compared with a standard MPC policy, Performer-MPC achieves >40% better goal reached in cluttered environments and >65% better on social metrics when navigating around humans.

Via

Access Paper or Ask Questions

Gesture2Path: Imitation Learning for Gesture-aware Navigation

Sep 19, 2022

Catie Cuan, Edward Lee, Emre Fisher, Anthony Francis, Leila Takayama, Tingnan Zhang, Alexander Toshev, Sören Pirk

Figure 1 for Gesture2Path: Imitation Learning for Gesture-aware Navigation

Figure 2 for Gesture2Path: Imitation Learning for Gesture-aware Navigation

Figure 3 for Gesture2Path: Imitation Learning for Gesture-aware Navigation

Figure 4 for Gesture2Path: Imitation Learning for Gesture-aware Navigation

Abstract:As robots increasingly enter human-centered environments, they must not only be able to navigate safely around humans, but also adhere to complex social norms. Humans often rely on non-verbal communication through gestures and facial expressions when navigating around other people, especially in densely occupied spaces. Consequently, robots also need to be able to interpret gestures as part of solving social navigation tasks. To this end, we present Gesture2Path, a novel social navigation approach that combines image-based imitation learning with model-predictive control. Gestures are interpreted based on a neural network that operates on streams of images, while we use a state-of-the-art model predictive control algorithm to solve point-to-point navigation tasks. We deploy our method on real robots and showcase the effectiveness of our approach for the four gestures-navigation scenarios: left/right, follow me, and make a circle. Our experiments indicate that our method is able to successfully interpret complex human gestures and to use them as a signal to generate socially compliant trajectories for navigation tasks. We validated our method based on in-situ ratings of participants interacting with the robots.

* 8 pages, 12 figures

Via

Access Paper or Ask Questions

Google Scanned Objects: A High-Quality Dataset of 3D Scanned Household Items

Apr 25, 2022

Laura Downs, Anthony Francis, Nate Koenig, Brandon Kinman, Ryan Hickman, Krista Reymann, Thomas B. McHugh, Vincent Vanhoucke

Figure 1 for Google Scanned Objects: A High-Quality Dataset of 3D Scanned Household Items

Figure 2 for Google Scanned Objects: A High-Quality Dataset of 3D Scanned Household Items

Figure 3 for Google Scanned Objects: A High-Quality Dataset of 3D Scanned Household Items

Figure 4 for Google Scanned Objects: A High-Quality Dataset of 3D Scanned Household Items

Abstract:Interactive 3D simulations have enabled breakthroughs in robotics and computer vision, but simulating the broad diversity of environments needed for deep learning requires large corpora of photo-realistic 3D object models. To address this need, we present Google Scanned Objects, an open-source collection of over one thousand 3D-scanned household items released under a Creative Commons license; these models are preprocessed for use in Ignition Gazebo and the Bullet simulation platforms, but are easily adaptable to other simulators. We describe our object scanning and curation pipeline, then provide statistics about the contents of the dataset and its usage. We hope that the diversity, quality, and flexibility of Google Scanned Objects will lead to advances in interactive simulation, synthetic perception, and robotic learning.

* 8 pages, 5 figures, 4 tables; to appear in the conference proceedings of ICRA 2022

Via

Access Paper or Ask Questions

A Protocol for Validating Social Navigation Policies

Apr 11, 2022

Sören Pirk, Edward Lee, Xuesu Xiao, Leila Takayama, Anthony Francis, Alexander Toshev

Figure 1 for A Protocol for Validating Social Navigation Policies

Figure 2 for A Protocol for Validating Social Navigation Policies

Figure 3 for A Protocol for Validating Social Navigation Policies

Figure 4 for A Protocol for Validating Social Navigation Policies

Abstract:Enabling socially acceptable behavior for situated agents is a major goal of recent robotics research. Robots should not only operate safely around humans, but also abide by complex social norms. A key challenge for developing socially-compliant policies is measuring the quality of their behavior. Social behavior is enormously complex, making it difficult to create reliable metrics to gauge the performance of algorithms. In this paper, we propose a protocol for social navigation benchmarking that defines a set of canonical social navigation scenarios and an in-situ metric for evaluating performance on these scenarios using questionnaires. Our experiments show this protocol is realistic, scalable, and repeatable across runs and physical spaces. Our protocol can be replicated verbatim or it can be used to define a social navigation benchmark for novel scenarios. Our goal is to introduce a protocol for benchmarking social scenarios that is homogeneous and comparable.

Via

Access Paper or Ask Questions

Style-based quantum generative adversarial networks for Monte Carlo events

Oct 13, 2021

Carlos Bravo-Prieto, Julien Baglio, Marco Cè, Anthony Francis, Dorota M. Grabowska, Stefano Carrazza

Figure 1 for Style-based quantum generative adversarial networks for Monte Carlo events

Figure 2 for Style-based quantum generative adversarial networks for Monte Carlo events

Figure 3 for Style-based quantum generative adversarial networks for Monte Carlo events

Figure 4 for Style-based quantum generative adversarial networks for Monte Carlo events

Abstract:We propose and assess an alternative quantum generator architecture in the context of generative adversarial learning for Monte Carlo event generation, used to simulate particle physics processes at the Large Hadron Collider (LHC). We validate this methodology by implementing the quantum network on artificial data generated from known underlying distributions. The network is then applied to Monte Carlo-generated datasets of specific LHC scattering processes. The new quantum generator architecture leads to an improvement in state-of-the-art implementations while maintaining shallow-depth networks. Moreover, the quantum generator successfully learns the underlying distribution functions even if trained with small training sample sets; this is particularly interesting for data augmentation applications. We deploy this novel methodology on two different quantum hardware architectures, trapped-ion and superconducting technologies, to test its hardware-independent viability.

* 14 pages, 10 figures, code available in https://github.com/QTI-TH/style-qgan

Via

Access Paper or Ask Questions

Evolving Rewards to Automate Reinforcement Learning

May 18, 2019

Aleksandra Faust, Anthony Francis, Dar Mehta

Figure 1 for Evolving Rewards to Automate Reinforcement Learning

Figure 2 for Evolving Rewards to Automate Reinforcement Learning

Figure 3 for Evolving Rewards to Automate Reinforcement Learning

Figure 4 for Evolving Rewards to Automate Reinforcement Learning

Abstract:Many continuous control tasks have easily formulated objectives, yet using them directly as a reward in reinforcement learning (RL) leads to suboptimal policies. Therefore, many classical control tasks guide RL training using complex rewards, which require tedious hand-tuning. We automate the reward search with AutoRL, an evolutionary layer over standard RL that treats reward tuning as hyperparameter optimization and trains a population of RL agents to find a reward that maximizes the task objective. AutoRL, evaluated on four Mujoco continuous control tasks over two RL algorithms, shows improvements over baselines, with the the biggest uplift for more complex tasks. The video can be found at: \url{https://youtu.be/svdaOFfQyC8}.

* Accepted to 6th AutoML@ICML

Via

Access Paper or Ask Questions

Long-Range Indoor Navigation with PRM-RL

Feb 25, 2019

Anthony Francis, Aleksandra Faust, Hao-Tien Lewis Chiang, Jasmine Hsu, J. Chase Kew, Marek Fiser, Tsang-Wei Edward Lee

Figure 1 for Long-Range Indoor Navigation with PRM-RL

Figure 2 for Long-Range Indoor Navigation with PRM-RL

Figure 3 for Long-Range Indoor Navigation with PRM-RL

Figure 4 for Long-Range Indoor Navigation with PRM-RL

Abstract:Long-range indoor navigation requires guiding robots with noisy sensors and controls through cluttered environments along paths that span a variety of buildings. We achieve this with PRM-RL, a hierarchical robot navigation method in which reinforcement learning agents that map noisy sensors to robot controls learn to solve short-range obstacle avoidance tasks, and then sampling-based planners map where these agents can reliably navigate in simulation; these roadmaps and agents are then deployed on-robot, guiding the robot along the shortest path where the agents are likely to succeed. Here we use Probabilistic Roadmaps (PRMs) as the sampling-based planner and AutoRL as the reinforcement learning method in the indoor navigation context. We evaluate the method in simulation for kinematic differential drive and kinodynamic car-like robots in several environments, and on-robot for differential-drive robots at two physical sites. Our results show PRM-RL with AutoRL is more successful than several baselines, is robust to noise, and can guide robots over hundreds of meters in the face of noise and obstacles in both simulation and on-robot, including over 3.3 kilometers of physical robot navigation.

* 19 pages; 15 Figures; 6 tables

Via

Access Paper or Ask Questions