Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ali-akbar Agha-mohammadi

Jet Propulsion Lab., California Institute of Technology and

FALCON: Learning Force-Adaptive Humanoid Loco-Manipulation

May 10, 2025

Yuanhang Zhang, Yifu Yuan, Prajwal Gurunath, Tairan He, Shayegan Omidshafiei, Ali-akbar Agha-mohammadi, Marcell Vazquez-Chanlatte, Liam Pedersen, Guanya Shi

Abstract:Humanoid loco-manipulation holds transformative potential for daily service and industrial tasks, yet achieving precise, robust whole-body control with 3D end-effector force interaction remains a major challenge. Prior approaches are often limited to lightweight tasks or quadrupedal/wheeled platforms. To overcome these limitations, we propose FALCON, a dual-agent reinforcement-learning-based framework for robust force-adaptive humanoid loco-manipulation. FALCON decomposes whole-body control into two specialized agents: (1) a lower-body agent ensuring stable locomotion under external force disturbances, and (2) an upper-body agent precisely tracking end-effector positions with implicit adaptive force compensation. These two agents are jointly trained in simulation with a force curriculum that progressively escalates the magnitude of external force exerted on the end effector while respecting torque limits. Experiments demonstrate that, compared to the baselines, FALCON achieves 2x more accurate upper-body joint tracking, while maintaining robust locomotion under force disturbances and achieving faster training convergence. Moreover, FALCON enables policy training without embodiment-specific reward or curriculum tuning. Using the same training setup, we obtain policies that are deployed across multiple humanoids, enabling forceful loco-manipulation tasks such as transporting payloads (0-20N force), cart-pulling (0-100N), and door-opening (0-40N) in the real world.

Via

Access Paper or Ask Questions

SayComply: Grounding Field Robotic Tasks in Operational Compliance through Retrieval-Based Language Models

Nov 18, 2024

Muhammad Fadhil Ginting, Dong-Ki Kim, Sung-Kyun Kim, Bandi Jai Krishna, Mykel J. Kochenderfer, Shayegan Omidshafiei, Ali-akbar Agha-mohammadi

Figure 1 for SayComply: Grounding Field Robotic Tasks in Operational Compliance through Retrieval-Based Language Models

Figure 2 for SayComply: Grounding Field Robotic Tasks in Operational Compliance through Retrieval-Based Language Models

Figure 3 for SayComply: Grounding Field Robotic Tasks in Operational Compliance through Retrieval-Based Language Models

Figure 4 for SayComply: Grounding Field Robotic Tasks in Operational Compliance through Retrieval-Based Language Models

Abstract:This paper addresses the problem of task planning for robots that must comply with operational manuals in real-world settings. Task planning under these constraints is essential for enabling autonomous robot operation in domains that require adherence to domain-specific knowledge. Current methods for generating robot goals and plans rely on common sense knowledge encoded in large language models. However, these models lack grounding of robot plans to domain-specific knowledge and are not easily transferable between multiple sites or customers with different compliance needs. In this work, we present SayComply, which enables grounding robotic task planning with operational compliance using retrieval-based language models. We design a hierarchical database of operational, environment, and robot embodiment manuals and procedures to enable efficient retrieval of the relevant context under the limited context length of the LLMs. We then design a task planner using a tree-based retrieval augmented generation (RAG) technique to generate robot tasks that follow user instructions while simultaneously complying with the domain knowledge in the database. We demonstrate the benefits of our approach through simulations and hardware experiments in real-world scenarios that require precise context retrieval across various types of context, outperforming the standard RAG method. Our approach bridges the gap in deploying robots that consistently adhere to operational protocols, offering a scalable and edge-deployable solution for ensuring compliance across varied and complex real-world environments. Project website: saycomply.github.io.

Via

Access Paper or Ask Questions

Capability-aware Task Allocation and Team Formation Analysis for Cooperative Exploration of Complex Environments

Nov 01, 2024

Muhammad Fadhil Ginting, Kyohei Otsu, Mykel J. Kochenderfer, Ali-akbar Agha-mohammadi

Figure 1 for Capability-aware Task Allocation and Team Formation Analysis for Cooperative Exploration of Complex Environments

Figure 2 for Capability-aware Task Allocation and Team Formation Analysis for Cooperative Exploration of Complex Environments

Figure 3 for Capability-aware Task Allocation and Team Formation Analysis for Cooperative Exploration of Complex Environments

Figure 4 for Capability-aware Task Allocation and Team Formation Analysis for Cooperative Exploration of Complex Environments

Abstract:To achieve autonomy in complex real-world exploration missions, we consider deployment strategies for a team of robots with heterogeneous autonomy capabilities. In this work, we formulate a multi-robot exploration mission and compute an operation policy to maintain robot team productivity and maximize mission rewards. The environment description, robot capability, and mission outcome are modeled as a Markov decision process (MDP). We also include constraints in real-world operation, such as sensor failures, limited communication coverage, and mobility-stressing elements. Then, we study the proposed operation model on a real-world scenario in the context of the DARPA Subterranean (SubT) Challenge. The computed deployment policy is also compared against the human-based operation strategy in the final competition of the SubT Challenge. Finally, using the proposed model, we discuss the design trade-off on building a multi-robot team with heterogeneous capabilities.

Via

Access Paper or Ask Questions

FRAME: A Modular Framework for Autonomous Map-merging: Advancements in the Field

Apr 27, 2024

Nikolaos Stathoulopoulos, Björn Lindqvist, Anton Koval, Ali-akbar Agha-mohammadi, George Nikolakopoulos

Figure 1 for FRAME: A Modular Framework for Autonomous Map-merging: Advancements in the Field

Figure 2 for FRAME: A Modular Framework for Autonomous Map-merging: Advancements in the Field

Figure 3 for FRAME: A Modular Framework for Autonomous Map-merging: Advancements in the Field

Figure 4 for FRAME: A Modular Framework for Autonomous Map-merging: Advancements in the Field

Abstract:In this article, a novel approach for merging 3D point cloud maps in the context of egocentric multi-robot exploration is presented. Unlike traditional methods, the proposed approach leverages state-of-the-art place recognition and learned descriptors to efficiently detect overlap between maps, eliminating the need for the time-consuming global feature extraction and feature matching process. The estimated overlapping regions are used to calculate a homogeneous rigid transform, which serves as an initial condition for the GICP point cloud registration algorithm to refine the alignment between the maps. The advantages of this approach include faster processing time, improved accuracy, and increased robustness in challenging environments. Furthermore, the effectiveness of the proposed framework is successfully demonstrated through multiple field missions of robot exploration in a variety of different underground environments.

* 28 pages, 24 figures. Submitted to Field Robotics

Via

Access Paper or Ask Questions

Low Frequency Sampling in Model Predictive Path Integral Control

Apr 03, 2024

Bogdan Vlahov, Jason Gibson, David D. Fan, Patrick Spieler, Ali-akbar Agha-mohammadi, Evangelos A. Theodorou

Abstract:Sampling-based model-predictive controllers have become a powerful optimization tool for planning and control problems in various challenging environments. In this paper, we show how the default choice of uncorrelated Gaussian distributions can be improved upon with the use of a colored noise distribution. Our choice of distribution allows for the emphasis on low frequency control signals, which can result in smoother and more exploratory samples. We use this frequency-based sampling distribution with Model Predictive Path Integral (MPPI) in both hardware and simulation experiments to show better or equal performance on systems with various speeds of input response.

* Accepted to RA-L

Via

Access Paper or Ask Questions

Staircase Localization for Autonomous Exploration in Urban Environments

Mar 26, 2024

Jinrae Kim, Sunggoo Jung, Sung-Kyun Kim, Youdan Kim, Ali-akbar Agha-mohammadi

Figure 1 for Staircase Localization for Autonomous Exploration in Urban Environments

Figure 2 for Staircase Localization for Autonomous Exploration in Urban Environments

Figure 3 for Staircase Localization for Autonomous Exploration in Urban Environments

Figure 4 for Staircase Localization for Autonomous Exploration in Urban Environments

Abstract:A staircase localization method is proposed for robots to explore urban environments autonomously. The proposed method employs a modular design in the form of a cascade pipeline consisting of three modules of stair detection, line segment detection, and stair localization modules. The stair detection module utilizes an object detection algorithm based on deep learning to generate a region of interest (ROI). From the ROI, line segment features are extracted using a deep line segment detection algorithm. The extracted line segments are used to localize a staircase in terms of position, orientation, and stair direction. The stair detection and localization are performed only with a single RGB-D camera. Each component of the proposed pipeline does not need to be designed particularly for staircases, which makes it easy to maintain the whole pipeline and replace each component with state-of-the-art deep learning detection techniques. The results of real-world experiments show that the proposed method can perform accurate stair detection and localization during autonomous exploration for various structured and unstructured upstairs and downstairs with shadows, dirt, and occlusions by artificial and natural objects.

* 9 pages, 10 figures

Via

Access Paper or Ask Questions

Semantic Belief Behavior Graph: Enabling Autonomous Robot Inspection in Unknown Environments

Jan 30, 2024

Muhammad Fadhil Ginting, David D. Fan, Sung-Kyun Kim, Mykel J. Kochenderfer, Ali-akbar Agha-mohammadi

Figure 1 for Semantic Belief Behavior Graph: Enabling Autonomous Robot Inspection in Unknown Environments

Figure 2 for Semantic Belief Behavior Graph: Enabling Autonomous Robot Inspection in Unknown Environments

Figure 3 for Semantic Belief Behavior Graph: Enabling Autonomous Robot Inspection in Unknown Environments

Figure 4 for Semantic Belief Behavior Graph: Enabling Autonomous Robot Inspection in Unknown Environments

Abstract:This paper addresses the problem of autonomous robotic inspection in complex and unknown environments. This capability is crucial for efficient and precise inspections in various real-world scenarios, even when faced with perceptual uncertainty and lack of prior knowledge of the environment. Existing methods for real-world autonomous inspections typically rely on predefined targets and waypoints and often fail to adapt to dynamic or unknown settings. In this work, we introduce the Semantic Belief Behavior Graph (SB2G) framework as a novel approach to semantic-aware autonomous robot inspection. SB2G generates a control policy for the robot, featuring behavior nodes that encapsulate various semantic-based policies designed for inspecting different classes of objects. We design an active semantic search behavior to guide the robot in locating objects for inspection while reducing semantic information uncertainty. The edges in the SB2G encode transitions between these behaviors. We validate our approach through simulation and real-world urban inspections using a legged robotic platform. Our results show that SB2G enables a more efficient inspection policy, exhibiting performance comparable to human-operated inspections.

Via

Access Paper or Ask Questions

Towards a Reduced Dependency Framework for Autonomous Unified Inspect-Explore Missions

Sep 01, 2023

Vignesh Kottayam Viswanathan, Sumeet Gajanan Satpute, Ali-akbar Agha-mohammadi, George Nikolakopoulos

Figure 1 for Towards a Reduced Dependency Framework for Autonomous Unified Inspect-Explore Missions

Figure 2 for Towards a Reduced Dependency Framework for Autonomous Unified Inspect-Explore Missions

Figure 3 for Towards a Reduced Dependency Framework for Autonomous Unified Inspect-Explore Missions

Figure 4 for Towards a Reduced Dependency Framework for Autonomous Unified Inspect-Explore Missions

Abstract:The task of establishing and maintaining situational awareness in an unknown environment is a critical step to fulfil in a mission related to the field of rescue robotics. Predominantly, the problem of visual inspection of urban structures is dealt with view-planning being addressed by map-based approaches. In this article, we propose a novel approach towards effective use of Micro Aerial Vehicles (MAVs) for obtaining a 3-D shape of an unknown structure of objects utilizing a map-independent planning framework. The problem is undertaken via a bifurcated approach to address the task of executing a closer inspection of detected structures with a wider exploration strategy to identify and locate nearby structures, while being equipped with limited sensing capability. The proposed framework is evaluated experimentally in a controlled indoor environment in presence of a mock-up environment validating the efficacy of the proposed inspect-explore policy.

Via

Access Paper or Ask Questions

Contact-Prioritized Planning of Impact-Resilient Aerial Robots with an Integrated Compliant Arm

May 24, 2023

Zhichao Liu, Zhouyu Lu, Ali-akbar Agha-mohammadi, Konstantinos Karydis

Figure 1 for Contact-Prioritized Planning of Impact-Resilient Aerial Robots with an Integrated Compliant Arm

Figure 2 for Contact-Prioritized Planning of Impact-Resilient Aerial Robots with an Integrated Compliant Arm

Figure 3 for Contact-Prioritized Planning of Impact-Resilient Aerial Robots with an Integrated Compliant Arm

Figure 4 for Contact-Prioritized Planning of Impact-Resilient Aerial Robots with an Integrated Compliant Arm

Abstract:The article develops an impact-resilient aerial robot (s-ARQ) equipped with a compliant arm to sense contacts and reduce collision impact and featuring a real-time contact force estimator and a non-linear motion controller to handle collisions while performing aggressive maneuvers and stabilize from high-speed wall collisions. Further, a new collision-inclusive planning method that aims to prioritize contacts to facilitate aerial robot navigation in cluttered environments is proposed. A range of simulated and physical experiments demonstrate key benefits of the robot and the contact-prioritized (CP) planner. Experimental results show that the compliant robot has only a $4\%$ weight increase but around $40\%$ impact reduction in drop tests and wall collision tests. s-ARQ can handle collisions while performing aggressive maneuvers and stabilize from high-speed wall collisions at $3.0$ m/s with a success rate of $100\%$. Our proposed compliant robot and contact-prioritized planning method can accelerate computation time while having shorter trajectory time and larger clearances compared to A$^\ast$ and RRT$^\ast$ planners with velocity constraints. Online planning tests in partially-known environments further demonstrate the preliminary feasibility of our method to apply in practical use cases.

* To appear in TMECH/AIM FS. Video https://youtu.be/Ks1tMMtjfGg

Via

Access Paper or Ask Questions

A Multi-step Dynamics Modeling Framework For Autonomous Driving In Multiple Environments

May 03, 2023

Jason Gibson, Bogdan Vlahov, David Fan, Patrick Spieler, Daniel Pastor, Ali-akbar Agha-mohammadi, Evangelos A. Theodorou

Abstract:Modeling dynamics is often the first step to making a vehicle autonomous. While on-road autonomous vehicles have been extensively studied, off-road vehicles pose many challenging modeling problems. An off-road vehicle encounters highly complex and difficult-to-model terrain/vehicle interactions, as well as having complex vehicle dynamics of its own. These complexities can create challenges for effective high-speed control and planning. In this paper, we introduce a framework for multistep dynamics prediction that explicitly handles the accumulation of modeling error and remains scalable for sampling-based controllers. Our method uses a specially-initialized Long Short-Term Memory (LSTM) over a limited time horizon as the learned component in a hybrid model to predict the dynamics of a 4-person seating all-terrain vehicle (Polaris S4 1000 RZR) in two distinct environments. By only having the LSTM predict over a fixed time horizon, we negate the need for long term stability that is often a challenge when training recurrent neural networks. Our framework is flexible as it only requires odometry information for labels. Through extensive experimentation, we show that our method is able to predict millions of possible trajectories in real-time, with a time horizon of five seconds in challenging off road driving scenarios.

Via

Access Paper or Ask Questions