Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mohsen Kaboli

Edge Training and Inference with Analog ReRAM Technology for Hand Gesture Recognition

Feb 25, 2025

Victoria Clerico, Anirvan Dutta, Donato Francesco Falcone, Wooseok Choi, Matteo Galetta, Tommaso Stecconi, András Horváth, Shokoofeh Varzandeh, Bert Jan Offrein, Mohsen Kaboli(+1 more)

Abstract:Tactile hand gesture recognition is a crucial task for user control in the automotive sector, where Human-Machine Interactions (HMI) demand low latency and high energy efficiency. This study addresses the challenges of power-constrained edge training and inference by utilizing analog Resistive Random Access Memory (ReRAM) technology in conjunction with a real tactile hand gesture dataset. By optimizing the input space through a feature engineering strategy, we avoid relying on large-scale crossbar arrays, making the system more suitable for edge deployment. Through realistic hardware-aware simulations that account for device non-idealities derived from experimental data, we demonstrate the functionalities of our analog ReRAM-based analog in-memory computing for on-chip training, utilizing the state-of-the-art Tiki-Taka algorithm. Furthermore, we validate the classification accuracy of approximately 91.4% for post-deployment inference of hand gestures. The results highlight the potential of analog ReRAM technology and crossbar architecture with fully parallelized matrix computations for real-time HMI systems at the Edge.

* Accepted in IEEE ISCAS 2025

Via

Access Paper or Ask Questions

Predictive Visuo-Tactile Interactive Perception Framework for Object Properties Inference

Nov 13, 2024

Anirvan Dutta, Etienne Burdet, Mohsen Kaboli

Abstract:Interactive exploration of the unknown physical properties of objects such as stiffness, mass, center of mass, friction coefficient, and shape is crucial for autonomous robotic systems operating continuously in unstructured environments. Precise identification of these properties is essential to manipulate objects in a stable and controlled way, and is also required to anticipate the outcomes of (prehensile or non-prehensile) manipulation actions such as pushing, pulling, lifting, etc. Our study focuses on autonomously inferring the physical properties of a diverse set of various homogeneous, heterogeneous, and articulated objects utilizing a robotic system equipped with vision and tactile sensors. We propose a novel predictive perception framework for identifying object properties of the diverse objects by leveraging versatile exploratory actions: non-prehensile pushing and prehensile pulling. As part of the framework, we propose a novel active shape perception to seamlessly initiate exploration. Our innovative dual differentiable filtering with Graph Neural Networks learns the object-robot interaction and performs consistent inference of indirectly observable time-invariant object properties. In addition, we formulate a $N$-step information gain approach to actively select the most informative actions for efficient learning and inference. Extensive real-robot experiments with planar objects show that our predictive perception framework results in better performance than the state-of-the-art baseline and demonstrate our framework in three major applications for i) object tracking, ii) goal-driven task, and iii) change in environment detection.

Via

Access Paper or Ask Questions

Advancements in Tactile Hand Gesture Recognition for Enhanced Human-Machine Interaction

May 27, 2024

Chiara Fumelli, Anirvan Dutta, Mohsen Kaboli

Figure 1 for Advancements in Tactile Hand Gesture Recognition for Enhanced Human-Machine Interaction

Figure 2 for Advancements in Tactile Hand Gesture Recognition for Enhanced Human-Machine Interaction

Figure 3 for Advancements in Tactile Hand Gesture Recognition for Enhanced Human-Machine Interaction

Figure 4 for Advancements in Tactile Hand Gesture Recognition for Enhanced Human-Machine Interaction

Abstract:Motivated by the growing interest in enhancing intuitive physical Human-Machine Interaction (HRI/HVI), this study aims to propose a robust tactile hand gesture recognition system. We performed a comprehensive evaluation of different hand gesture recognition approaches for a large area tactile sensing interface (touch interface) constructed from conductive textiles. Our evaluation encompassed traditional feature engineering methods, as well as contemporary deep learning techniques capable of real-time interpretation of a range of hand gestures, accommodating variations in hand sizes, movement velocities, applied pressure levels, and interaction points. Our extensive analysis of the various methods makes a significant contribution to tactile-based gesture recognition in the field of human-machine interaction.

Via

Access Paper or Ask Questions

Visuo-Tactile based Predictive Cross Modal Perception for Object Exploration in Robotics

May 23, 2024

Anirvan Dutta, Etienne Burdet, Mohsen Kaboli

Abstract:Autonomously exploring the unknown physical properties of novel objects such as stiffness, mass, center of mass, friction coefficient, and shape is crucial for autonomous robotic systems operating continuously in unstructured environments. We introduce a novel visuo-tactile based predictive cross-modal perception framework where initial visual observations (shape) aid in obtaining an initial prior over the object properties (mass). The initial prior improves the efficiency of the object property estimation, which is autonomously inferred via interactive non-prehensile pushing and using a dual filtering approach. The inferred properties are then used to enhance the predictive capability of the cross-modal function efficiently by using a human-inspired `surprise' formulation. We evaluated our proposed framework in the real-robotic scenario, demonstrating superior performance.

* Accepted at IEEE International Symposium on Robotic and Sensors Environments 2024

Via

Access Paper or Ask Questions

Push to know! -- Visuo-Tactile based Active Object Parameter Inference with Dual Differentiable Filtering

Aug 02, 2023

Anirvan Dutta, Etienne Burdet, Mohsen Kaboli

Figure 1 for Push to know! -- Visuo-Tactile based Active Object Parameter Inference with Dual Differentiable Filtering

Figure 2 for Push to know! -- Visuo-Tactile based Active Object Parameter Inference with Dual Differentiable Filtering

Figure 3 for Push to know! -- Visuo-Tactile based Active Object Parameter Inference with Dual Differentiable Filtering

Figure 4 for Push to know! -- Visuo-Tactile based Active Object Parameter Inference with Dual Differentiable Filtering

Abstract:For robotic systems to interact with objects in dynamic environments, it is essential to perceive the physical properties of the objects such as shape, friction coefficient, mass, center of mass, and inertia. This not only eases selecting manipulation action but also ensures the task is performed as desired. However, estimating the physical properties of especially novel objects is a challenging problem, using either vision or tactile sensing. In this work, we propose a novel framework to estimate key object parameters using non-prehensile manipulation using vision and tactile sensing. Our proposed active dual differentiable filtering (ADDF) approach as part of our framework learns the object-robot interaction during non-prehensile object push to infer the object's parameters. Our proposed method enables the robotic system to employ vision and tactile information to interactively explore a novel object via non-prehensile object push. The novel proposed N-step active formulation within the differentiable filtering facilitates efficient learning of the object-robot interaction model and during inference by selecting the next best exploratory push actions (where to push? and how to push?). We extensively evaluated our framework in simulation and real-robotic scenarios, yielding superior performance to the state-of-the-art baseline.

* 8 pages. Accepted at IROS 2023

Via

Access Paper or Ask Questions

Touch if it's transparent! ACTOR: Active Tactile-based Category-Level Transparent Object Reconstruction

Jul 30, 2023

Prajval Kumar Murali, Bernd Porr, Mohsen Kaboli

Abstract:Accurate shape reconstruction of transparent objects is a challenging task due to their non-Lambertian surfaces and yet necessary for robots for accurate pose perception and safe manipulation. As vision-based sensing can produce erroneous measurements for transparent objects, the tactile modality is not sensitive to object transparency and can be used for reconstructing the object's shape. We propose ACTOR, a novel framework for ACtive tactile-based category-level Transparent Object Reconstruction. ACTOR leverages large datasets of synthetic object with our proposed self-supervised learning approach for object shape reconstruction as the collection of real-world tactile data is prohibitively expensive. ACTOR can be used during inference with tactile data from category-level unknown transparent objects for reconstruction. Furthermore, we propose an active-tactile object exploration strategy as probing every part of the object surface can be sample inefficient. We also demonstrate tactile-based category-level object pose estimation task using ACTOR. We perform an extensive evaluation of our proposed methodology with real-world robotic experiments with comprehensive comparison studies with state-of-the-art approaches. Our proposed method outperforms these approaches in terms of tactile-based object reconstruction and object pose estimation.

* Accepted for publication at IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023)

Via

Access Paper or Ask Questions

GMCR: Graph-based Maximum Consensus Estimation for Point Cloud Registration

Mar 07, 2023

Michael Gentner, Prajval Kumar Murali, Mohsen Kaboli

Abstract:Point cloud registration is a fundamental and challenging problem for autonomous robots interacting in unstructured environments for applications such as object pose estimation, simultaneous localization and mapping, robot-sensor calibration, and so on. In global correspondence-based point cloud registration, data association is a highly brittle task and commonly produces high amounts of outliers. Failure to reject outliers can lead to errors propagating to downstream perception tasks. Maximum Consensus (MC) is a widely used technique for robust estimation, which is however known to be NP-hard. Exact methods struggle to scale to realistic problem instances, whereas high outlier rates are challenging for approximate methods. To this end, we propose Graph-based Maximum Consensus Registration (GMCR), which is highly robust to outliers and scales to realistic problem instances. We propose novel consensus functions to map the decoupled MC-objective to the graph domain, wherein we find a tight approximation to the maximum consensus set as the maximum clique. The final pose estimate is given in closed-form. We extensively evaluated our proposed GMCR on a synthetic registration benchmark, robotic object localization task, and additionally on a scan matching benchmark. Our proposed method shows high accuracy and time efficiency compared to other state-of-the-art MC methods and compares favorably to other robust registration methods.

* Accepted at icra 2023

Via

Access Paper or Ask Questions

An Empirical Evaluation of Various Information Gain Criteria for Active Tactile Action Selection for Pose Estimation

May 10, 2022

Prajval Kumar Murali, Ravinder Dahiya, Mohsen Kaboli

Figure 1 for An Empirical Evaluation of Various Information Gain Criteria for Active Tactile Action Selection for Pose Estimation

Figure 2 for An Empirical Evaluation of Various Information Gain Criteria for Active Tactile Action Selection for Pose Estimation

Figure 3 for An Empirical Evaluation of Various Information Gain Criteria for Active Tactile Action Selection for Pose Estimation

Abstract:Accurate object pose estimation using multi-modal perception such as visual and tactile sensing have been used for autonomous robotic manipulators in literature. Due to variation in density of visual and tactile data, we previously proposed a novel probabilistic Bayesian filter-based approach termed translation-invariant Quaternion filter (TIQF) for pose estimation. As tactile data collection is time consuming, active tactile data collection is preferred by reasoning over multiple potential actions for maximal expected information gain. In this paper, we empirically evaluate various information gain criteria for action selection in the context of object pose estimation. We demonstrate the adaptability and effectiveness of our proposed TIQF pose estimation approach with various information gain criteria. We find similar performance in terms of pose accuracy with sparse measurements across all the selected criteria.

* arXiv admin note: substantial text overlap with arXiv:2109.13540

Via

Access Paper or Ask Questions

Towards Robust 3D Object Recognition with Dense-to-Sparse Deep Domain Adaptation

May 07, 2022

Prajval Kumar Murali, Cong Wang, Ravinder Dahiya, Mohsen Kaboli

Figure 1 for Towards Robust 3D Object Recognition with Dense-to-Sparse Deep Domain Adaptation

Figure 2 for Towards Robust 3D Object Recognition with Dense-to-Sparse Deep Domain Adaptation

Figure 3 for Towards Robust 3D Object Recognition with Dense-to-Sparse Deep Domain Adaptation

Figure 4 for Towards Robust 3D Object Recognition with Dense-to-Sparse Deep Domain Adaptation

Abstract:Three-dimensional (3D) object recognition is crucial for intelligent autonomous agents such as autonomous vehicles and robots alike to operate effectively in unstructured environments. Most state-of-art approaches rely on relatively dense point clouds and performance drops significantly for sparse point clouds. Unsupervised domain adaption allows to minimise the discrepancy between dense and sparse point clouds with minimal unlabelled sparse point clouds, thereby saving additional sparse data collection, annotation and retraining costs. In this work, we propose a novel method for point cloud based object recognition with competitive performance with state-of-art methods on dense and sparse point clouds while being trained only with dense point clouds.

Via

Access Paper or Ask Questions

Active Visuo-Tactile Interactive Robotic Perception for Accurate Object Pose Estimation in Dense Clutter

Feb 04, 2022

Prajval Kumar Murali, Anirvan Dutta, Michael Gentner, Etienne Burdet, Ravinder Dahiya, Mohsen Kaboli

Figure 1 for Active Visuo-Tactile Interactive Robotic Perception for Accurate Object Pose Estimation in Dense Clutter

Figure 2 for Active Visuo-Tactile Interactive Robotic Perception for Accurate Object Pose Estimation in Dense Clutter

Figure 3 for Active Visuo-Tactile Interactive Robotic Perception for Accurate Object Pose Estimation in Dense Clutter

Figure 4 for Active Visuo-Tactile Interactive Robotic Perception for Accurate Object Pose Estimation in Dense Clutter

Abstract:This work presents a novel active visuo-tactile based framework for robotic systems to accurately estimate pose of objects in dense cluttered environments. The scene representation is derived using a novel declutter graph (DG) which describes the relationship among objects in the scene for decluttering by leveraging semantic segmentation and grasp affordances networks. The graph formulation allows robots to efficiently declutter the workspace by autonomously selecting the next best object to remove and the optimal action (prehensile or non-prehensile) to perform. Furthermore, we propose a novel translation-invariant Quaternion filter (TIQF) for active vision and active tactile based pose estimation. Both active visual and active tactile points are selected by maximizing the expected information gain. We evaluate our proposed framework on a system with two robots coordinating on randomized scenes of dense cluttered objects and perform ablation studies with static vision and active vision based estimation prior and post decluttering as baselines. Our proposed active visuo-tactile interactive perception framework shows upto 36% improvement in pose accuracy compared to the active vision baseline.

* Accepted for publication at IEEE Robotics and Automation Letters and IEEE International Conference on Robotics and Automation (ICRA) 2022

Via

Access Paper or Ask Questions