Manual labeling of gestures in robot-assisted surgery is labor-intensive, error-prone, and requires expertise or training. We propose a method for automated and explainable generation of gesture transcripts that leverages the abundance of data available for image segmentation. Surgical context is first detected from segmentation masks by examining the distances and intersections between the tools and objects. Next, context labels are translated into gesture transcripts using a knowledge-based Finite State Machine (FSM) model and a data-driven Long Short-Term Memory (LSTM) model. We evaluate the performance of each stage of our method by comparing the results with the ground-truth segmentation masks, the consensus context labels, and the gesture labels in the JIGSAWS dataset. Our results show that our segmentation models achieve state-of-the-art performance in recognizing the needle and thread in Suturing, and that we can automatically detect important surgical states (e.g., contact between graspers and objects in Suturing) with high agreement with crowd-sourced labels. We also find that the FSM models are more robust to poor segmentation and labeling performance than the LSTMs. Our proposed method can significantly shorten the gesture labeling process (by a factor of ~2.8).
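
As an illustration of the context-detection step, the following is a minimal sketch of how a per-frame "contact" state between a grasper and an object (e.g., needle or thread) could be derived from binary segmentation masks using mask intersection and centroid distance. The function names, the pixel-distance threshold, and the toy masks are hypothetical stand-ins, not the paper's actual implementation.

```python
import numpy as np


def masks_intersect(mask_a: np.ndarray, mask_b: np.ndarray) -> bool:
    """True if two binary segmentation masks overlap in at least one pixel."""
    return bool(np.logical_and(mask_a, mask_b).any())


def centroid_distance(mask_a: np.ndarray, mask_b: np.ndarray) -> float:
    """Euclidean distance (in pixels) between the centroids of two binary masks."""
    if not mask_a.any() or not mask_b.any():
        return float("inf")  # an object absent from the frame cannot be near another
    centroid_a = np.argwhere(mask_a).mean(axis=0)
    centroid_b = np.argwhere(mask_b).mean(axis=0)
    return float(np.linalg.norm(centroid_a - centroid_b))


def grasper_object_contact(grasper_mask: np.ndarray,
                           object_mask: np.ndarray,
                           dist_threshold_px: float = 15.0) -> bool:
    """Flag a 'contact' state when the masks intersect or their centroids fall
    within a pixel-distance threshold (threshold value is illustrative only)."""
    return (masks_intersect(grasper_mask, object_mask)
            or centroid_distance(grasper_mask, object_mask) < dist_threshold_px)


if __name__ == "__main__":
    # Toy 2D binary masks standing in for per-frame segmentation output.
    grasper = np.zeros((100, 100), dtype=bool)
    needle = np.zeros((100, 100), dtype=bool)
    grasper[40:50, 40:50] = True
    needle[48:55, 48:60] = True  # overlaps the grasper region
    print(grasper_object_contact(grasper, needle))  # -> True
```

In the full pipeline, such per-frame context states would then be fed to the FSM or LSTM model to produce the gesture transcript.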