Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:ST(OR)2: Spatio-Temporal Object Level Reasoning for Activity Recognition in the Operating Room

Dec 19, 2023

Idris Hamoud, Muhammad Abdullah Jamal, Vinkle Srivastav, Didier Mutter, Nicolas Padoy, Omid Mohareri

Figure 1 for ST(OR)2: Spatio-Temporal Object Level Reasoning for Activity Recognition in the Operating Room

Figure 2 for ST(OR)2: Spatio-Temporal Object Level Reasoning for Activity Recognition in the Operating Room

Figure 3 for ST(OR)2: Spatio-Temporal Object Level Reasoning for Activity Recognition in the Operating Room

Figure 4 for ST(OR)2: Spatio-Temporal Object Level Reasoning for Activity Recognition in the Operating Room

Share this with someone who'll enjoy it:

Abstract:Surgical robotics holds much promise for improving patient safety and clinician experience in the Operating Room (OR). However, it also comes with new challenges, requiring strong team coordination and effective OR management. Automatic detection of surgical activities is a key requirement for developing AI-based intelligent tools to tackle these challenges. The current state-of-the-art surgical activity recognition methods however operate on image-based representations and depend on large-scale labeled datasets whose collection is time-consuming and resource-expensive. This work proposes a new sample-efficient and object-based approach for surgical activity recognition in the OR. Our method focuses on the geometric arrangements between clinicians and surgical devices, thus utilizing the significant object interaction dynamics in the OR. We conduct experiments in a low-data regime study for long video activity recognition. We also benchmark our method againstother object-centric approaches on clip-level action classification and show superior performance.

View paper on

Share this with someone who'll enjoy it:

Title:ST(OR)2: Spatio-Temporal Object Level Reasoning for Activity Recognition in the Operating Room

Paper and Code