Abstract:Endovascular guidewire manipulation is essential for minimally-invasive clinical applications (Percutaneous Coronary Intervention (PCI), Mechanical thrombectomy techniques for acute ischemic stroke (AIS), or Transjugular intrahepatic portosystemic shunt (TIPS)). All procedures commonly require 3D vessel geometries from 3D CTA (Computed Tomography Angiography) images. During these procedures, the clinician generally places a guiding catheter in the ostium of the relevant vessel and then manipulates a wire through the catheter and across the blockage. The clinician only uses X-ray fluoroscopy intermittently to visualize and guide the catheter, guidewire, and other devices. However, clinicians still passively control guidewires/catheters by relying on limited indirect observation (i.e., 2D partial view of devices, and intermittent updates due to radiation limit) from X-ray fluoroscopy. Modeling and controlling the guidewire manipulation in coronary vessels remains challenging because of the complicated interaction between guidewire motions with different physical properties (i.e., loads, coating) and vessel geometries with lumen conditions resulting in a highly non-linear system. This paper introduces a scalable learning pipeline to train AI-based agent models toward automated endovascular predictive device controls. First, we create a scalable environment by pre-processing 3D CTA images, providing patient-specific 3D vessel geometry and the centerline of the coronary. Next, we apply a large quantity of randomly generated motion sequences from the proximal end to generate wire states associated with each environment using a physics-based device simulator. Then, we reformulate the control problem to a sequence-to-sequence learning problem, in which we use a Transformer-based model, trained to handle non-linear sequential forward/inverse transition functions.