License: arXiv.org perpetual non-exclusive license
arXiv:2404.05151v1 [cs.RO] 08 Apr 2024

STITCH: Augmented Dexterity for Suture Throws
Including Thread Coordination and Handoffs

Kush Hari1*1{}^{1*}start_FLOATSUPERSCRIPT 1 * end_FLOATSUPERSCRIPT, Hansoul Kim1*1{}^{1*}start_FLOATSUPERSCRIPT 1 * end_FLOATSUPERSCRIPT, Will Panitch1*1{}^{1*}start_FLOATSUPERSCRIPT 1 * end_FLOATSUPERSCRIPT, Kishore Srinivas11{}^{1}start_FLOATSUPERSCRIPT 1 end_FLOATSUPERSCRIPT, Vincent Schorp1,212{}^{1,2}start_FLOATSUPERSCRIPT 1 , 2 end_FLOATSUPERSCRIPT,
Karthik Dharmarajan11{}^{1}start_FLOATSUPERSCRIPT 1 end_FLOATSUPERSCRIPT, Shreya Ganti11{}^{1}start_FLOATSUPERSCRIPT 1 end_FLOATSUPERSCRIPT, Tara Sadjadpour11{}^{1}start_FLOATSUPERSCRIPT 1 end_FLOATSUPERSCRIPT, Ken Goldberg11{}^{1}start_FLOATSUPERSCRIPT 1 end_FLOATSUPERSCRIPT
The AUTOLab at the University of California, Berkeley (automation.berkeley.edu), [email protected]* equal contribution11{}^{1}start_FLOATSUPERSCRIPT 1 end_FLOATSUPERSCRIPT AUTOLab at University of California, Berkeley22{}^{2}start_FLOATSUPERSCRIPT 2 end_FLOATSUPERSCRIPT Autonomous Systems Lab, ETH Zurich
Abstract

We present STITCH: an augmented dexterity pipeline that performs Suture Throws Including Thread Coordination and Handoffs. STITCH iteratively performs needle insertion, thread swee**, needle extraction, suture cinching, needle handover, and needle pose correction with failure recovery policies. We introduce a novel visual 6D needle pose estimation framework using a stereo camera pair and new suturing motion primitives. We compare STITCH to baselines, including a proprioception-only and a policy without visual servoing. In physical experiments across 15 trials, STITCH achieves an average of 2.93 sutures without human intervention and 4.47 sutures with human intervention. See https://sites.google.com/berkeley.edu/stitch for code and supplemental materials.

I Introduction

Surgical-assist robots such as Intuitive’s da Vinci series are used by surgeons to perform over 2 million procedures annually to facilitate minimally-invasive (“keyhole”) surgery to reduce pain, blood loss, scarring, complications, and recovery time. It is now the gold standard for procedures involving the appendix, colon, gallbladder, prostate, and many others. These robots are very sophisticated but every movement is currently controlled by human surgeons because surgery is extremely sensitive to errors – there are a vast number of rare but potentially dangerous edge conditions and the consequences of even a single failure can be fatal. So it may be a very long time before fully autonomous robots are sufficiently safe and reliable for the operating room. However, recent advances in AI are opening the door to augmenting surgeon skills when performing specific subtasks such as suturing, debridement, and resection. Goldberg [1] proposed the term “Augmented Dexterity” to describe systems where surgical subtasks are controlled by the robot under the close supervision of the human surgeon who is ready to take over at a moment’s notice. Augmented Dexterity has potential to elevate good surgeons to the level of the best surgeons, making surgery safer, faster, and more reliable. In this paper we present new results on augmented dexterity for surgical suturing.

Refer to caption
Figure 1: 6 sutures performed by STITCH. Step 1 shows Surgical Needle Insertion, Step 2 shows Needle Extraction, and Step 3 shows Needle Handover with Pose Correction. Detections of the needle endpoints are shown in the photo with the needle tip point shown as the orange circle and the needle swage point shown as the yellow circle. The light green circle represents the estimated needle pose.

We present STITCH, a novel pipeline that performs Suture Throws Including Thread Coordination and Handoffs. STITCH achieves ”level 2” task autonomy as defined by [*]yang2017medical, using novel perception and control techniques. The STITCH perception system combines deep learning, analytical, and sampling-based approaches, and prior knowledge of needle geometry to perform 6D needle pose estimation for closed-loop control. We also incorporate “interactive perception” as proposed by [3] for improving needle pose estimation and correction to increase robustness to uncertainty in perception, control, and physics [3], [4]. Furthermore, we introduce visual-based thread coordination motions such as swee** to reduce the risk of the thread tangling with the needle or itself. STITCH also includes a new approach for automated recovery from motion failures using closed loop visual feedback where we retry needle extraction and handover motions if they are unsuccessful.

This paper makes 3 main contributions:

  • An augmented dexterity pipeline for robot-assisted surgery capable of suture throwing, thread coordination, and needle handoff.

  • A novel visual 6D needle pose estimation framework.

  • Experimental results showing an average of 4.47 successful sutures with human intervention.

Refer to caption
Figure 2: 6D Needle Pose Estimation Module. The needle pose estimation starts with a pair of stereo left and right images. Using RAFT-Stereo, we generate a disparity image from the stereo pair [5]. Furthermore, we segment the needle in the left image with a U-Net to create a needle mask. From there, we apply the needle mask to the disparity image, and create the corresponding needle pointcloud. Using RANSAC, we find a best-fit plane to determine the normal vector of the 3D circle representing the needle (seen in green in the Fitted 3D Circle image). Then, we project all needle inliers from the RANSAC to the plane, and use RANSAC again to find the best fit circle (seen in black in the 3D Circle Fit image with the assumed fixed radius (12 mm for all experiments). Finally, we find the two farthest points on the needle pointcloud to determine the needle endpoints (seen as orange and yellow in the 6D Needle Pose image).

II Related Work

Fully automated robot systems have been approved for hair restoration and external beam radiation [6, 7]. However, all surgical procedures are performed 100% under human surgeon teleoperation [8]. Some research efforts have focused on autonomously performing specific sub-tasks such as debridement [9], vascular shunt insertion [10], and brain tumor resection [11]. In this work, we focus on the suturing task.

II-A Autonomous Suturing

Automating surgical suturing has been shown to be feasible in-vivo using IR markers, and industrial robot arms (like the KUKA LBR Med) both for the open-surgery setting [12] and the minimally invasive surgery setting [8] to perform in-vivo suturing on live pigs. More recent research efforts have been directed at automating surgical suturing using the da Vinci Research Kit (dVRK) [13]. While performing the whole suturing task autonomously remains an open problem [14], many researchers have explored the automation of sub-tasks such as suture placement [15], needle handover [16, 17, 18], needle extraction [19], needle pick-up [20], and knot tying [21]. Though each of these methods individually have shown high success rates (90%), performing these methods sequentially often has much lower success rate due to the product probability intersection of all of these events. In attempts to automate the complete suturing task, some hardware simplifications have been used in the past. [22] demonstrate impressive success rates with only a needle and no thread, avoiding the risk of the robot getting tangled in the thread. [14] use colored needles and a special mount for the gripper which help with needle detection and orientation. We use unmodified surgical needles and suture thread.

II-B Visual Servoing in Surgical Automation

Visual servoing has been explored for surgical automation, with needle [23, 24] and gripper [25] pose estimation using learned keypoint tracking models like DeepLabCut, a method that uses transfer learning to perform keypoint annotation for pose estimation by [26]. Another line of research focuses on learning end-to-end visual servoing policies which implicitly model the object state [27, 28]. [29, 16] have proposed intermittent visual servoing in the context of peg transfer and needle handover, respectively, both of which use a visual servoing policy trained with imitation learning in lieu of classical trajectory optimization where high precision is required. We propose a novel visual needle pose estimation approach using learned image segmentation models as well as known visual features and system dynamics. The estimated object poses are used to automate the surgical suturing sub-tasks of needle insertion, extraction, and handover using analytic control methods.

III Problem Statement

III-A Overview

Given a sequence of suture entry and exit points, perform as many sutures as possible.

III-B Assumptions

We assume known needle shape and diameter, predetermined 3D points for needle insertion and extraction, a calibrated stereo camera pair, and the transformation between the robot and camera coordinate frames. We also assume the wound is raised and orthogonal to the camera as shown in Fig. 1.

III-C Objectives and Evaluation Metrics

We consider two evaluation metrics: number of successful consecutive sutures and completion time. We consider a suture throw to be successful if the robot is able to pass the needle through the wound, perform suture thread cinching (tightening), and return the needle to a neutral position outside the phantom with no tangles of the suture thread.

IV Methods

STITCH achieves augmented dexterity for surgical suturing using novel perception and control methods as shown in Figs. 2 and 3.

Refer to caption
Figure 3: The individual processes embedded within the STITCH motion controller There are 4 parts to the state machine: 1. Needle insertion: (a) The right needle driver moves the needle to the initial insertion point at the proper orientation; (b) The right needle driver inserts the needle into the phantom with combined rotation and translation movements; 2. Swee** and Needle Extraction with suture cinching: (c) The right needle driver follows the +y axis in the robot frame down the center of the wound to “sweep” any thread off the wound; (d) The left needle driver moves 1 centimeter behind the needle endpoint to prepare for extraction; (e) The left gripper grasps the needle and pulls it through until the length of the thread is at a desired β𝛽\betaitalic_β; 3. Needle Handover: (f) The right needle driver moves 1 centimeter behind the needle endpoint to prepare for handover; (g) The right gripper grasps the needle for handover; (h) The left gripper releases the needle; 4. Needle Pose Correction: (i) The right needle driver moves the needle to an optimal needle pose estimation region of the scene; (j) Based on the current pose of the needle, it is rotated such that the normal vector of the needle is aligned with the + y axis in robot frame; (k) The needle is rotated about the +y axis in robot frame so it is at the optimal orientation for the next insertion so the pipeline can be repeated again.

IV-A 6D Needle Pose Estimation Module

The Allied Vision Prosilica GC 1290 cameras capture a stereo pair of images with a resolution of 1280x960 at up to 33fps. With these images, we create a pointcloud of the scene using disparity images from RAFT-Stereo, a deep architecture for rectified two-view stereo that has been empirically shown to be more robust than standard stereopsis techniques and generalizes well to unseen real-world data [5].

To isolate the points relevant to the needle, we train a U-Net [30], an image-based segmentation model, to detect the needle in image space. The training data is labeled using the Labels from UV technique proposed by [31]. The segmentation mask produced by the U-Net is used to isolate the needle from the pointcloud generated by RAFT-Stereo.

Since we also need to accurately estimate the positions of the two endpoints on the needle and the orientation of the needle itself, we propose a novel 6D needle pose estimation module as seen in Figure 2. Based on Wilcox’s work for needle handover (HOUSTON) [16], we propose an algorithm that determines the best-fitting 3D circle. A key distinction between this work and HOUSTON is that HOUSTON obtained a pointcloud from stereo point matching while this work uses RAFT-Stereo [5], a roboust stereopsis neural network. STITCH begins by using RANSAC to estimate a 3D plane equation that fits the needle pointcloud. Then, we calculate the normal vector from the estimated 3D plane to ascertain the orientation of the needle. All inlier needle points are projected onto that plane. To achieve higher accuracy, we then project all inlier points on the plane to the xy-plane where we use RANSAC to estimate a 2D circle equation. Next, we derive the positions of the two endpoints of the needle on the 2D circle by finding the 2 most distant needle inliers in the pointcloud. After that, we determine the 3D circle and endpoint positions by back-projecting onto the aforementioned 3D plane estimate.

IV-B Augmented Dexterity Suturing Motion Controller

The surgical suturing task is composed of several distinct motions. The motion controller directs the robot motions in every state, as well as the state transitions and when they ought to be performed. Each suture is composed of the sequence of needle insertion, a thread swee** motion to clear excess thread from the suture site, needle extraction with suture cinching, needle handover, needle pose correction, and failure recovery, as shown in Fig. 3. The needle motion inside the tissue is designed to reduce tissue damage while remaining withing the kinematic bounds of the gripper.

IV-B1 Needle Insertion

The robot inserts the needle into the tissue phantom with a circular twisting motion at the specified insertion point such that the needle tip exits the tissue at the specified extraction point.

The needle is inserted in the tissue phantom in two steps to minimize strain on the tissue. Both steps are performed open loop as a large part of the needle is occluded by the tissue and the needle driver during insertion. First, the tip of the needle is pushed into the tissue phantom corresponding to the vector between the given insertion and extraction points (Fig. 3(a)). This allows for the straight tip of the needle to penetrate the tissue and exit at the needle extraction point. Second, the needle is rotated into the tissue by performing a 45superscript4545^{\circ}45 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT rotation around the estimated circle normal vector (Fig. 3(b)). This motion follows the curvature of the needle as it passes through the tissue, so that it passes along the hole made by the needle tip and does not stretch or tear the tissue.

IV-B2 Thread Swee**

The thread swee** motion is designed to prevent failure during the needle extraction step (Fig. 3(c)). Failures occurring during the needle extraction process can be broadly classified into two cases: (1) the thread passes in front of the needle, occluding it and leading to detection errors; (2) when both the needle and the thread are accidentally grasped together during the re-gras** process, in spite of an accurate needle pose estimation. Pushing the thread out of the re-gras** site through a swee** motion before the extraction step can prevent the failures described above.

We use the thread modelling method from [32] to track the thread for the swee** motion. In this work, a U-Net is trained to output a segmentation mask of the thread. Running an analytic tracer on that segmentation mask for the left and right images, a 3D NURBS spline is fitted for the thread.

The swee** motion involves opening the gripper wide and passing it over the wound towards the camera so that the thread is caught and pushed ahead of the needle. With the thread in front of the needle, entanglement risks during extraction are significantly reduced.

IV-B3 Needle Extraction

The purpose of the needle extraction motion is to remove the needle from the tissue phantom and to pull the thread taut to close the wound. To successfully perform multiple sutures, it is necessary to pull the thread to an appropriate length prior to the next insertion motion. The needle extraction motion is performed in a closed-loop fashion using the needle pose estimator. The frequency of needle pose estimator for feedback was measured as 0.67 Hz. The re-grasp point and the axis around which to rotate and remove the needle are also determined through visual estimation.

The needle extraction motion is composed of two main components. At the start of the routine, the left needle driver is positioned at the upper left edge of the workspace, above the tissue phantom. The point closer to the left needle driver among the two needle endpoints, determined as described in  IV-A, is defined as the re-grasp point. The left needle driver is moved to a point offset by 1 cm horizontally from the defined re-grasp point (Fig. 3(d). Then, the left needle driver is moved in the direction of the re-grasp point by 1.5 cm, and the gripper jaws are closed.

Once the needle has been grasped by the left needle driver, it is rotated by 80superscript8080^{\circ}80 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT about the estimated needle axis to extract the needle and to minimize tissue damage (similar to the “rotate in”) motion in the needle insertion routine. At the end of this action, only a small bit of the needle remains inside the tissue so that the needle can be fully extracted by a linear motion (Fig. 3(e)).

IV-B4 Suture Cinching

To ensure each suture is properly tensioned, suture cinching (tightening) is performed after needle extraction. The length of the thread that needs to be pulled for suture cinching in the extraction motion can be defined as β𝛽\betaitalic_β = ldessubscript𝑙𝑑𝑒𝑠l_{des}italic_l start_POSTSUBSCRIPT italic_d italic_e italic_s end_POSTSUBSCRIPT - (i𝑖iitalic_i - 1)×\times×leachsubscript𝑙𝑒𝑎𝑐l_{each}italic_l start_POSTSUBSCRIPT italic_e italic_a italic_c italic_h end_POSTSUBSCRIPT depending on the number of sutures (Fig. 3(e)), where ldessubscript𝑙𝑑𝑒𝑠l_{des}italic_l start_POSTSUBSCRIPT italic_d italic_e italic_s end_POSTSUBSCRIPT represents the desired thread length for the final suture, i𝑖iitalic_i represents the number of sutures, and leachsubscript𝑙𝑒𝑎𝑐l_{each}italic_l start_POSTSUBSCRIPT italic_e italic_a italic_c italic_h end_POSTSUBSCRIPT is the length of thread used for a single suture.

IV-B5 Needle Handover

The goal of the needle handover motion is to transfer the extracted needle from the left needle driver to the right needle driver in preparation for the next insertion. Once the pose of the needle is estimated, we define the re-grasp point in handover as the endpoint further from the left needle driver, similar to the process in IV-B3. The right needle driver opens its jaws and moves to a point offset by 1 cm horizontally from the defined re-grasp point (Fig. 3(f)). After moving by the offset in the direction of the re-grasp point, the right needle driver closes to grasp the needle, (Fig. 3(g)) and the left needle driver opens to release it (Fig. 3(h)). The entire sequence is done in a closed-loop fashion with visual feedback tracking the pose of the needle. If the right needle driver moves into position to grasp the needle and the detected orientation of the needle remains unchanged, we pull the driver back, add a small (<<< 0.5cm) random horizontal offset, and reattempt the grasp, up to 5 additional times before declaring the handover a failure.

IV-B6 Needle Pose Correction

Since the pose of needle right before insertion significantly affects the insertion, we use interactive perception to improve needle tracking so it can be actuated to the ideal pose for the subsequent insertion. We adjust the needle for the insertion using a needle pose correction algorithm, which consists of three steps. First, the needle is moved to an optimal detection location at the lower, back, right corner of the workspace for RAFT-Stereo reconstruction as shown in Fig. 3(i). The lower back right corner was chosen empirically for consistent needle pose estimates. An ablation study confirmed this choice, showing 90% success rate in this corner for left gripper-held needles and 70% for right gripper-held needles, compared to 50% success in random positions. Identifying both endpoints is crucial for insertion, so obtaining pose estimates in the chosen corner maximized successful suture throws. In the next step, we sample 10 measurements of the needle normal vector, and rotate the gripper such that the normal vector of the needle is identical to the positive y axis in robot frame (Fig. 3(j)). This step constrains the discrepancy in orientation to be about the positive y-axis, which makes it more repeatable. Because the needle configuration during handover is relatively similar for each throw, the final step is a 90 degree rotation about the y-axis as seen in Fig. 3k. At this point, the STITCH pipeline is ready for the next suture throw insertion.

TABLE I: Success Metric Comparison across Ablations for 15 Trials.
Mean Sutures Single-Suture Three Throw Full Wound Mean Time Error types Mean Sutures
to Failure Success Rate Success Rate Success Rate per Suture I E H T to Intervention
Sensing Only 1.40 51.6% 0.0% 0.0% 106.8 sec 9 6 0 0
Thread Handling 1.80 55.9% 20.0% 0.0% 117.9 sec 6 5 2 2
STITCH 2.93 69.4% 73.3% 0.0% 159.3 sec 8 5 0 2
STITCH + Human 4.47 83.3% 100.0% 20.0% 141.9 sec 16 10 0 2 2.25

IV-B7 Motion Failure Recovery

The STITCH algorithm includes recovery mechanisms for both extraction motion failures and handover motion failures. For the extraction motion, the algorithm compares the positions of the needle endpoint before and after extraction. If the difference is below 2 cm, the extraction motion is retried up to 5 times. For the handover motion, the algorithm compares the normal vector of the needle before and after moving the right needle driver when both needle drivers have grasped the needle. If there is a nonzero change of the normal vector, then the handover motion is retried up to 5 times.

V Physical Experiments

V-A Experimental Setup

The experimental setup consists of a bi-manual dVRK robot [13], a soft tissue phantom consisting of a single wound from a 3-Dmed directional suture pad featuring parallel linear wounds, and a fixed RGB stereo camera pair. For our experiments, the stereo cameras are a pair of Allied Vision Prosilica GC 1290 industrial cameras, which each output images of size 1280x960 at 33 frames per second. The Edmund Optics lenses mounted on the sensors allow for precise adjustments to the focus and aperture based on our workspace depth and illumination. The cameras are angled at the phantom such that the full workspace of the robot is captured in the field of view. We define the workspace using a Cartesian (x,y,z)𝑥𝑦𝑧(x,y,z)( italic_x , italic_y , italic_z ) coordinate system. We use violet PolysorbTMsuperscriptPolysorbTM\text{Polysorb}^{\text{TM}}Polysorb start_POSTSUPERSCRIPT TM end_POSTSUPERSCRIPT surgical suture thread from Covidien. The threads are of variable length between 10 and 40 cm, with 2-0 USP Size (0.35-0.399 mm in diameter) and are attached to a GS-21 half-circular surgical needle with a radius of 12 mm.

V-B Baselines

We evaluate the full STITCH algorithm and two baselines, multi-throw suturing without thread manipulation or needle pose correction “Sensing Only”) and multi-throw suturing thread management without needle pose correction (“Thread Management”). The sensing-only baseline performs the needle insertion, extraction, and handover steps, omitting the suture cinching, thread swee**, and needle pose correction motions. The thread management baseline performs insertion, extraction (with thread swee**), cinching, and handover while omitting the needle pose correction step. We also evaluate a human-supervised setting (“STITCH + Human”), in which the robot can request intervention up to 2 times from a supervising surgeon. At these points, the human supervisor performs a single recovery motion to return the workspace to a safe configuration, and then immediately returns control to the robot.

V-C Trial Specifications and Error Classification

Each trial produces n successive running suture throws. A suture throw is considered complete when the needle has successfully been inserted and extracted through the raised edges of the wound and the thread has been sufficiently tensioned to close the wound between the insertion and extraction points. A trial ends when the robot either enters an unrecoverable state (such as a dropped needle or tangled thread preventing further sutures) or successfully closes the wound by throwing 6 consecutive sutures. For each experimental baseline, we perform 15 trials of 1–6 throws each, and report the following metrics in Table I:

  • Mean sutures to failure: the average number of successfully completed suture throws before the first unrecoverable error is encountered.

  • Single-suture success rate: the percentage of successful suture throws out of the total number of suture throw attempts.

  • Three throw success rate: the percentage of trials which terminate after at least three successful suture throws.

  • Full wound success rate: the percentage of trials which terminate in wound closure.

  • Mean time per suture: the average time elapsed per suture throw.

  • Error Types: the number of insertion (I), extraction (E), handover (H), and thread management (T) errors encountered.

  • Mean sutures to intervention: The average number of autonomously completed suture throws between human intervention requests.

We report trial-ending errors according to the portion of the pipeline during which the failure occurred. We use the following schema:

  • Insertion errors occur when the robot fails to insert the needle through the raised edge of the wound, or the needle enters a non-wound region of the phantom, or the needle exits the wound through the top or bottom of the phantom.

  • Extraction errors occur when the needle remains in the wound after extraction.

  • Handover errors occur when the robot drops the needle during handover or enters an unrecoverable configuration during the handover process.

  • Thread management errors occur when the robot fails to properly cinch the suture thread to close the wound or becomes dangerously entangled in the thread.

V-D Experimental Results

Table I and Fig. 4 report experimental results. Over 15 trials, STITCH achieves an average single-suture success rate of 69.39% and a mean sutures-to-failure of 2.93. When allowing the robot to request human intervention, the robot achieves a single-suture success rate of 83.33%, and a mean sutures-to-failure of 4.47. Also, the perception algorithm proposed in this study was confirmed to accurately track the 6D pose of the surgical needle 42% of the time throughout the experimental trials.

Refer to caption
Figure 4: Histogram of the Number of Sutures to Failure by Method. The results in the histogram shown above is for 15 trials per each of the four methods.

VI Limitations & Future Work

There are 2 primary limitations of the STITCH pipeline.

1. The most common failure case with the STITCH pipeline involves correcting the needle orientation so it is optimal for needle insertion for the subsequent insertion. This stems from high variance needle pointclouds from RAFT-Stereo. RAFT-Stereo struggles in high disparity areas, thus we move the needle to lower, back, right corner of the workspace as seen in Fig. 3(i). At this position, the perception algorithm can reliably detect the normal vector of the needle, allowing for the first rotation correction step to align the normal vector with the positive y axis of the robot frame as seen in Fig. 3(j). However, the needle endpoint detection at this position has a higher variance than desired. This means we cannot perform the final rotation step based on needle endpoint feedback. In the future, we will investigate alternate stereo methods better tuned for small, reflective objects. With a more reliable stereopsis method and improved needle endpoint tracking, we hope to mitigate the insertion failure case by using the needle endpoint estimates to servo the needle to the optimal insertion orientation.

2. Another common failure case with the STITCH pipeline involves thread tensioning issues. Even with the thread-swee** move, two potential thread failures are still present:

  1. a.

    The swee** move sometimes misses the thread and the extraction move will grab the thread with the endpoint causing system failure.

  2. b.

    The later sutures run out of thread and fail because not enough thread was pulled during the initial suture throw extraction.

In future work, we plan to introduce a suture cinching before handover where the left gripper holds the needle at the handover position such that a portion of the thread would be vertical. The other gripper would then move horizontally to push the vertical thread component to pull out additional thread and clear it from the gras** workspace.

To perform suturing in in-vivo experiments, challenges like tissue deformation, visual changes due to blood, and viewing angle variations must be addressed.

VII Conclusion

We present STITCH, a continuous suturing pipeline under the Augmented Dexterity framework [1] where surgical subtasks are performed autonomously under close human supervision. The pipeline includes a 6D needle pose estimation module using combined learned and analytical methods, and new motion primitives. Over 15 trials, we demonstrated that STITCH can autonomously perform an average of 2.93 suture throws without human intervention and 4.47 suture throws with human intervention.

Acknowledgement

This research was performed at the AUTOLAB at UC Berkeley in affiliation with the Berkeley AI Research (BAIR) Lab, and the CITRIS “People and Robots” (CPAR) Initiative. We thank Gary Guthart for his support. The da Vinci Research Kit is supported by the National Science Foundation, via the National Robotics Initiative (NRI), as part of the collaborative research project “Software Framework for Research in Semi-Autonomous Teleoperation” between The Johns Hopkins University (IIS1637789), Worcester Polytechnic Institute (IIS 1637759), and the University of Washington (IIS 1637444).)

References

  • [1] K. Goldberg “Augmented Dexterity: How Robots Can Enhance Surgeon Dexterity” Under Review (Summer 2023) In Science Robotics
  • [2] Guang-Zhong Yang et al. “Medical robotics—Regulatory, ethical, and legal considerations for increasing levels of autonomy” In Science Robotics 2.4 American Association for the Advancement of Science, 2017, pp. eaam8638
  • [3] Jeannette Bohg et al. “Interactive Perception: Leveraging Action in Perception and Perception in Action” In IEEE Transactions on Robotics 33.6, 2017, pp. 1273–1291 DOI: 10.1109/TRO.2017.2721939
  • [4] Tara Sadjadpour et al. “MANIP: A Modular Architecture for Integrating Interactive Perception into Long-Horizon Robot Manipulation” In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (Under Review), 2024
  • [5] Lahav Lipson, Zachary Teed and Jia Deng “RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching” In International Conference on 3D Vision (3DV), 2021
  • [6] Paul T. Rose and Bernard Nusbaum “Robotic Hair Restoration” Advances in Cosmetic Dermatology In Dermatologic Clinics 32.1, 2014, pp. 97–107 DOI: https://doi.org/10.1016/j.det.2013.09.008
  • [7] W Kilby et al. “The CyberKnife® robotic radiosurgery system in 2010” In Technology in cancer research & treatment 9.5 SAGE Publications Sage CA: Los Angeles, CA, 2010, pp. 433–452
  • [8] H. Saeidi et al. “Autonomous robotic laparoscopic surgery for intestinal anastomosis” In Science Robotics 7, 2022, pp. 1–14 DOI: 10.1126/scirobotics.abj2908
  • [9] Adithyavairavan Murali et al. “Learning by observation for surgical subtasks: Multilateral cutting of 3d viscoelastic and 2d orthotropic tissue phantoms” In 2015 IEEE International Conference on Robotics and Automation (ICRA), 2015, pp. 1202–1209 IEEE
  • [10] Karthik Dharmarajan et al. “Automating Vascular Shunt Insertion with the dVRK Surgical Robot” In 2023 IEEE International Conference on Robotics and Automation (ICRA), 2023, pp. 6781–6788 DOI: 10.1109/ICRA48891.2023.10160966
  • [11] Danying Hu et al. “Semi-autonomous image-guided brain tumour resection using an integrated robotic system: A bench-top study” In The International Journal of Medical Robotics and Computer Assisted Surgery 14.1 Wiley Online Library, 2018, pp. e1872
  • [12] Azad Shademan et al. “Supervised autonomous robotic soft tissue surgery” In Science Translational Medicine 8.337, 2016, pp. 337ra64–337ra64
  • [13] Peter Kazanzides et al. “An open-source research kit for the da Vinci® Surgical System” In 2014 IEEE international conference on robotics and automation (ICRA), 2014, pp. 6434–6439 IEEE
  • [14] Siddarth Sen et al. “Automating multi-throw multilateral surgical suturing with a mechanical needle guide and sequential convex optimization” In 2016 IEEE international conference on robotics and automation (ICRA), 2016, pp. 4178–4185 IEEE
  • [15] Varun Kamat et al. “Automating 2D Suture Placement” In 2023 IEEE 19th International Conference on Automation Science and Engineering (CASE), 2023 IEEE
  • [16] Albert Wilcox et al. “Learning to localize, grasp, and hand over unmodified surgical needles” In 2022 International Conference on Robotics and Automation (ICRA), 2022, pp. 9637–9643 IEEE
  • [17] Zih-Yun Chiu et al. “Bimanual Regras** for Suture Needles using Reinforcement Learning for Rapid Motion Planning” In 2021 IEEE International Conference on Robotics and Automation (ICRA), 2021, pp. 7737–7743 DOI: 10.1109/ICRA48506.2021.9561673
  • [18] Vignesh Manoj Varier et al. “Collaborative suturing: A reinforcement learning approach to automate hand-off task in suturing for surgical robots” In 2020 29th IEEE international conference on robot and human interactive communication (RO-MAN), 2020, pp. 1380–1386 IEEE
  • [19] Priya Sundaresan et al. “Automated extraction of surgical needles from tissue phantoms” In 2019 IEEE 15th International Conference on Automation Science and Engineering (CASE), 2019, pp. 170–177 IEEE
  • [20] Claudia D’Ettorre et al. “Automated pick-up of suturing needles for robotic surgical assistance” In 2018 IEEE International Conference on Robotics and Automation (ICRA), 2018, pp. 1370–1377 IEEE
  • [21] Brijen Thananjeyan et al. “Safety augmented value estimation from demonstrations (saved): Safe deep model-based rl for sparse cost robotic tasks” In IEEE Robotics and Automation Letters 5.2 IEEE, 2020, pp. 3612–3619
  • [22] Kim L. Schwaner et al. “Autonomous Bi-Manual Surgical Suturing Based on Skills Learned from Demonstration” In 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, pp. 4017–4024 DOI: 10.1109/IROS51168.2021.9636432
  • [23] Zih-Yun Chiu et al. “Markerless suture needle 6d pose tracking with robust uncertainty estimation for autonomous minimally invasive robotic surgery” In 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022, pp. 5286–5292 IEEE
  • [24] Yiwei Jiang, Haoying Zhou and Gregory S. Fischer “Markerless Suture Needle Tracking From A Robotic Endoscope Based On Deep Learning” In 2023 International Symposium on Medical Robotics (ISMR), 2023, pp. 1–7 DOI: 10.1109/ISMR57123.2023.10130199
  • [25] **gpei Lu, Florian Richter and Michael C Yip “Pose estimation for robot manipulators via keypoint optimization and sim-to-real transfer” In IEEE Robotics and Automation Letters 7.2 IEEE, 2022, pp. 4622–4629
  • [26] Alexander Mathis et al. “DeepLabCut: markerless pose estimation of user-defined body parts with deep learning” In Nature neuroscience 21.9 Nature Publishing Group, 2018, pp. 1281–1289
  • [27] Sergey Levine et al. “Learning hand-eye coordination for robotic gras** with deep learning and large-scale data collection” In The International journal of robotics research 37.4-5 SAGE Publications Sage UK: London, England, 2018, pp. 421–436
  • [28] Dmitry Kalashnikov et al. “Qt-opt: Scalable deep reinforcement learning for vision-based robotic manipulation” In arXiv preprint arXiv:1806.10293, 2018
  • [29] Samuel Paradis et al. “Intermittent visual servoing: Efficiently learning policies robust to instrument changes for high-precision surgical manipulation” In 2021 IEEE International Conference on Robotics and Automation (ICRA), 2021, pp. 7166–7173 IEEE
  • [30] Olaf Ronneberger, Philipp Fischer and Thomas Brox “U-net: Convolutional networks for biomedical image segmentation” In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, 2015, pp. 234–241 Springer
  • [31] Brijen Thananjeyan et al. “All You Need is LUV: Unsupervised Collection of Labeled Images Using UV-Fluorescent Markings” In 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022, pp. 3241–3248 IEEE
  • [32] Vincent Schorp et al. “Self-Supervised Learning for Interactive Perception of Surgical Thread for Autonomous Suture Tail-Shortening” In 2023 IEEE 19th International Conference on Automation Science and Engineering (CASE), 2023 IEEE