Experimental evaluation of accuracy and efficiency of two control strategies for a novel foot commanded robotic laparoscope holder with surgeons

Yang, Yan-Jun; Vadivelu, Arvind Kumar N; Hepworth, Jessica; Zeng, Yongpeng; Pilgrim, Charles H. C.; Kulic, Dana; Abdi, Elahe

doi:10.1038/s41598-024-59338-3

Article
Open access
Published: 23 April 2024

Yan-Jun Yang¹,
Arvind Kumar N Vadivelu²,
Jessica Hepworth¹,
Yongpeng Zeng¹,
Charles H. C. Pilgrim^3,4,5,
Dana Kulic¹ &
…
Elahe Abdi¹

Scientific Reports volume 14, Article number: 9264 (2024)

Abstract

The implementation of a laparoscope-holding robot in minimally invasive surgery enhances the efficiency and safety of the operation. However, the extra robot control task can increase the cognitive load on surgeons. A suitable interface may simplify the control task and reduce the surgeon load. Foot interfaces are commonly used for commanding laparoscope-holding robots, with two control strategies available: decoupled control permits only one Cartesian axis actuation, known as decoupled commands; hybrid control allows for both decoupled commands and multiple axes actuation, known as coupled commands. This paper aims to determine the optimal control strategy for foot interfaces by investigating two common assumptions in the literature: (1) Decoupled control is believed to result in better predictability of the final laparoscopic view orientation, and (2) Hybrid control has the efficiency advantage in laparoscope control. Our user study with 11 experienced and trainee surgeons shows that decoupled control has better predictability than hybrid control, while both approaches are equally efficient. In addition, using two surgery-like tasks in a simulator, users’ choice of decoupled and coupled commands is analysed based on their level of surgical experience and the nature of the movement. Results show that trainee surgeons tend to issue more commands than the more experienced participants. Single decoupled commands were frequently used in small view adjustments, while a mixture of coupled and decoupled commands was preferred in larger view adjustments. A guideline for foot interface control strategy selection is provided.

Introduction

In conventional minimally invasive surgery (MIS), the surgical site is accessed through small incisions on the body. A laparoscope, operated by a camera handler assistant, is inserted to provide the surgeon with a view of the surgery site. A robotic laparoscope holder is applied in Robot-Assisted Minimally Invasive Surgery (RA-MIS) to replace the camera handler assistant in MIS. RA-MIS avoids issues such as surgeon-assistant communication inefficiency and assistant error, making it more efficient and safer than conventional MIS¹.

The adoption of new technologies challenges the surgeon both cognitively and physically^2,3,4. Performing open surgery has been shown to generate a very high cognitive workload, while MIS has an even higher workload than open surgery since it demands additional cognitive skills, such as mentally correcting the laparoscope’s misorientation between visual information and horizon level^4,5. In MIS, the surgeon’s work is usually divided into a primary hands-on operation, including the routine surgical procedures, and a separate decision-making task⁶. The surgeon’s cognitive resources are considered limited⁶. These resources are allocated to primary and secondary tasks according to the task demands, and if they are not fully consumed, the spare cognitive capacity helps the surgeon maintain better situation awareness and might improve their performance in advanced tasks⁵. However, the high cognitive workload of using new technologies and performing additional tasks has been shown to affect surgical performance and can cause human error^3,7,8. Excessive levels of mental workload may slow down decision-making and information processing^7,9, and cause the surgeon to ignore potential hazards⁶. Therefore, when introducing a new surgical technique, it is highly desirable to reduce the surgeon’s cognitive load, making sure they have enough resources to cover all demands, maintain their primary task performance⁶, and have spare cognitive capacity for dealing with emergencies. In RA-MIS, in addition to the two conventional MIS tasks, the laparoscope control task becomes a source of extra cognitive load, demanding better workload management. Therefore, one of the challenges in RA-MIS is the implementation of a suitable human-robot interface to limit the cognitive load¹⁰. Recently, researchers have presented new algorithms for foot interfaces to tackle this challenge^11,12,13.

The foot interface (FI) is commonly used in the laparoscope control task^14,15,16. Apart from appropriate mechanical design, applying a suitable control strategy can reduce the surgeon’s workload. However, existing FI research mostly concentrates on delivering new mechanical designs with less focus on control. In this paper, we compare the two most widely adopted control strategies for mapping from FI to robot motions in this type of application: decoupled control and hybrid control. One main goal of this study is to look into a fundamental but key question that has not been investigated before: which is the best approach for the laparoscope manipulation task?

An interface needs to control at least four Degrees of Freedom (DoF) of a laparoscope to provide suitable focus and orientation of the target site view. Decoupled control actuates a single axis at a time, known as decoupled commands. Hybrid control allows separate actuation of each DoF as well as simultaneous actuation of multiple DoF, known as coupled commands. When proposing each of these designs, designers commonly make the following two assumptions:

Decoupled control makes predicting the final view orientation of the laparoscope tip easier for the operator^14,17,18,19.
Hybrid control is more efficient compared to decoupled control^20,21,22.

To the best of the authors’ knowledge, the above assumptions underlying the choice of control configuration have not been tested. This paper is the first study to validate the efficiency and predictability assumptions of the decoupled and hybrid approaches using a pure laparoscope manipulation task. A FI that is independent of most mechanical constraints was used for a fair comparison between the two control strategies. Eleven experienced surgeons and surgeon trainees were recruited, so the results and feedback were from the actual end users.

In addition, as another uninvestigated and related topic, this study explored how surgeons select and combine coupled and/or decoupled commands in surgery-like tasks considering the task requirement, different task phases and individual experience. This is referred to as the interface usage pattern. In this paper, the interface usage pattern and safety in two bi-manual surgery-like tasks are analysed.

The rest of the paper is organised as follows: First, a system overview is provided, presenting the mechanical constraint-free FI, and the two studied control approaches. Then, the experimental platform setup and protocol are introduced, followed by the laparoscope control performance assessment criteria, FI usage pattern and safety evaluation methods. In the results section, the validation of the two control strategies’ assumptions is first described, followed by the usage pattern and safety analysis. The last two sections are the discussion and conclusion.

System overview

The RA-MIS simulation layout (Fig. 1a) replicates the setup used in the operating room. The endoscope is inserted from the surgeon’s side, and both the endoscopic view and the FI overlook view are displayed on a monitor placed in front of the participants. The surgery simulator is a 300\(\times\)220\(\times\)100 mm\(^3\) open box covered by a board, so participants do not have a direct view of the operating site and must rely on the laparoscope view. An LED light is attached beneath the cover to provide sufficient and stable lighting. Two L-bracket holders with 6 mm hole openings (Fig. 1b) near the simulator are used to simulate trocar sleeves.

Robotic laparoscope holder

The foot-actuated robotic laparoscope holder is composed of an ABB IRB 14000 YuMi robot and a 720P DEPSTECH endoscope²³ (Fig. 1a). A laparoscope needs at least four DoF (Fig. 1c): pitch, yaw, insertion/withdrawal, and roll, to give the surgeon a desired angle of view.

Pitch and yaw are used to adjust the translational movement of the laparoscopic view. Insertion/withdrawal relates to the view’s zoom factor, and roll controls the rotation of the image about an axis passing through the centre of the image. The laparoscope pitch motion is with respect to the local laparoscope frame (Fig. 1c), and the yaw motion is with respect to the predefined world frame Z-axis. Surgeons usually prefer the level of the laparoscopic view parallel to the ground level to ensure a comfortable orientation²⁴. So adjusting roll is mostly needed when the surgeon corrects the laparoscopic view twist caused by the manual operation of the camera assistant. However, the level of the view is always parallel to the horizontal plane as the rotation of the laparoscope is constrained in the controller. Thus, roll is temporarily disabled in the system to reduce the user’s mental effort²⁴.

The controllable laparoscope pitch angle ranges from -10\(^{\circ }\) to 20\(^{\circ }\), and the controllable yaw angle ranges from -30\(^{\circ }\) to 30\(^{\circ }\). The laparoscope insertion length is limited to 6 cm by the kinematic limitations of the YuMi robot. When the robot reaches the virtual workspace boundary, the control system stops the motion and displays a screen warning.

Foot interface

Recently developed FIs have unique hardware design and/or algorithm optimization^14,15,16,25 to ensure the dedicated control system runs smoothly and resists any noise or unintended commands. However, to compare the efficiency of the two control strategies and their ease of use, an intuitive and ergonomic FI independent of any specific control design is required.

Foot commands

To match the foot command and motion perceived from the laparoscopic view, moving the foot forward and backward controls pitch and moving the foot left and right maps to yaw control. Insertion/withdrawal is decided by the ankle lifting angle with respect to the reference plane initialised at the beginning of the experiment. The foot translation and ankle lifting are easier to perform and remember compared to a kick, shake, or shape trace²⁶. Thus, the system has six basic foot-actions (Fig. 2d) for 3 DoF.

Input sensing

The FI (Fig. 2a) combines an environmental sensor (two 3D-printed hemisphere markers with a radius of 1.5 cm and a 1080P Logitech C922 Pro web camera) and a wearable sensor (a Cometa WaveTrack IMU²⁷) to detect the foot position/rotation and the ankle dorsiflexion/plantarflexion angle. The combination of wearable and external sensing allows the user to move their foot freely, naturally, and comfortably without any mechanical constraints²⁸. The environmental sensor detects the foot position and the rotation about the axis perpendicular to the ground. The camera attached to the table (Fig. 1a) tracks the two markers attached to the user’s foot at the tip and instep. The tip marker is tracked continuously to represent the foot position. At the same time, the instep marker is used with the tip marker to calculate the ankle rotation angle. The distance between the two markers is also registered when the experiment starts and serves as the reference for filtering out tracking errors. The IMU sensor is attached to the user’s foot to detect the ankle pitch angle. The reference plane is initialised by averaging the user’s static standing data for two seconds at the beginning of the experiment. The IMU error ranges from -0.5\(^{\circ }\) to 0.8\(^{\circ }\).
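The marker-based part of this sensing can be sketched as follows. This is a minimal illustration rather than the authors' implementation: the function names, the image-plane coordinate convention and the 15% distance tolerance are assumptions introduced here.

```python
import math


def foot_rotation_deg(tip, instep):
    """Rotation of the instep->tip vector about the axis normal to the ground.

    `tip` and `instep` are (x, y) marker centres in image pixels (assumed frame).
    """
    dx = tip[0] - instep[0]
    dy = tip[1] - instep[1]
    return math.degrees(math.atan2(dy, dx))


def marker_distance_ok(tip, instep, ref_dist, tol=0.15):
    """Reject a frame whose marker spacing deviates from the registered reference.

    `tol` is a hypothetical relative tolerance; the paper does not state one.
    """
    dist = math.hypot(tip[0] - instep[0], tip[1] - instep[1])
    return abs(dist - ref_dist) <= tol * ref_dist


def reference_pitch(static_samples_deg):
    """Average the static standing IMU pitch samples to initialise the reference plane."""
    return sum(static_samples_deg) / len(static_samples_deg)
```

A frame that fails `marker_distance_ok` would simply be skipped, so a mis-detected marker does not turn into a spurious foot command.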

Decoupled control and Hybrid control

Decoupled control requires the surgeon to control a single DoF at a time. In contrast, hybrid control allows both single DoF commands and simultaneous multiple DoF commands. The number of available commands therefore increases: in addition to the basic actions, hybrid control offers three 2 DoF combinations (Pitch & Insertion, Yaw & Insertion, Pitch & Yaw) and one 3 DoF combination.

Foot commands are continuously tracked in a pre-defined region: The FI mapping area (Fig. 2b,c). It comprises an inner rest zone and an outer activation region. The activation region is divided into eight sub-sections composed of four diagonal regions and four unidirectional regions (Fig. 2c). Detection of the tip marker in the activation region actuates the robot. Inside the rest zone, the user can relax their foot in a natural gesture as any DoF except for insertion/withdrawal will be deactivated. Control will be paused if the tip marker is outside the activation region.

In decoupled control, a pitch command is issued if the tip marker is in the forward and backward unidirectional regions and a yaw command is sent to the robot if the marker is in the left and right unidirectional regions. The remaining four diagonal blocks are then identified as the non-activated regions. The robot performs no action if the tip marker is detected in these blocks.

Hybrid control allows both actuation of single DoF commands and simultaneous multiple DoF commands. In all unidirectional regions, foot commands include single DoF commands and the combination of a single DoF command plus insertion/withdrawal. In all the diagonal regions, foot commands include the dual translation DoF (pitch & yaw) command and the combination of the dual translation DoF command plus insertion/withdrawal. In the rest zone, insertion/withdrawal is still the only DoF that can be activated.
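The region logic described above can be illustrated with a short sketch. It assumes a rest zone centred in the mapping area, regions formed by extending the rest-zone edges, and a foot-centred frame with x to the right and y forward; it also assumes that decoupled control issues insertion/withdrawal only from the rest zone. None of these details are confirmed by the text, and all names are hypothetical.

```python
REST_HALF = 6.0     # half of the 12 cm rest zone
MAP_HALF_X = 17.5   # half of the 35 cm map width (assumed lateral)
MAP_HALF_Y = 20.0   # half of the 40 cm map length (assumed forward/backward)


def classify_region(x, y):
    """Return the region of the mapping area containing the tip marker (cm)."""
    if abs(x) > MAP_HALF_X or abs(y) > MAP_HALF_Y:
        return "outside"
    in_x = abs(x) <= REST_HALF
    in_y = abs(y) <= REST_HALF
    if in_x and in_y:
        return "rest"
    if in_x:
        return "forward" if y > 0 else "backward"
    if in_y:
        return "right" if x > 0 else "left"
    return "diagonal"


def active_dofs(x, y, ankle_active, strategy):
    """Set of DoF actuated for a tip position, ankle state and control strategy."""
    region = classify_region(x, y)
    if region == "outside":
        return set()  # control is paused
    dofs = set()
    if region in ("forward", "backward"):
        dofs.add("pitch")
    elif region in ("left", "right"):
        dofs.add("yaw")
    elif region == "diagonal" and strategy == "hybrid":
        dofs.update(("pitch", "yaw"))
    # Assumption: decoupled control only accepts insertion from the rest zone.
    if ankle_active and (strategy == "hybrid" or region == "rest"):
        dofs.add("insertion")
    return dofs
```

For example, `active_dofs(10, 10, False, "decoupled")` returns an empty set because the diagonal blocks are non-activated in decoupled control, while the same position under `"hybrid"` yields pitch and yaw together.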

Workspace

The workspace size was determined from a size-evaluation pretest with one experienced surgeon and one trainee surgeon. The interface map is 40 cm long and 35 cm wide. It accounts for the torso-to-leg ratio²⁹ and hip flexion/extension motion range²⁶ of the two participants, both of whom could easily reach any target and use the interface for at least 15 minutes without apparent physical fatigue. The rest zone dimensions (length: 12 cm, width: 12 cm) were also chosen based on their suggestions.

Visual feedback is provided to allow the user to quickly and clearly identify the laparoscope’s moving state and locate their foot position based on common usage^30,31. A command indicator window at the top left corner shows the current command and moving speed (Fig. 2a).

Robot control

The robot uses velocity control with a 30 Hz control loop in this paper, and the control script is written in Python 2.7 to use the abb_robot_driver library³². In MIS, the laparoscope and surgical tools are inserted through small incisions into the patient’s body. Therefore, the laparoscope is constrained to move through and rotate about the incision point to prevent the incision port from tearing, known as the Remote Center of Motion (RCM) constraint. A programmable RCM constraint algorithm³³ was implemented to control the endoscope holder end. The original algorithm was modified to take a velocity input instead of a position input. The FI control integrates the RCM constraint, which means the laparoscope’s distal end will always move on the RCM sphere, with the insertion point shown in Fig. 1c as the centre of the sphere and the length from the insertion point to the laparoscope’s end as the radius. Based on suggestions from a camera-holding assistant and a surgeon, the speed range was from 0.02 rad/s to 0.08 rad/s for pitch and yaw, and 0 mm/s to 6 mm/s for insertion/withdrawal. The speed selection is a compromise between the reality of the surgery-like experiment setup and the performance limits of the robot.
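The geometry of the RCM sphere can be made concrete with a small sketch. It is not the cited programmable RCM algorithm: it assumes a plain spherical-coordinate parameterisation in which yaw rotates about the world Z-axis, pitch is the elevation angle, and the insertion length is the sphere radius. Function names are hypothetical.

```python
import math


def tip_position(r, pitch, yaw):
    """Tip position relative to the insertion point (r in metres, angles in rad)."""
    return (r * math.cos(pitch) * math.cos(yaw),
            r * math.cos(pitch) * math.sin(yaw),
            r * math.sin(pitch))


def tip_velocity(r, pitch, yaw, pitch_rate, yaw_rate, insertion_speed):
    """Tip velocity as the time derivative of `tip_position`.

    Pitch and yaw rates give motion tangent to the sphere; the insertion
    speed gives motion along the laparoscope axis (radial direction).
    """
    cp, sp = math.cos(pitch), math.sin(pitch)
    cy, sy = math.cos(yaw), math.sin(yaw)
    radial = (cp * cy, cp * sy, sp)       # unit vector along the laparoscope
    d_pitch = (-sp * cy, -sp * sy, cp)    # derivative of `radial` w.r.t. pitch
    d_yaw = (-cp * sy, cp * cy, 0.0)      # derivative of `radial` w.r.t. yaw
    return tuple(
        insertion_speed * radial[i]
        + r * pitch_rate * d_pitch[i]
        + r * yaw_rate * d_yaw[i]
        for i in range(3)
    )
```

With a zero insertion speed the returned velocity is orthogonal to the tip position vector, which is the sense in which the tip stays on the sphere.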

The outputs from the FI are speeds in three DoF directions. For both control approaches, the laparoscope pitch and yaw velocities increase with the distance between the tip marker and the origin of a local coordinate frame located at the nearest vertex on the edge of the rest zone (Fig. 2b). The DoF velocity is 0 if the tip marker is on the inner green edge (Fig. 2a), and the maximum velocity is issued if the tip marker is on the outer green edge. Insertion/withdrawal is decided by the ankle lifting angle with respect to the reference plane initialised at the beginning of the experiment. The range from \(-\,10^{\circ }\) to 10\(^{\circ }\) is a dead zone that prevents unintended disturbance. Any ankle pitch angle beyond the ±18\(^{\circ }\) threshold actuates the fastest speed. These two values were also determined according to the experienced and the trainee surgeon’s suggestions in the pretest.
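A hedged sketch of this speed mapping is given below. The text states only the end points, so the linear interpolation between them is an assumption, as are the function names and the sign convention (positive ankle angle mapped to positive insertion speed). The 0.02 rad/s lower bound quoted for pitch and yaw is not modelled here.

```python
import math


def translation_speed(dist, max_dist, v_max=0.08):
    """Pitch/yaw speed in rad/s: 0 at the inner edge, v_max at the outer edge.

    `dist` is the tip-marker distance from the inner edge and `max_dist` the
    distance between inner and outer edges (same units). Linear ramp assumed.
    """
    frac = min(max(dist / max_dist, 0.0), 1.0)
    return v_max * frac


def insertion_speed(ankle_deg, dead_zone=10.0, saturation=18.0, v_max=6.0):
    """Insertion/withdrawal speed in mm/s from the ankle pitch angle in degrees."""
    magnitude = abs(ankle_deg)
    if magnitude <= dead_zone:
        return 0.0  # inside the dead zone: no insertion command
    frac = min((magnitude - dead_zone) / (saturation - dead_zone), 1.0)
    return math.copysign(v_max * frac, ankle_deg)
```

Under these assumptions an ankle angle of 14 degrees sits halfway between the dead zone and the saturation threshold and maps to 3 mm/s.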

In hybrid control, if no insertion/withdrawal command is detected, the resultant speed lies on the tangent surface of the RCM sphere. If an insertion speed is detected, the resultant speed is the sum of the translational and insertion speeds. If the user intends to add only an insertion/withdrawal command while the tip marker is already in the outer activation region, the original translational DoF velocity output might change, because the tip marker position can vary when the user adjusts their ankle pitch angle. Therefore, to interpret the user’s intention, two criteria are used: rotation angle change (with a threshold of 5\(^{\circ }\)) about the axis perpendicular to the ground and tip marker moving distance (with a threshold of 6 pixels). If both values are smaller than the thresholds, we assume the user intends only to add an insertion command, so the system adds the insertion/withdrawal to the previous command. Otherwise, both the translation and insertion speeds change.
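The two-threshold intent check can be sketched as below. The thresholds come from the text; the function names and the representation of a command as a (pitch rate, yaw rate) tuple are assumptions made for illustration.

```python
ROTATION_THRESHOLD_DEG = 5.0   # foot rotation change about the vertical axis
MOVE_THRESHOLD_PX = 6.0        # tip-marker displacement in the camera image


def only_adding_insertion(rotation_change_deg, tip_move_px):
    """True when both changes are below threshold, i.e. the foot barely moved."""
    return (abs(rotation_change_deg) < ROTATION_THRESHOLD_DEG
            and tip_move_px < MOVE_THRESHOLD_PX)


def update_command(prev_translation, new_translation, insertion,
                   rotation_change_deg, tip_move_px):
    """Return (translation, insertion) for the next control step.

    If the user is judged to be adding only insertion, the previous
    translational command is kept; otherwise the new one is used.
    """
    if only_adding_insertion(rotation_change_deg, tip_move_px):
        return prev_translation, insertion
    return new_translation, insertion
```

Keeping the previous translational command in the first branch is what prevents small marker shifts caused by the ankle lift from altering pitch and yaw.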

Experiment

We designed and conducted an experiment to test the two assumptions about the predictability and efficiency of the hybrid and decoupled control and to study the FI usage patterns. This project was approved by the Human Ethics Review Committee, Monash University (ID: 29291). The experiment was carried out by strictly following the guidelines of low-risk projects provided by the Human Ethics Review Committee, Monash University and informed consent was obtained from all participating surgeons.

To ensure that the assumptions are validated with the target user population, eleven surgeons (three females) with average age of 38.6 ± 6.1 years were recruited. Six of them are experienced surgeons, and five are still in training. All participants’ dominant foot is the right foot and all of them had experience with MIS, with at least 10 years experience for surgeons and an average of 5 years experience for surgeons in training. Seven participants had previously used FIs (foot pedals) for surgical use. Two experienced surgeons had experience with RA-MIS.

Protocol

The experiment consists of two sets of tasks (Fig. 3a): the target aiming task, where only laparoscope control was required, and the Pick & Place (hereafter PP) and Lead-through (hereafter LT) tasks, where both laparoscope and surgical tool control were required. The target aiming task was performed first using each of the two control strategies to validate the two assumptions. Similar pure laparoscope control tasks are commonly used to test FIs with the proposed control strategy^14,15. Then, in the two surgery-like tasks (PP and LT tasks), participants could freely use decoupled or coupled commands at any time. The two surgery-like tasks, both involving laparoscope-tool coordination skills and modified from MIS training courses³⁴, were used to simulate a real operation and to study the FI usage pattern and safety when participants are simultaneously controlling the laparoscope and performing surgery-like tasks. In all three tasks, the participants were asked to perform the experimental tasks as fast as possible while trying to minimize operational errors.

Participants were asked to select the preferred foot to wear the FI equipment. They were then randomly assigned a control strategy to use first for the target aiming task and the performing sequence of the two surgery-like tasks.

Training: A board with four markers (Fig. 3b) was provided for five minutes of training, following a common protocol in similar research^20,35. The participant was encouraged to explore every foot command and speed level and use the assigned control strategy to approach the markers. After this training phase, the target aiming task started.

Target aiming task: This task involves moving the laparoscope view to point to two targets in a given sequence. The setup consists of five poles, including one central pole, and four others arranged around the central pole, labelled P1, P2, P3 and P4 in a clockwise direction (Fig. 3c). The spherical object on top of the pole is the target. A pair of concentric circles (inner radius: 60 pixels and outer radius: 110 pixels) is drawn on the endoscopic view for this task (Fig. 2e). The outer circle represents the region of interest and the optimal zoom factor is confined within the two circles. Thus, the contour of the target needs to be between the inner and outer circles. Once the target is between the concentric circles, the target outline will light up, and if the target remains in that area for 1 s, a “Move back to centre” or “Move back to corner” indication is displayed on the screen. The start position is right below the centre target. The participant is instructed to guide the laparoscopic view to the centre first, then aim at the next given target, and return to the centre to execute the same procedure for the following target. The time starts when the aim target sequence is shown on the screen and stops when the participant reaches one blue and one red target and aims back at the centre target. The basic task is repeated four times for two different aiming orders: Center-P1-Center-P3-Center and Center-P4-Center-P2-Center.
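The aiming criterion can be sketched as a ring test plus a dwell timer. This sketch assumes the detected target is modelled as a circle in image coordinates and that a valid aim means the target contour encloses the inner circle and is enclosed by the outer circle; the class and function names are hypothetical.

```python
import math

INNER_R = 60.0    # inner circle radius in pixels
OUTER_R = 110.0   # outer circle radius in pixels
DWELL_S = 1.0     # time the target must stay in the ring


def contour_in_ring(cx, cy, radius, view_cx, view_cy):
    """True if the target contour lies between the two concentric circles."""
    d = math.hypot(cx - view_cx, cy - view_cy)
    nearest = radius - d    # closest contour point to the view centre
    farthest = radius + d   # farthest contour point from the view centre
    return nearest >= INNER_R and farthest <= OUTER_R


class DwellTimer:
    """Reports success once the condition has held for `dwell_s` seconds."""

    def __init__(self, dwell_s=DWELL_S):
        self.dwell_s = dwell_s
        self.entered_at = None

    def update(self, inside, t):
        if not inside:
            self.entered_at = None  # leaving the ring resets the timer
            return False
        if self.entered_at is None:
            self.entered_at = t
        return t - self.entered_at >= self.dwell_s
```

In a frame loop, `DwellTimer.update(contour_in_ring(...), timestamp)` would be called once per frame, and a `True` result would trigger the on-screen indication.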

In this task, two poles (P1 & P4) are not visible from the starting pose, and two are close to the start point (P2 & P3). This setup replicates two situations: first, the target is close to the current site and small adjustments to the laparoscope view are needed. Second, the target is far from the laparoscopic centre view and large adjustments are needed, followed by more detailed aiming. After the first target aiming task, the same process was completed with the other control strategy.

Pick & Place task: The experiment setup (Fig. 3d) is similar to the target aiming task. The central pole has three triangular objects arranged in a stack. A surgical grasper and curved forceps are provided as the available tools. The participant is instructed to remove a triangular object from the central pole using one tool, transfer it to the other tool, and place it on one of the surrounding poles, then repeat until all three objects are placed on the surrounding poles. At the centre pole, the participant cannot see P4 and P1 and does not have a complete view of P3 and P2 before zooming out. The participant is required to keep the triangular object in view at all times, which means laparoscope adjustment is necessary during the task. The ready pose requires the participant to bring the tools into the endoscopic view. Time starts when the moving sequence is shown on the screen and finishes when the participant places the last object on the corresponding pole and withdraws the tools out of view. The task is repeated four times with a shorter placement sequence (P1–P2–P3) first and then four times with a longer placement sequence (P1–P4–P2).

Lead-through task: A needle is placed in Gate 1 at the beginning (Fig. 3e). The participant is asked to use the provided surgical tools to hold the needle all the time and thread it through three circular gates (Gate2-Gate3-Gate1). The participant can choose the toolset of two needle holders or one needle holder and one curved forceps based on individual preference. Once the participant is in the ready pose, the time starts when the “start” command is shown on the screen and stops when the participant threads all gates and withdraws the tools out of view. This task is more difficult than the PP task as this task requires frequent movements of the laparoscope. It is repeated four times for the gates shown in Fig. 3e first, and then new gates are set by mirroring Gates 2 and 3 with respect to the red dotted line.

Laparoscope control performance assessment

Four metrics were analysed: completion time, aiming accuracy, moving distance and the number of issued commands to compare the efficiency and predictability of the two control methods in the target aiming task and validate the two assumptions.

Efficiency

Efficiency is “the resources expended in relation to the accuracy and completeness with which users achieve the goal”³⁶. In the target aiming task, the participant is expected to finish the task with relatively high accuracy and a short moving distance in a short time. Therefore, completion time, aiming accuracy and moving distance were considered when identifying the control method with better efficiency. The accuracy is presented as the average of the shortest distance from the target centre to the centre of the view in pixels when aiming at the three targets. The moving distance refers to the total laparoscope tip moving distance in the task.
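As an illustration of how the moving distance and aiming accuracy could be computed from logged data, the sketch below sums the segment lengths of sampled tip positions and averages the per-target closest offsets. The logging format and function names are assumptions; `math.dist` requires Python 3.8 or later.

```python
import math


def path_length(points):
    """Total laparoscope tip moving distance from sampled 3D positions."""
    return sum(math.dist(a, b) for a, b in zip(points, points[1:]))


def closest_offset(target_track, view_centre):
    """Shortest distance in pixels from the target centre to the view centre."""
    return min(math.dist(p, view_centre) for p in target_track)


def aiming_accuracy(offsets_px):
    """Average of the per-target closest offsets in pixels."""
    return sum(offsets_px) / len(offsets_px)
```

For instance, `path_length([(0, 0, 0), (3, 4, 0), (3, 4, 12)])` gives 17.0, the sum of a 5-unit and a 12-unit segment.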

Predictability

Predictability is evaluated based on the total number of issued commands in the target aiming task using the two testing scenarios (i.e. a close target and a further target). If the laparoscope moves in a predictable manner, the participant should be able to aim at the target with fewer commands, since the movement of the tool would match the operator’s expectations, easing laparoscope control.

Subjective assessment

The Comparative Control strategy Questionnaire, customized Task Feedback Questionnaire, and NASA Task Load Index Questionnaire³⁷ were used in the experiment (Fig. 3a) mainly for the assumptions validation and control methods comparison.

The Comparative Control strategy Questionnaire compares the two control strategies’ predictability and efficiency in the target aiming task using single-choice questions at the end of the target aiming task. It also assesses the participants’ preference for a control strategy. The Task Feedback Questionnaire uses a five-point scale. It was used to assess the intuitiveness, usability and distraction of each control approach after the target aiming task.

The NASA Task Load Index Questionnaire is a subjective measurement of the participants’ perceived workload from six aspects, including effort, frustration, performance, mental, physical and temporal demand. Each factor’s raw scale is from 0 to 100 and has a weighted parameter from 0 to 5/15, with an interval of 1/15. The maximum weighted overall workload is 100. It is a method that enables participants to assess their experience with various human-machine interface systems.
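The weighted score described above can be written out as a short sketch. It assumes the standard NASA-TLX procedure in which each factor's weight is the number of times it was chosen in the 15 pairwise comparisons; the dictionary keys and function name are illustrative.

```python
FACTORS = ("mental", "physical", "temporal", "performance", "effort", "frustration")


def weighted_tlx(raw, tally):
    """Weighted overall workload on a 0-100 scale.

    `raw` maps each factor to its 0-100 rating; `tally` maps each factor to
    the number of pairwise comparisons it won (0-5, summing to 15).
    """
    if sum(tally[f] for f in FACTORS) != 15:
        raise ValueError("pairwise tallies must sum to 15")
    return sum(raw[f] * tally[f] for f in FACTORS) / 15.0
```

Because the weights sum to 15/15, rating every factor at 100 yields the maximum overall workload of 100.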

Interface usage pattern

The interface usage pattern study investigates how surgeons interact with the laparoscope-holding robot in the two surgery-like tasks. In this paper, the interface usage pattern is analyzed with respect to the motion patterns as well as the context.

Laparoscope motion patterns

A laparoscope motion pattern is defined as a sequence of movements before the laparoscope stops. A motion may contain a single command or several commands in a row. In this analysis, five types of motions can be observed:

Single-decoupled motion: A motion only contains a decoupled command
Single-coupled motion: A motion only contains a coupled command
Multiple-decoupled motion: A motion contains multiple decoupled commands
Multiple-coupled motion: A motion contains multiple coupled commands
Mixed command motion: A motion contains decoupled and coupled commands
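The five motion types listed above can be expressed as a small classifier. The sketch assumes each command in a motion is logged as the set of DoF it actuates, so a command with more than one DoF is coupled; this representation and the returned labels are illustrative.

```python
def classify_motion(commands):
    """Classify one motion (the commands issued before the laparoscope stops).

    `commands` is a non-empty list of sets of actuated DoF,
    e.g. [{"pitch"}, {"pitch", "yaw"}].
    """
    if not commands:
        raise ValueError("a motion contains at least one command")
    kinds = ["coupled" if len(c) > 1 else "decoupled" for c in commands]
    if len(kinds) == 1:
        return "single-" + kinds[0]
    if all(k == kinds[0] for k in kinds):
        return "multiple-" + kinds[0]
    return "mixed"
```

For example, a motion logged as `[{"pitch"}, {"pitch", "yaw"}]` is classified as `"mixed"`.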

Performed motion patterns were compared between the experienced and trainee surgeon groups.

Context dependency

The context dependency concerns the task phases, the participants’ experience and the laparoscope moving speed.

Based on our observation, a complete surgery-like task is broken into two phases: the operation phase and the view transfer phase. The operation phase is when the participant focuses on hands-on work with few camera adjustments. The view transfer phase is defined as the period where the participant finishes one operation and moves to a new site. The view transfer phase usually requires the laparoscope to travel more than the operation phase. Every command sent from the FI was recorded, and the task phases were manually annotated.

In the PP task, the operation phase consists of two sub-phases: picking and placing an object from/to the corresponding poles. The view transfer phase is split into the object-holding phase moving towards a target pole (hereafter the object-holding phase) and the no-object phase moving towards the centre pole (hereafter the no-object phase). One complete pick and place operation starts with the no-object phase, followed by the picking phase and the object-holding phase, and ends with the placing phase. The picking phase is considered easier than the placing phase since participants can finish picking as long as the target is visible in the view. In contrast, the placing phase needs a clearer view of the spatial relationship between a target pole and the held object. Therefore, more frequent adjustments and larger motions are expected before the placing operation.

In the LT task, the operation phase starts when the thread enters the gate and stops when the needle exits the gate. The view transfer phase is the period after the needle leaves the current gate and before it enters the new gate.

Participants were split into an experienced surgeons group and a trainee surgeons group based on their experience. In actual surgeries, the experienced surgeons perform the hands-on operation, while the trainee surgeons control the laparoscope to assist them. Based on this real-life task assignment, we expect the trainee group to use more commands than the experienced group because they have less surgical experience.

Only the laparoscope angular speed of pure rotation was used in the analysis, as pure rotation accounts for 85% of the total motions. In a motion such as rotation combined with zooming out, the user’s field of view increases dramatically, so the rotation command lasts only a very short time, usually less than a second. Any coupled command containing an insertion is therefore excluded from the analysis due to insufficient valid recorded data.

Safety

Patients’ safety is the first concern in any surgery. Four metrics listed below were used to evaluate the safety of the control:

The number of times the object drops: Surgeons may inadvertently drop the object and leave it inside the patient body. This is dangerous in real surgery.
The total time that the held object is outside of the laparoscope view: Keeping the held object outside the view is dangerous as the user cannot see the object, which decreases their situational awareness.
The number of times the tool hits the board: In the setup, the baseboard is analogous to the patient’s tissue and the pole/gate is the operation site. Hitting the patient with the tools should be avoided.
Participants were asked to identify the approach they find less error-prone if implemented in the actual surgery in questionnaires.
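The three objective metrics above could be computed from a time-stamped event log along the following lines. This is a minimal sketch: the `Event` record, the event names and the example log are hypothetical, since the logging format of the simulator is not described here.

```python
from dataclasses import dataclass


@dataclass
class Event:
    t: float    # timestamp in seconds
    kind: str   # hypothetical labels: "drop", "hit", "out_of_view", "in_view"


def safety_metrics(events):
    """Return (drop count, board-hit count, seconds the held object was out of view)."""
    drops = sum(1 for e in events if e.kind == "drop")
    hits = sum(1 for e in events if e.kind == "hit")

    out_time = 0.0
    left_at = None  # time at which the object left the view, if still outside
    for e in sorted(events, key=lambda ev: ev.t):
        if e.kind == "out_of_view" and left_at is None:
            left_at = e.t
        elif e.kind == "in_view" and left_at is not None:
            out_time += e.t - left_at
            left_at = None
    return drops, hits, out_time


# Made-up log, for illustration only.
log = [
    Event(2.0, "out_of_view"), Event(3.5, "in_view"),
    Event(5.0, "hit"), Event(7.0, "drop"),
    Event(8.0, "out_of_view"), Event(8.5, "in_view"),
]
drops, hits, out_time = safety_metrics(log)
```

For this log the object is out of view for two intervals (1.5 s and 0.5 s), with one board hit and one drop.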

Results

In this section, we first report the results of the target aiming task to evaluate the predictability and efficiency of the decoupled and hybrid control approaches, followed by laparoscope motion patterns, the context dependency, and safety of the laparoscope control in the PP and LT tasks. Although the sample size is small, statistical analyses are also presented. We first use the Shapiro-Wilk test to check the normality of all data. For paired-sample comparisons, such as commands sent by all eleven participants under the decoupled and hybrid approaches, the paired t-test is used for normally distributed data and the Wilcoxon signed-rank test is used for non-normally distributed data. For independent-sample comparisons, such as the performance difference between experienced and trainee surgeons, we use the pooled-variance t-test for normally distributed data and the Mann-Whitney U test for non-normally distributed data. The significance level \(\alpha\) = 0.05 is applied to all hypothesis tests. The statistical power is calculated using G*Power 3.1.9.4. The statistical power is 0.32 for the paired t-test, 0.44 for the Wilcoxon signed-rank test, 0.11 for the pooled-variance t-test, and 0.18 for the Mann-Whitney U test.
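The test-selection procedure described above can be sketched with SciPy as follows. This is an illustrative sketch rather than the analysis script used in the study. The function names are hypothetical. For paired data, the sketch assumes that normality is checked on the paired differences, which the text does not specify.

```python
from scipy import stats

ALPHA = 0.05  # significance level stated in the text


def is_normal(sample, alpha=ALPHA):
    """Shapiro-Wilk check: treat the sample as normal when p > alpha."""
    _, p = stats.shapiro(sample)
    return p > alpha


def compare_paired(x, y, alpha=ALPHA):
    """Paired comparison, e.g. decoupled vs hybrid for the same participants."""
    # Assumption: normality is assessed on the paired differences.
    diffs = [b - a for a, b in zip(x, y)]
    if is_normal(diffs, alpha):
        return "paired t-test", stats.ttest_rel(x, y).pvalue
    return "Wilcoxon signed-rank", stats.wilcoxon(x, y).pvalue


def compare_independent(g1, g2, alpha=ALPHA):
    """Independent comparison, e.g. experienced vs trainee surgeons."""
    if is_normal(g1, alpha) and is_normal(g2, alpha):
        # equal_var=True gives the pooled-variance (Student) t-test.
        return "pooled-variance t-test", stats.ttest_ind(g1, g2, equal_var=True).pvalue
    result = stats.mannwhitneyu(g1, g2, alternative="two-sided")
    return "Mann-Whitney U", result.pvalue
```

Each function returns the name of the selected test together with its p-value, which is then compared against \(\alpha\) = 0.05.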

Assumptions validation

Predictability assumption validation

As shown in Fig. 4a,b, decoupled control required fewer issued commands in all trials on both paths. The statistical analyses also show that decoupled control used significantly fewer commands. The maximum difference was in Trial 1 Path 1 (Decoupled 25.5 vs Hybrid 39.8 commands, p = 0.002), and the minimum difference was in Trial 4 Path 1 (Decoupled 25.7 vs Hybrid 32.1 commands, p = 0.045), meaning that decoupled control requires less adjustment than hybrid control. This result supports the assumption that predicting the movements of the end effector is easier using decoupled control.

Efficiency assumption validation

Efficiency is measured by the completion time, moving distance and accuracy.

In Fig. 4c,d, there was no obvious difference in task completion time between the two approaches, and the statistical analyses did not show any significant difference either. The maximum time difference in Path 1 occurred at Trial 4 (Decoupled 99.1 s vs Hybrid 88.2 s), and that in Path 2 occurred at Trial 3 (Decoupled 90.2 s vs Hybrid 99 s). Sample points are more tightly clustered for decoupled control, which indicates that participants had a more consistent completion time with decoupled control.

The aiming accuracy of the decoupled approach was consistently equal to or better than that of the hybrid approach across both paths and all trials (Fig. 4e,f). Only one significant difference was identified, in Path 2 Trial 3 (Decoupled 6.6 pixels vs Hybrid 8.5 pixels, p = 0.0297). Moving distances with decoupled control were slightly shorter than with hybrid control in all trials (Fig. 4g,h), and two significant differences were observed, at Path 2 Trial 1 (Decoupled 720.5 mm vs Hybrid 791.2 mm, p = 0.00098) and Path 2 Trial 3 (Decoupled 701.9 mm vs Hybrid 750 mm, p = 0.0026).

In conclusion, decoupled control had overall shorter moving distances and higher accuracy. No obvious time difference was observed between the two approaches. Therefore, the assumption that hybrid control is more efficient than decoupled control is not supported.

Questionnaire

At the end of the target aiming task (Fig. 3a), the Comparative Control Strategy Questionnaire shows that 73% of participants considered the hybrid control more efficient than the decoupled control. In the predictability question, 55% of participants believed the decoupled method was more predictable. In addition, this questionnaire shows that 82% of participants identified hybrid control as a more distracting strategy. Only 18% of the participants considered the hybrid control to be less error-prone if implemented in actual surgery. Seven participants preferred using decoupled control.

Similarly, the Task Feedback Questionnaires show that participants believed decoupled foot gestures were more intuitive than hybrid foot gestures, with average scores of 3.1 for decoupled control and 2.64 for hybrid control. The average distraction score was slightly higher for hybrid control (3.27) than for decoupled control (2.91). Moreover, decoupled control (3.73) was considered to have better usability than hybrid control (3.31). However, the statistical analyses found no significant difference.

The NASA Task Load Index questionnaire shows that hybrid control was rated higher than decoupled control in weighted effort (12.4 vs 8.67, p = 0.1), mental demand (11.7 vs 6.24, p = 0.0039), and frustration level (7.52 vs 2.91, p = 0.16), which means participants experienced more mental burden while using the hybrid approach. In terms of physical demand, the two methods had similar scores (Decoupled 6.6 vs Hybrid 7.4, p = 0.8). The average overall perceived workload of decoupled control was significantly lower than that of hybrid control (Decoupled 38.53 vs Hybrid 51.67, p = 0.0037).
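For readers unfamiliar with how weighted subscale scores and the overall workload relate, the standard weighted NASA-TLX computation can be sketched as below. This assumes the usual procedure: six subscales rated from 0 to 100, with weights obtained from 15 pairwise comparisons. The text does not state that exactly this procedure was followed, and the ratings and weights in the example are made up for illustration.

```python
SUBSCALES = ("mental", "physical", "temporal", "performance", "effort", "frustration")


def weighted_tlx(ratings, weights):
    """Return (weighted subscale scores, overall workload) for one participant.

    ratings: subscale -> raw rating on a 0-100 scale
    weights: subscale -> number of pairwise comparisons won (must sum to 15)
    """
    if sum(weights[s] for s in SUBSCALES) != 15:
        raise ValueError("NASA-TLX weights must sum to 15")
    weighted = {s: ratings[s] * weights[s] / 15 for s in SUBSCALES}
    overall = sum(weighted.values())
    return weighted, overall


# Made-up ratings and weights for a single participant.
ratings = {"mental": 60, "physical": 30, "temporal": 45,
           "performance": 15, "effort": 75, "frustration": 30}
weights = {"mental": 5, "physical": 1, "temporal": 2,
           "performance": 3, "effort": 3, "frustration": 1}
weighted, overall = weighted_tlx(ratings, weights)
```

With these example values the weighted mental demand is 60 × 5 / 15 = 20 and the overall workload is 48, which shows how each weighted subscale score contributes additively to the overall score.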

Overall, the predictability and efficiency comparisons from the questionnaires were consistent with the assumptions mentioned in the introduction. In addition, the Comparative Control Strategy Questionnaire and the Task Feedback Questionnaires indicated that decoupled control is potentially safer and less distracting. Participating surgeons also preferred decoupled control. The NASA Task Load Index questionnaire further indicated that hybrid control caused a higher perceived workload than decoupled control, especially cognitive load: participants needed to make more effort to finish the target aiming task and therefore usually reported a higher frustration level.