Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

PEg TRAnsfer Workflow recognition challenge report: Do multimodal data improve recognition?

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • Additional Information
    • Contributors:
      Laboratoire Traitement du Signal et de l'Image (LTSI); Université de Rennes (UR)-Institut National de la Santé et de la Recherche Médicale (INSERM); The University of Tokyo (UTokyo); Johns Hopkins University (JHU); Zhejiang University; The Chinese University of Hong Kong Hong Kong (CUHK); Netaji Subhas University of Technology (NSUT); National University of Singapore (NUS); Muroran Institute of Technology
    • Publication Information:
      HAL CCSD
      Elsevier
    • Publication Date:
      2023
    • Collection:
      Université de Rennes 1: Publications scientifiques (HAL)
    • Abstract:
      International audience ; Background and objective: In order to be context-aware, computer-assisted surgical systems require accurate, real-time automatic surgical workflow recognition. In the past several years, surgical video has been the most commonly-used modality for surgical workflow recognition. But with the democratization of robot-assisted surgery, new modalities, such as kinematics, are now accessible. Some previous methods use these new modalities as input for their models, but their added value has rarely been studied. This paper presents the design and results of the “PEg TRAnsfer Workflow recognition” (PETRAW) challenge with the objective of developing surgical workflow recognition methods based on one or more modalities and studying their added value. Methods: The PETRAW challenge included a data set of 150 peg transfer sequences performed on a virtual simulator. This data set included videos, kinematic data, semantic segmentation data, and annotations, which described the workflow at three levels of granularity: phase, step, and activity. Five tasks were proposed to the participants: three were related to the recognition at all granularities simultaneously using a single modality, and two addressed the recognition using multiple modalities. The mean application-dependent balanced accuracy (AD-Accuracy) was used as an evaluation metric to take into account class balance and is more clinically relevant than a frame-by-frame score. Results: Seven teams participated in at least one task with four participating in every task. The best results were obtained by combining video and kinematic data (AD-Accuracy of between 93% and 90% for the four teams that participated in all tasks). Conclusion: The improvement of surgical workflow recognition methods using multiple modalities compared with unimodal methods was significant for all teams. However, the longer execution time required for video/kinematic-based methods(compared to only kinematic-based methods) must be considered. Indeed, one must ask if it is ...
    • Relation:
      info:eu-repo/semantics/altIdentifier/pmid/37119774; hal-04089303; https://hal.science/hal-04089303; https://hal.science/hal-04089303/document; https://hal.science/hal-04089303/file/Huaulm%C3%A9%20et%20al.%20-%202022%20-%20PEg%20TRAnsfer%20Workflow%20Recognition%20Challenge%20Report.pdf; PUBMED: 37119774
    • Accession Number:
      10.1016/j.cmpb.2023.107561
    • Online Access:
      https://hal.science/hal-04089303
      https://hal.science/hal-04089303/document
      https://hal.science/hal-04089303/file/Huaulm%C3%A9%20et%20al.%20-%202022%20-%20PEg%20TRAnsfer%20Workflow%20Recognition%20Challenge%20Report.pdf
      https://doi.org/10.1016/j.cmpb.2023.107561
    • Rights:
      info:eu-repo/semantics/OpenAccess
    • Accession Number:
      edsbas.4B423D39