• Media type: E-Article
  • Title: The distinct roles of reinforcement learning between pre-procedure and intra-procedure planning for prostate biopsy
  • Contributor: Gayo, Iani J. M. B.; Saeed, Shaheer U.; Bonmati, Ester; Barratt, Dean C.; Clarkson, Matthew J.; Hu, Yipeng
  • imprint: Springer Science and Business Media LLC, 2024
  • Published in: International Journal of Computer Assisted Radiology and Surgery
  • Language: English
  • DOI: 10.1007/s11548-024-03084-4
  • ISSN: 1861-6429
  • Keywords: Health Informatics ; Radiology, Nuclear Medicine and imaging ; General Medicine ; Surgery ; Computer Graphics and Computer-Aided Design ; Computer Science Applications ; Computer Vision and Pattern Recognition ; Biomedical Engineering
  • Origination:
  • Footnote:
  • Description: <jats:title>Abstract</jats:title><jats:sec> <jats:title>Purpose</jats:title> <jats:p>Magnetic resonance (MR) imaging targeted prostate cancer (PCa) biopsy enables precise sampling of MR-detected lesions, establishing its importance in recommended clinical practice. Planning for the ultrasound-guided procedure involves pre-selecting needle sampling positions. However, performing this procedure is subject to a number of factors, including MR-to-ultrasound registration, intra-procedure patient movement and soft tissue motions. When a fixed <jats:italic>pre-procedure planning</jats:italic> is carried out without intra-procedure adaptation, these factors will lead to sampling errors which could cause false positives and false negatives. Reinforcement learning (RL) has been proposed for procedure plannings on similar applications such as this one, because intelligent agents can be trained for both pre-procedure and <jats:italic>intra-procedure planning</jats:italic>. However, it is not clear if RL is beneficial when it comes to addressing these intra-procedure errors.</jats:p> </jats:sec><jats:sec> <jats:title>Methods</jats:title> <jats:p>In this work, we develop and compare imitation learning (IL), supervised by demonstrations of predefined sampling strategy, and RL approaches, under varying degrees of intra-procedure motion and registration error, to represent sources of targeting errors likely to occur in an intra-operative procedure.</jats:p> </jats:sec><jats:sec> <jats:title>Results</jats:title> <jats:p>Based on results using imaging data from 567 PCa patients, we demonstrate the efficacy and value in adopting RL algorithms to provide intelligent intra-procedure action suggestions, compared to IL-based planning supervised by commonly adopted policies.</jats:p> </jats:sec><jats:sec> <jats:title>Conclusions</jats:title> <jats:p>The improvement in biopsy sampling performance for intra-procedure planning has not been observed in experiments with only pre-procedure planning. These findings suggest a strong role for RL in future prospective studies which adopt intra-procedure planning. Our open source code implementation is available <jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/i-gayo/ImitationLearning">here</jats:ext-link>.</jats:p> </jats:sec>