論文

2023年5月4日

Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory

BMC Medical Informatics and Decision Making
  • Bowen Wang
  • ,
  • Liangzhi Li
  • ,
  • Yuta Nakashima
  • ,
  • Ryo Kawasaki
  • ,
  • Hajime Nagahara

23
1
記述言語
掲載種別
研究論文(学術雑誌)
DOI
10.1186/s12911-023-02160-0
出版者・発行元
Springer Science and Business Media LLC

Abstract

Purpose

Estimating the surgery length has the potential to be utilized as skill assessment, surgical training, or efficient surgical facility utilization especially if it is done in real-time as a remaining surgery duration (RSD). Surgical length reflects a certain level of efficiency and mastery of the surgeon in a well-standardized surgery such as cataract surgery. In this paper, we design and develop a real-time RSD estimation method for cataract surgery that does not require manual labeling and is transferable with minimum fine-tuning.

Methods

A regression method consisting of convolutional neural networks (CNNs) and long short-term memory (LSTM) is designed for RSD estimation. The model is firstly trained and evaluated for the single main surgeon with a large number of surgeries. Then, the fine-tuning strategy is used to transfer the model to the data of the other two surgeons. Mean Absolute Error (MAE in seconds) was used to evaluate the performance of the RSD estimation. The proposed method is compared with the naïve method which is based on the statistic of the historical data. A transferability experiment is also set to demonstrate the generalizability of the method.

Result

The mean surgical time for the sample videos was 318.7 s (s) (standard deviation 83.4 s) for the main surgeon for the initial training. In our experiments, the lowest MAE of 19.4 s (equal to about 6.4% of the mean surgical time) is achieved by our best-trained model for the independent test data of the main target surgeon. It reduces the MAE by 35.5 s (-10.2%) compared to the naïve method. The fine-tuning strategy transfers the model trained for the main target to the data of other surgeons with only a small number of training data (20% of the pre-training). The MAEs for the other two surgeons are 28.3 s and 30.6 s with the fine-tuning model, which decreased by -8.1 s and -7.5 s than the Per-surgeon model (average declining of -7.8 s and 1.3% of video duration). External validation study with Cataract-101 outperformed 3 reported methods of TimeLSTM, RSDNet, and CataNet.

Conclusion

An approach to build a pre-trained model for estimating RSD estimation based on a single surgeon and then transfer to other surgeons demonstrated both low prediction error and good transferability with minimum fine-tuning videos.

リンク情報
DOI
https://doi.org/10.1186/s12911-023-02160-0
URL
https://link.springer.com/content/pdf/10.1186/s12911-023-02160-0.pdf
URL
https://link.springer.com/article/10.1186/s12911-023-02160-0/fulltext.html
ID情報
  • DOI : 10.1186/s12911-023-02160-0
  • eISSN : 1472-6947

エクスポート
BibTeX RIS