题名 | Global–local multi-stage temporal convolutional network for cataract surgery phase recognition |
作者 | |
通讯作者 | Gu,Yuanyuan; Chen,Xu |
发表日期 | 2022-12-01
|
DOI | |
发表期刊 | |
EISSN | 1475-925X
|
卷号 | 21期号:1 |
摘要 | Background: Surgical video phase recognition is an essential technique in computer-assisted surgical systems for monitoring surgical procedures, which can assist surgeons in standardizing procedures and enhancing postsurgical assessment and indexing. However, the high similarity between the phases and temporal variations of cataract videos still poses the greatest challenge for video phase recognition. Methods: In this paper, we introduce a global–local multi-stage temporal convolutional network (GL-MSTCN) to explore the subtle differences between high similarity surgical phases and mitigate the temporal variations of surgical videos. The presented work consists of a triple-stream network (i.e., pupil stream, instrument stream, and video frame stream) and a multi-stage temporal convolutional network. The triple-stream network first detects the pupil and surgical instruments regions in the frame separately and then obtains the fine-grained semantic features of the video frames. The proposed multi-stage temporal convolutional network improves the surgical phase recognition performance by capturing longer time series features through dilated convolutional layers with varying receptive fields. Results: Our method is thoroughly validated on the CSVideo dataset with 32 cataract surgery videos and the public Cataract101 dataset with 101 cataract surgery videos, outperforming state-of-the-art approaches with 95.8% and 96.5% accuracy, respectively. Conclusions: The experimental results show that the use of global and local feature information can effectively enhance the model to explore fine-grained features and mitigate temporal and spatial variations, thus improving the surgical phase recognition performance of the proposed GL-MSTCN. |
关键词 | |
相关链接 | [Scopus记录] |
收录类别 | |
语种 | 英语
|
学校署名 | 其他
|
资助项目 | Natural Science Foundation of Ningbo[202003N4039];Natural Science Foundation of Ningbo Municipality[202003N4039];
|
WOS研究方向 | Engineering
|
WOS类目 | Engineering, Biomedical
|
WOS记录号 | WOS:000892987300001
|
出版者 | |
Scopus记录号 | 2-s2.0-85143117573
|
来源库 | Scopus
|
引用统计 |
被引频次[WOS]:1
|
成果类型 | 期刊论文 |
条目标识符 | http://sustech.caswiz.com/handle/2SGJ60CL/416473 |
专题 | 工学院_计算机科学与工程系 |
作者单位 | 1.College of Mechanical Engineering,Zhejiang University of Technology,Hangzhou,310014,China 2.Cixi Institute of Biomedical Engineering,Ningbo Institute of Materials Technology and Engineering,Chinese Academy of Sciences,Ningbo,China 3.Department of Computer Science and Engineering,Southern University of Science and Technology,Shenzhen,518055,China 4.Department of Ophthalmology,Shanghai Aier Eye Hospital,Shanghai,China 5.Department of Ophthalmology,Shanghai Aier Qingliang Eye Hospital,Shanghai,China 6.Aier Eye Hospital,Jinan University,Guangzhou,No. 601, Huangpu Road West,China 7.Aier School of Ophthalmology,Central South University Changsha,Changsha,Hunan,China 8.Zhejiang Engineering Research Center for Biomedical Materials,Cixi Institute of Biomedical Engineering,Ningbo Institute of Materials Technology and Engineering,Chinese Academy of Sciences,Ningbo,315300,China 9.Aerospace Information Research Institute,Chinese Academy of Sciences,Beijing,China |
推荐引用方式 GB/T 7714 |
Fang,Lixin,Mou,Lei,Gu,Yuanyuan,等. Global–local multi-stage temporal convolutional network for cataract surgery phase recognition[J]. BioMedical Engineering Online,2022,21(1).
|
APA |
Fang,Lixin.,Mou,Lei.,Gu,Yuanyuan.,Hu,Yan.,Chen,Bang.,...&Zhao,Yitian.(2022).Global–local multi-stage temporal convolutional network for cataract surgery phase recognition.BioMedical Engineering Online,21(1).
|
MLA |
Fang,Lixin,et al."Global–local multi-stage temporal convolutional network for cataract surgery phase recognition".BioMedical Engineering Online 21.1(2022).
|
条目包含的文件 | 条目无相关文件。 |
|
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论