中文版 | English
题名

Uni4Eye++: A General Masked Image Modeling Multi-modal Pre-training Framework for Ophthalmic Image Classification and Segmentation

作者
发表日期
2024
DOI
发表期刊
ISSN
1558-254X
卷号PP期号:99
摘要
A large-scale labeled dataset is a key factor for the success of supervised deep learning in most ophthalmic image analysis scenarios. However, limited annotated data is very common in ophthalmic image analysis, since manual annotation is time-consuming and labor-intensive. Self-supervised learning (SSL) methods bring huge opportunities for better utilizing unlabeled data, as they do not require massive annotations. To utilize as many unlabeled ophthalmic images as possible, it is necessary to break the dimension barrier, simultaneously making use of both 2D and 3D images as well as alleviating the issue of catastrophic forgetting. In this paper, we propose a universal self-supervised Transformer framework named Uni4Eye++ to discover the intrinsic image characteristic and capture domain-specific feature embedding in ophthalmic images. Uni4Eye++ can serve as a global feature extractor, which builds its basis on a Masked Image Modeling task with a Vision Transformer architecture. On the basis of our previous work Uni4Eye, we further employ an image entropy guided masking strategy to reconstruct more-informative patches and a dynamic head generator module to alleviate modality confusion. We evaluate the performance of our pre-trained Uni4Eye++ encoder by fine-tuning it on multiple downstream ophthalmic image classification and segmentation tasks. The superiority of Uni4Eye++ is successfully established through comparisons to other state-of-the-art SSL pre-training methods. Our code is available at Github1.
相关链接[IEEE记录]
收录类别
学校署名
第一
ESI学科分类
CLINICAL MEDICINE
引用统计
成果类型期刊论文
条目标识符http://sustech.caswiz.com/handle/2SGJ60CL/783767
专题工学院_电子与电气工程系
南方科技大学
作者单位
1.Department of Electronic and Electrical Engineering, Southern University of Science and Technology, Shenzhen, China
2.Jiaxing Research Institute, Southern University of Science and Technology, Jiaxing, China
第一作者单位电子与电气工程系;  南方科技大学
第一作者的第一单位电子与电气工程系
推荐引用方式
GB/T 7714
Zhiyuan Cai,Li Lin,Huaqing He,et al. Uni4Eye++: A General Masked Image Modeling Multi-modal Pre-training Framework for Ophthalmic Image Classification and Segmentation[J]. IEEE Transactions on Medical Imaging,2024,PP(99).
APA
Zhiyuan Cai,Li Lin,Huaqing He,Pujin Cheng,&Xiaoying Tang.(2024).Uni4Eye++: A General Masked Image Modeling Multi-modal Pre-training Framework for Ophthalmic Image Classification and Segmentation.IEEE Transactions on Medical Imaging,PP(99).
MLA
Zhiyuan Cai,et al."Uni4Eye++: A General Masked Image Modeling Multi-modal Pre-training Framework for Ophthalmic Image Classification and Segmentation".IEEE Transactions on Medical Imaging PP.99(2024).
条目包含的文件
条目无相关文件。
个性服务
原文链接
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
导出为Excel格式
导出为Csv格式
Altmetrics Score
谷歌学术
谷歌学术中相似的文章
[Zhiyuan Cai]的文章
[Li Lin]的文章
[Huaqing He]的文章
百度学术
百度学术中相似的文章
[Zhiyuan Cai]的文章
[Li Lin]的文章
[Huaqing He]的文章
必应学术
必应学术中相似的文章
[Zhiyuan Cai]的文章
[Li Lin]的文章
[Huaqing He]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
[发表评论/异议/意见]
暂无评论

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。