TY - JOUR
T1 - Beyond convolutions and supervised learning with transformers and representation learning for retinal image analysis
AU - Wu, Yue
AU - Lee, Cecilia S.
AU - Lee, Aaron Y.
N1 - Publisher Copyright:
© 2025 Elsevier Ltd.
PY - 2026/1
Y1 - 2026/1
N2 - Retinal image analysis has enjoyed groundbreaking advances in the last ten years due to seismic improvements in image analysis techniques from the field of computer vision. Previous reviews in deep learning and artificial intelligence (AI) (Schmidt-Erfurth et al., 2018; Ting et al., 2019) have either focused on supervised learning, where labels are curated or manually created, or concentrated on the application of AI in specific image modalities and retina diseases (Hormel et al., 2021; Li et al., 2024a (Hormel et al., 2021; Li et al., 2024a)). In this review, we sought to summarize the advances in the field with the shift towards label-free approaches using representational learning and the emergence of vision transformers as alternatives to convolutional neural networks for image analysis. These advances include semi-supervised learning, self-supervised learning and directly led to the advent of foundation models, vision-language models, and multi-modal models.
AB - Retinal image analysis has enjoyed groundbreaking advances in the last ten years due to seismic improvements in image analysis techniques from the field of computer vision. Previous reviews in deep learning and artificial intelligence (AI) (Schmidt-Erfurth et al., 2018; Ting et al., 2019) have either focused on supervised learning, where labels are curated or manually created, or concentrated on the application of AI in specific image modalities and retina diseases (Hormel et al., 2021; Li et al., 2024a (Hormel et al., 2021; Li et al., 2024a)). In this review, we sought to summarize the advances in the field with the shift towards label-free approaches using representational learning and the emergence of vision transformers as alternatives to convolutional neural networks for image analysis. These advances include semi-supervised learning, self-supervised learning and directly led to the advent of foundation models, vision-language models, and multi-modal models.
KW - AI
KW - Deep learning
KW - Foundation models
KW - Image analysis
KW - Retinal imaging
KW - Self-supervised learning
KW - Semi-supervised learning
UR - https://www.scopus.com/pages/publications/105023970275
U2 - 10.1016/j.preteyeres.2025.101419
DO - 10.1016/j.preteyeres.2025.101419
M3 - Article
C2 - 41352580
AN - SCOPUS:105023970275
SN - 1350-9462
VL - 110
JO - Progress in Retinal and Eye Research
JF - Progress in Retinal and Eye Research
M1 - 101419
ER -