TY - JOUR
T1 - PtychoDV
T2 - Vision Transformer-Based Deep Unrolling Network for Ptychographic Image Reconstruction
AU - Gan, Weijie
AU - Zhai, Qiuchen
AU - McCann, Michael T.
AU - Cardona, Cristina Garcia
AU - Kamilov, Ulugbek S.
AU - Wohlberg, Brendt
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2024
Y1 - 2024
N2 - Ptychography is an imaging technique that captures multiple overlapping snapshots of a sample, illuminated coherently by a moving localized probe. The image recovery from ptychographic data is generally achieved via an iterative algorithm that solves a nonlinear phase retrieval problem derived from measured diffraction patterns. However, these iterative approaches have high computational cost. In this paper, we introduce PtychoDV, a novel deep model-based network designed for efficient, high-quality ptychographic image reconstruction. PtychoDV comprises a vision transformer that generates an initial image from the set of raw measurements, taking into consideration their mutual correlations. This is followed by a deep unrolling network that refines the initial image using learnable convolutional priors and the ptychography measurement model. Experimental results on simulated data demonstrate that PtychoDV is capable of outperforming existing deep learning methods for this problem, and significantly reduces computational cost compared to iterative methodologies, while maintaining competitive performance.
AB - Ptychography is an imaging technique that captures multiple overlapping snapshots of a sample, illuminated coherently by a moving localized probe. The image recovery from ptychographic data is generally achieved via an iterative algorithm that solves a nonlinear phase retrieval problem derived from measured diffraction patterns. However, these iterative approaches have high computational cost. In this paper, we introduce PtychoDV, a novel deep model-based network designed for efficient, high-quality ptychographic image reconstruction. PtychoDV comprises a vision transformer that generates an initial image from the set of raw measurements, taking into consideration their mutual correlations. This is followed by a deep unrolling network that refines the initial image using learnable convolutional priors and the ptychography measurement model. Experimental results on simulated data demonstrate that PtychoDV is capable of outperforming existing deep learning methods for this problem, and significantly reduces computational cost compared to iterative methodologies, while maintaining competitive performance.
KW - and image reconstruction
KW - deep learning
KW - deep unrolling
KW - Ptychography
KW - vision transformer
UR - http://www.scopus.com/inward/record.url?scp=85187343143&partnerID=8YFLogxK
U2 - 10.1109/OJSP.2024.3375276
DO - 10.1109/OJSP.2024.3375276
M3 - Article
AN - SCOPUS:85187343143
SN - 2644-1322
VL - 5
SP - 539
EP - 547
JO - IEEE Open Journal of Signal Processing
JF - IEEE Open Journal of Signal Processing
ER -