Full text

Turn on search term navigation

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Optical coherence tomography (OCT) acquisitions often reduce lateral sampling density to shorten scan time and suppress motion artifacts, but this strategy degrades the signal-to-noise ratio and obscures fine retinal microstructures. To recover these details without hardware modifications, we propose MGFormer, a lightweight Transformer for OCT super-resolution (SR) that integrates a multi-granularity attention mechanism with tensor distillation. A feature-enhancing convolution first sharpens edges; stacked multi-granularity attention blocks then fuse coarse-to-fine context, while a row-wise top-k operator retains the most informative tokens and preserves their positional order. We trained and evaluated MGFormer on B-scans from the Duke SD-OCT dataset at 2×, 4×, and 8× scaling factors. Relative to seven recent CNN- and Transformer-based SR models, MGFormer achieves the highest quantitative fidelity; at 4× it reaches 34.39 dB PSNR and 0.8399 SSIM, surpassing SwinIR by +0.52 dB and +0.026 SSIM, and reduces LPIPS by 21.4%. Compared with the same backbone without tensor distillation, FLOPs drop from 289G to 233G (−19.4%), and per-B-scan latency at 4× falls from 166.43 ms to 98.17 ms (−41.01%); the model size remains compact (105.68 MB). A blinded reader study shows higher scores for boundary sharpness (4.2 ± 0.3), pathology discernibility (4.1 ± 0.3), and diagnostic confidence (4.3 ± 0.2), exceeding SwinIR by 0.3–0.5 points. These results suggest that MGFormer can provide fast, high-fidelity OCT SR suitable for routine clinical workflows.

Details

Title
MGFormer: Super-Resolution Reconstruction of Retinal OCT Images Based on a Multi-Granularity Transformer
Author
Luan Jingmin 1   VIAFID ORCID Logo  ; Jiao Zhe 1 ; Li, Yutian 1 ; Si Yanru 2 ; Liu, Jian 3 ; Yao, Yu 3 ; Yang Dongni 4 ; Sun, Jia 4 ; Wei Zehao 5 ; Ma Zhenhe 3   VIAFID ORCID Logo 

 School of Computer and Communication Engineering, Northeastern University at Qinhuangdao, No. 143 Taishan Road, Qinhuangdao 066004, China; [email protected] (J.L.); [email protected] (Z.J.); [email protected] (Y.L.) 
 Laiwu People’s Hospital, Jinan 271100, China; [email protected] 
 School of Control Engineering, Northeastern University at Qinhuangdao, No. 143 Taishan Road, Qinhuangdao 066004, China; [email protected] (J.L.); [email protected] (Y.Y.) 
 Department of Ophthalmology, The First Hospital of Qinhuangdao, Qinhuangdao 066001, China; [email protected] (D.Y.); [email protected] (J.S.) 
 School of Electronics and Information, Northwestern Polytechnical University, No. 1 Dongxiang Road, Xi’an 710129, China; [email protected] 
First page
850
Publication year
2025
Publication date
2025
Publisher
MDPI AG
e-ISSN
23046732
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3254625297
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.