Content area

Abstract

With the increasing demand for high-definition video, video super-resolution technology has become a key means to improve video picture quality. Traditional video super-resolution methods are limited by computational resources and model complexity, which struggle to meet the demands of modern video processing. In recent years, the rise of deep learning technology has brought a revolutionary breakthrough for video super-resolution. In this paper, we propose a deep learning-based video super-resolution reconstruction method that combines Transformer, cross-modal learning and fusion, and an attention mechanism. We design the Temporal Transformer-based Video Super-Resolution (TT-VSR) architecture, which significantly improves the accuracy and detail richness of video reconstruction by integrating the Transformer's self-attention mechanism with CNN's spatial feature extraction capabilities. The introduction of cross-modal learning and fusion, along with the cross-modal attention mechanism, further enhances the model's adaptability to complex scenes and detail recovery ability. Experimental results demonstrate that our model outperforms existing methods, achieving a PSNR ofXdB and an SSIM of Y, indicating substantial improvements in image quality. These results validate the efficacy of our approach and open a new path for the development of video super-resolution technology.

Details

1009240
Business indexing term
Title
Temporal Transformer-Based Video Super-Resolution Reconstruction with Cross-Modal Attention
Author
Gong, Jingmin 1 ; Xu, Qinfei 2 

 Informationization Department, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China 
 College of Astronautics, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China 
Publication title
Informatica; Ljubljana
Volume
49
Issue
10
Pages
179-190
Publication year
2025
Publication date
Feb 2025
Publisher
Slovenian Society Informatika / Slovensko drustvo Informatika
Place of publication
Ljubljana
Country of publication
Slovenia
Publication subject
ISSN
03505596
e-ISSN
18543871
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
ProQuest document ID
3188467206
Document URL
https://www.proquest.com/scholarly-journals/temporal-transformer-based-video-super-resolution/docview/3188467206/se-2?accountid=208611
Copyright
© 2025. This work is published under https://creativecommons.org/licenses/by/3.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-08-14
Database
ProQuest One Academic