Content area

Abstract

Text-to-Thangka generation requires preserving both semantic accuracy and textural details. Current methods struggle with fine-grained feature extraction, multi-level feature integration, and discriminator overfitting due to limited Thangka data. We present HST-GAN, a novel framework combining parallel hybrid attention with differentiable symmetric augmentation. The architecture features a Parallel Spatial-Channel Attention module (PSCA) for precise localization of deity facial features and ritual object textures, along with a Hierarchical Feature Fusion Network (HLFN) for multi-scale alignment. The framework’s Differentiable Symmetric Augmentation (DiffAugment) dynamically adjusts discriminator inputs to prevent overfitting while improving generalization. On the T2IThangka dataset, HST-GAN achieves an Inception Score of 2.08 and reduces Fréchet Inception Distance to 87.91, demonstrating superior performance over baselines on the Oxford-102 benchmark.

Details

1009240
Business indexing term
Title
Hierarchical symmetric GAN for Thangka image generation
Author
Hu, Wenjin 1 ; Zhao, Yan 2 ; Yin, Lemei 2 ; Zhang, Guoquan 1 

 Northwest Minzu University, School of Mathematics and Computer Science, Lanzhou, China (GRID:grid.412264.7) (ISNI:0000 0001 0108 3408); Gansu Provincial Engineering Research Center of Multi-Modal Artificial Intelligence, Lanzhou, China (GRID:grid.412264.7); Northwest Minzu University, Key Laboratory of China’s Ethnic Languages and Information Technology of Ministry of Education, Key Laboratory of Linguistic and Cultural Computing of Ministry of Education, Chinese National Information Technology Research Institute, Lanzhou, China (GRID:grid.412264.7) (ISNI:0000 0001 0108 3408) 
 Northwest Minzu University, Key Laboratory of China’s Ethnic Languages and Information Technology of Ministry of Education, Key Laboratory of Linguistic and Cultural Computing of Ministry of Education, Chinese National Information Technology Research Institute, Lanzhou, China (GRID:grid.412264.7) (ISNI:0000 0001 0108 3408) 
Publication title
Volume
13
Issue
1
Pages
568
Publication year
2025
Publication date
Dec 2025
Publisher
Springer Nature B.V.
Place of publication
London
Country of publication
Netherlands
Publication subject
e-ISSN
20507445
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-11-08
Milestone dates
2025-10-05 (Registration); 2025-06-03 (Received); 2025-10-05 (Accepted)
Publication history
 
 
   First posting date
08 Nov 2025
ProQuest document ID
3269991594
Document URL
https://www.proquest.com/scholarly-journals/hierarchical-symmetric-gan-thangka-image/docview/3269991594/se-2?accountid=208611
Copyright
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-11-09
Database
ProQuest One Academic