Abstract

We introduce HAPPY (Hierarchically Abstracted rePeat unit of PolYmers), a string representation for polymers, designed to efficiently encapsulate essential polymer structure features for property prediction. HAPPY assigns single constituent elements to groups of sub-structures and employs grammatically complete and independent connectors between chemical linkages. Using a limited number of datapoints, we trained neural networks utilizing both HAPPY and conventional SMILES encoding of repeated unit structures and compared their performance in predicting five polymer properties: dielectric constant, glass transition temperature, thermal conductivity, solubility, and density. The results showed that the HAPPY-based network could achieve higher prediction R-squared score and two-fold faster training times. We further tested the robustness and versatility of HAPPY-based network with an augmented training dataset. Additionally, we present topo-HAPPY (Topological HAPPY), an extension that incorporates topological details of the constituent connectivity, leading to improved solubility and glass transition temperature prediction R-squared score.

Details

Title
Enhancing deep learning predictive models with HAPPY (Hierarchically Abstracted rePeat unit of PolYmers) representation
Author
Ahn, Jihun 1   VIAFID ORCID Logo  ; Irianti, Gabriella Pasya 1   VIAFID ORCID Logo  ; Choe, Yeojin 1 ; Hur, Su-Mi 2   VIAFID ORCID Logo 

 Chonnam National University, Department of Polymer Engineering, Graduate School, Gwangju, Republic of Korea (GRID:grid.14005.30) (ISNI:0000 0001 0356 9399) 
 Chonnam National University, Department of Polymer Engineering, Graduate School, Gwangju, Republic of Korea (GRID:grid.14005.30) (ISNI:0000 0001 0356 9399); Chonnam National University, School of Polymer Science and Engineering, Gwangju, Republic of Korea (GRID:grid.14005.30) (ISNI:0000 0001 0356 9399) 
Pages
110
Publication year
2024
Publication date
2024
Publisher
Nature Publishing Group
e-ISSN
20573960
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3059661679
Copyright
© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.