Content area

Abstract

In the research and production of fluorinated materials, large volumes of unstructured textual data are generated, characterized by high heterogeneity and fragmentation. These issues hinder systematic knowledge integration and efficient utilization. Constructing a knowledge graph for fluorinated materials processing is essential for enabling structured knowledge management and intelligent applications. Among its core components, Named Entity Recognition (NER) plays an essential role, as its accuracy directly impacts relation extraction and semantic modeling, which ultimately affects the knowledge graph construction for fluorinated materials. However, NER in this domain faces challenges such as fuzzy entity boundaries, inconsistent terminology, and a lack of high-quality annotated corpora. To address these problems, (i) We first construct a domain-specific NER dataset by combining manual annotation with an improved Easy Data Augmentation (EDA) strategy; (ii) Secondly, we propose a novel model, RRC-ADV, which integrates RoBERTa-wwm for dynamic contextual word representation, adversarial training to improve robustness against boundary ambiguity, and a Residual BiLSTM (ResBiLSTM) to enhance sequential feature modeling. Further, a Conditional Random Field (CRF) layer is incorporated for globally optimized label prediction. Experimental results demonstrate that RRC-ADV achieves an average F1 score of 89.23% on the self-constructed dataset, significantly outperforming baseline models. The model exhibits strong robustness and adaptability within the domain of fluorinated materials. Our work enhances the accuracy of NER in the fluorinated materials processing domain and paves the way for downstream tasks such as relation extraction in knowledge graph construction.

Details

1009240
Business indexing term
Title
Domain-Specific NER for Fluorinated Materials: A Hybrid Approach with Adversarial Training and Dynamic Contextual Embeddings
Author
Lan, Jiming 1 ; Fu, Hongwei 1 ; Wu, Yadong 2 ; Liu, Yaxian 3 ; Dong, Jianhua 2 ; Liu, Wei 2 ; Chen, Huaqiang 2 

 School of Computer Science and Engineering, Sichuan University of Science and Engineering, Zigong, 644005, China 
 School of Computer Science and Engineering, Sichuan University of Science and Engineering, Zigong, 644005, China, Sichuan Engineering Research Center for Big Data Visual Analytics, Zigong, 644005, China 
 School of Computer Science and Engineering, Sichuan University of Science and Engineering, Zigong, 644005, China, Key Laboratory of Higher Education of Sichuan Province for Enterprise Informationalization and Internet of Things, Sichuan University of Science and Engineering, Zigong, 644005, China 
Publication title
Volume
85
Issue
3
Pages
4645-4665
Number of pages
22
Publication year
2025
Publication date
2025
Section
ARTICLE
Publisher
Tech Science Press
Place of publication
Henderson
Country of publication
United States
Publication subject
ISSN
1546-2218
e-ISSN
1546-2226
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-10-23
Milestone dates
2025-04-29 (Received); 2025-07-22 (Accepted)
Publication history
 
 
   First posting date
23 Oct 2025
ProQuest document ID
3270083981
Document URL
https://www.proquest.com/scholarly-journals/domain-specific-ner-fluorinated-materials-hybrid/docview/3270083981/se-2?accountid=208611
Copyright
© 2025. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-12-02
Database
ProQuest One Academic