Content area

Abstract

Automated extraction of disaster-related named entities is crucial for gathering pertinent information during natural or human-made crises. Timely and reliable data is vital for effective disaster management, benefiting humanitarian response authorities, law enforcement agencies, and other concerned organizations. Online news media plays a pivotal role in disseminating crisis-related information during emergencies and facilitating post-hazard disaster response operations. To extract relevant named entities, contextual embedding features prove instrumental. In this study, we investigate the automatic extraction of disaster-related named entities from an annotated dataset of 1000 online news articles. These articles are carefully annotated with 14 crisis-specific entities obtained from relevant ontologies. To generate contextual vector representations of words, we construct a novel word embedding model inspired by Word2vec. These contextual word embedding features, combined with lexicon features, are encoded using a novel contextualized deep Bi-directional LSTM network augmented with self-attention and conditional random field (CRF) layers. We compare the performance of our proposed model with existing word embedding approaches. Through extensive evaluation on an independent test set of 200 articles that includes more than 80,000 tokens, our context-sensitive optimized NER model achieves impressive results at the sentence level. With a Precision of 92%, Recall of 91%, Accuracy of 87%, and an F1-score of 92%, our model outperforms those utilizing general and non-contextual word embeddings, including fine-tuned and contextual BERT models, showcasing its superior performance.

Details

1009240
Title
Accurate disaster entity recognition based on contextual embeddings in self-attentive BiLSTM-CRF
Publication title
PLoS One; San Francisco
Volume
20
Issue
3
First page
e0318262
Publication year
2025
Publication date
Mar 2025
Section
Research Article
Publisher
Public Library of Science
Place of publication
San Francisco
Country of publication
United States
e-ISSN
19326203
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Milestone dates
2024-03-21 (Received); 2025-01-14 (Accepted); 2025-03-26 (Published)
ProQuest document ID
3181720862
Document URL
https://www.proquest.com/scholarly-journals/accurate-disaster-entity-recognition-based-on/docview/3181720862/se-2?accountid=208611
Copyright
© 2025 Hafsa et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-03-28
Database
ProQuest One Academic