Full text

Turn on search term navigation

© 2024. This work is licensed under https://creativecommons.org/licenses/by/4.0/" target="_blank">https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Background:Systematic literature review (SLR), a robust method to identify and summarize evidence from published sources, is considered to be a complex, time-consuming, labor-intensive, and expensive task.

Objective:This study aimed to present a solution based on natural language processing (NLP) that accelerates and streamlines the SLR process for observational studies using real-world data.

Methods:We followed an agile software development and iterative software engineering methodology to build a customized intelligent end-to-end living NLP-assisted solution for observational SLR tasks. Multiple machine learning–based NLP algorithms were adopted to automate article screening and data element extraction processes. The NLP prediction results can be further reviewed and verified by domain experts, following the human-in-the-loop design. The system integrates explainable articificial intelligence to provide evidence for NLP algorithms and add transparency to extracted literature data elements. The system was developed based on 3 existing SLR projects of observational studies, including the epidemiology studies of human papillomavirus–associated diseases, the disease burden of pneumococcal diseases, and cost-effectiveness studies on pneumococcal vaccines.

Results:Our Intelligent SLR Platform covers major SLR steps, including study protocol setting, literature retrieval, abstract screening, full-text screening, data element extraction from full-text articles, results summary, and data visualization. The NLP algorithms achieved accuracy scores of 0.86-0.90 on article screening tasks (framed as text classification tasks) and macroaverage F1 scores of 0.57-0.89 on data element extraction tasks (framed as named entity recognition tasks).

Conclusions:Cutting-edge NLP algorithms expedite SLR for observational studies, thus allowing scientists to have more time to focus on the quality of data and the synthesis of evidence in observational studies. Aligning the living SLR concept, the system has the potential to update literature data and enable scientists to easily stay current with the literature related to observational studies prospectively and continuously.

Details

Title
Accelerating Evidence Synthesis in Observational Studies: Development of a Living Natural Language Processing–Assisted Intelligent Systematic Literature Review System
Author
Manion, Frank J  VIAFID ORCID Logo  ; Du, Jingcheng  VIAFID ORCID Logo  ; Wang, Dong  VIAFID ORCID Logo  ; Long, He  VIAFID ORCID Logo  ; Wang, Siwei  VIAFID ORCID Logo  ; Eckels, David  VIAFID ORCID Logo  ; Cervenka, Jan  VIAFID ORCID Logo  ; Fiduccia, Peter C  VIAFID ORCID Logo  ; Cossrow, Nicole  VIAFID ORCID Logo  ; Yao, Lixia  VIAFID ORCID Logo 
First page
e54653
Section
Methods and Instruments in Medical Informatics
Publication year
2024
Publication date
2024
Publisher
JMIR Publications
e-ISSN
22919694
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3120521977