Harnessing NLP and Large Language Models for Pattern Discovery and Information Extraction in Electric Health Reports

Abstract

In this work, we report on a series of natural language processing tools and models to improve the efficiency and accuracy of information discovery from clinical trials and pharmacological studies. Our main contributions are:

1. The development of an open-source platform Tri-AL that

• Enables dynamic tracking of clinical trials information over time,

• Excels in data visualization and user interaction with a particular emphasis on enhancing the analysis and representation of race and ethnicity data to foster equity in clinical research, and

• Includes a predictive model utilizing machine learning to decipher drug mechanisms of action.

2. Heterogeneous Graph Neural Network for Gene-Chemical Entity Relation Extraction: We created a supervised deep learning model that adapts a heterogeneous Graph Neural Network to extract gene-chemical components. This model augments word representations using message passing that accurately identifies gene-chemical named entities and their relationships class.

3. Bipartite Graph Model for Evaluating Summarization Performance: We proposed a bipartite graph model to evaluate the performance of large language models in summarizing clinical trials. This model provides a robust framework to assess the accuracy and effectiveness of automated summarization tools in the medical domain.

Details

Subject

Engineering;
Biomedical engineering;
Bioinformatics;
Computer science

Classification

0537: Engineering
0541: Biomedical engineering
0715: Bioinformatics
0984: Computer science

Identifier / keyword

Natural language processing; Data visualization; Graph Neural Network; Bipartite graph model; Machine learning

Title

Harnessing NLP and Large Language Models for Pattern Discovery and Information Extraction in Electric Health Reports

Author

Esmail Zadeh Nojoo Kambar, Mina

Number of pages

109

Publication year

2024

Degree date

2024

School code

0506

Source

DAI-B 86/3(E), Dissertation Abstracts International

ISBN

9798384437598

Advisor

Taghva, Kazem

Committee member

Gewali, Laxmi; Bein, Wolfgang; Kang, Mingon; Regentova, Emma

University/institution

University of Nevada, Las Vegas

Department

Computer Science

University location

United States -- Nevada

Degree

Ph.D.

Source type

Dissertation or Thesis

Language

English

Document type

Dissertation/Thesis

Dissertation/thesis number

31489161

ProQuest document ID

3109723414

Document URL

https://www.proquest.com/dissertations-theses/harnessing-nlp-large-language-models-pattern/docview/3109723414/se-2?accountid=208611

Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.

Database

ProQuest One Academic

Harnessing NLP and Large Language Models for Pattern Discovery and Information Extraction in Electric Health Reports

Content area

Abstract

Details