Content area

Abstract

Unstructured scientific text plays a critical role in preserving, transferring, and developing research knowledge. Valuable outputs are often recorded in forms such as patents, research articles, and project reports. Unlike generic text, scientific literature usually follows specialized formats and terminology. This significant difference leads to greater challenges and opportunities for NLP (Natural Language Processing) researchers. To automate the process of extracting and structuring domain-specific knowledge from unstructured text, this dissertation addresses these challenges by leveraging NLP methods for automated materials science knowledge extraction.

Through three case studies, this dissertation explores the use of deep learning, LLM (Large Language Model) and prompt-based techniques to extract critical materials synthesis knowledge from scientific texts. Building on these efforts, the dissertation introduces an end-to-end, cost-effective framework designed for large-scale knowledge extraction with domain experts in the loop. The framework demonstrates how combining automated methods with light human guidance enables scalable, accurate, and efficient processing of materials science literature. Together, these contributions aim to mitigate key bottlenecks in scientific knowledge extraction and support the development of AI-ready materials data.

Details

1010268
Title
Large-Scale Materials Knowledge Extraction Using LLMs and Human-in-the-Loop
Number of pages
121
Publication year
2025
Degree date
2025
School code
0065
Source
DAI-A 86/12(E), Dissertation Abstracts International
ISBN
9798280774124
Committee member
An, Yuan; Uribe Romo, Fernando; Daniel, Ron
University/institution
Drexel University
Department
Information Science [Ph.D.] (College of Computing and Informatics)
University location
United States -- Pennsylvania
Degree
Ph.D.
Source type
Dissertation or Thesis
Language
English
Document type
Dissertation/Thesis
Dissertation/thesis number
32114495
ProQuest document ID
3219265383
Document URL
https://www.proquest.com/dissertations-theses/large-scale-materials-knowledge-extraction-using/docview/3219265383/se-2?accountid=208611
Copyright
Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
Database
ProQuest One Academic