Content area

Abstract

Span-based information extraction (SIE) is a set of natural language processing and information extraction tasks which aim to extract the span of interest from digital text and assign corresponding span classes that describe the nature of that text. SIE is essential yet challenging. On one hand, the development of SIE directly reflects natural language processing especially on text understanding. On the other hand, SIE can link digital text to knowledge base and knowledge graph entries, which can enhance the background information of the highlighted text.

In this thesis, I focus on SIE tasks with four parts. (1) Foundations of Span-based Information Extraction. This section outlines the concepts and history of this task.(2) Models of Span-based Information Extraction. This section introduces our presented three SIE models including Ask-and-Verify, EntGPT, and G3. (3) Applications of Span-based Information Extraction. This section introduces two applications of SIE including SIE for multi-choice question answering and SIE to enhance trust of plain text. (4) Limitations and Future Work Beyond Span-based Information Extraction. This section covers limitations of SIE and some directions for future work.

Details

1010268
Title
Span-Based Information Extraction and Beyond
Author
Number of pages
125
Publication year
2025
Degree date
2025
School code
0165
Source
DAI-A 86/12(E), Dissertation Abstracts International
ISBN
9798280702011
Committee member
Jiang, Meng; Zhang, Xiangliang; Dong, Luna
University/institution
University of Notre Dame
Department
Computer Science and Engineering
University location
United States -- Indiana
Degree
Ph.D.
Source type
Dissertation or Thesis
Language
English
Document type
Dissertation/Thesis
Dissertation/thesis number
31937707
ProQuest document ID
3215670312
Document URL
https://www.proquest.com/dissertations-theses/span-based-information-extraction-beyond/docview/3215670312/se-2?accountid=208611
Copyright
Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
Database
2 databases
  • ProQuest One Academic
  • ProQuest One Academic