GraphMatch: Knowledge Graphs for Allogeneic Stem Cell Matching

Abstract

Allogeneic bone marrow and umbilical cord stem cell transplants often offer the best hope for curing patients with blood cancers and other blood diseases. We establish the historical context by starting with World War II and the initial use of nuclear weapons when the world recognized the dangers of radiologic events and the following public health threats. These threats included blood cancers on an unimaginable scale. At that time, little was known about survivability or available treatments. Researchers at the University of Chicago, associated with the Manhattan Project, were among the first to identify the preservation of hematopoiesis as a primary therapy for otherwise deadly exposures. We introduce hematopoietic stem cell (HSC) transplantation, which has two distinct types: autologous and allogeneic.

Next, we discuss how matching patients to unrelated donors requires flexible and timely searches as the matching criteria evolve. Matching systems should scale to accommodate the diversity in patient and donor typing resolution as well as the increasing number of donors. We begin by describing the context of this work and the challenges associated with timely, scalable, and adaptable platform solutions. We introduce a current patient/donor matching method called HapLogic at the National Marrow Donor Program. HapLogic is a leading platform for this work, offering nearly interactive access to transplant patients worldwide.

In the second chapter of the thesis, we propose a novel approach to matching patients and donors for stem cell transplants. GraphMatch (GM) is a scalable graph database solution for storing and searching variable-resolution HLA genotype markers. For our test set, we expanded the World Marrow Donor Association (WMDA) validation set based on version 2.16 of the IPD-IMGT/HLA Database to create a synthetic production dataset comprising 1 million patients and 10 million donors. Single-patient identity search times range from 218.5 milliseconds per patient with 2 million donors to 1201.4 milliseconds per patient with 10 million donors. Search performance timing remained linear relative to the number of edges, even at a production scale.

In the third chapter, we anticipate practical extensions to the GraphMatch platform, allowing horizontally scalable performance and a flexible schema to accommodate additional search criteria. GraphMatch can also simulate additional matching algorithms, such as GRIMM and Hap-E. Ultimately, GM demonstrates the usefulness of graph databases as a flexible platform for scalable matching solutions.

Details

Subject

Computer science;
Biomedical engineering;
Bioinformatics

Classification

0984: Computer science
0541: Biomedical engineering
0715: Bioinformatics

Identifier / keyword

Stem cell; Blood cancers; Transplantation; World Marrow Donor Association; Allogeneic bone marrow

Title

GraphMatch: Knowledge Graphs for Allogeneic Stem Cell Matching

Author

Kunau, Timothy M.

Number of pages

Publication year

2025

Degree date

2025

School code

0130

Source

DAI-B 86/12(E), Dissertation Abstracts International

ISBN

9798286443291

Advisor

Myers, Chad

Committee member

Netoff, Tey; Pankratz, Nathan; Boley, Daniel

University/institution

University of Minnesota

Department

Computer Science

University location

United States -- Minnesota

Degree

Ph.D.

Source type

Dissertation or Thesis

Language

English

Document type

Dissertation/Thesis

Dissertation/thesis number

32046763

ProQuest document ID

3225299695

Document URL

https://www.proquest.com/dissertations-theses/graphmatch-knowledge-graphs-allogeneic-stem-cell/docview/3225299695/se-2?accountid=208611

Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.

Database

ProQuest One Academic

GraphMatch: Knowledge Graphs for Allogeneic Stem Cell Matching

Content area

Abstract

Details