Content area

Abstract

As data in a database management system (DBMS) becomes increasingly complex, there is a growing need for sophisticated analytical techniques—such as deep neural networks—capable of extracting meaningful insights. However, these methods are often compute-intensive, necessitating the use of specialized accelerators such as the graphic processing unit (GPU) to achieve practical performance.

Given this trend, modern DBMSs frequently integrate GPU-based query execution. For a good performance, we argue that query planning must be approached judiciously, ensuring that advanced algorithms are employed cautiously so as not to abuse the underlying hardware. Once an optimal plan is devised, the next critical step is to refine its implementation to leverage the GPU’s computational resources fully. By combining judicious query planning with optimized execution, we can unlock the full potential of modern hardware accelerators in database systems.

My first research endeavor focuses on improving the query plan to avoid overusing compute intensive algorithms for machine learning-based queries, particularly queries in video DBMSs. FiGO proposes a fine-grained approach where just-enough accurate deep learning algorithms are used to analyze the data without compromising accuracy. To pick the right algorithm, FiGO uses an ensemble of algorithms, evaluates all algorithms on sampled data, and applies the fastest but yet still accurate algorithm to analyze. Aero proposes to apply the classic adaptive query processing (AQP) framework on those emergent queries. It utilizes runtime statistics to dynamically reorder predicates that involve deep learning to minimize the query processing time.

My second research endeavor focuses on improving hardware utilization, particularly GPUs in DBMSs. The GPU database systems characterization and optimization paper analyzes the resource utilization of GPUs during query execution and their performance bottlenecks. Based on the roofline analysis, we propose optimizations to reduce the total data access and increase the query execution concurrency to improve the performance, delivering better performance than the state-of-the-art systems. As part of Aero, we also extend the AQP framework to also promote the GPU utilization during machine learning (ML) query executions, which previously was not done through the AQP framework. Lastly, to facilitate the recent demand of using the large language model (LLM) in DBMSs, we propose tailored buffer management policy called LIRS-M to better utilize the GPU memory resource for a good performance.

Details

1010268
Title
Enhance Database Query Performance Through Software Optimization and Hardware Adaptation
Number of pages
183
Publication year
2025
Degree date
2025
School code
0078
Source
DAI-B 87/5(E), Dissertation Abstracts International
ISBN
9798263326289
Committee member
Mahajan, Divya; Rong, Kexin; Sen, Rathiji; Siddiqui, Tarique
University/institution
Georgia Institute of Technology
University location
United States -- Georgia
Degree
Ph.D.
Source type
Dissertation or Thesis
Language
English
Document type
Dissertation/Thesis
Dissertation/thesis number
32308062
ProQuest document ID
3278115110
Document URL
https://www.proquest.com/dissertations-theses/enhance-database-query-performance-through/docview/3278115110/se-2?accountid=208611
Copyright
Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works; open.access
Database
ProQuest One Academic