Content area

Abstract

As the application of Machine Learning (ML) models continues to expand across various domains, it becomes increasingly important to ensure the fairness of these models' function. While measuring fairness metrics is straightforward in the classification domain, it remains complex and computationally intractable in regression. To address the computational intractability challenge, prior research proposed various methods to approximate fairness metrics in regression, but their consistency remain an open question. To fill this gap, this dissertation investigates the consistency of fairness measurement methods in regression tasks through the following studies. The first study examines the consistency of the outcome of various fairness measurement methods. The experimental results reveal varying levels of consistency, with some methods, particularly, the probabilistic classification-based density ratio estimation approach, exhibit relatively poor consistency in certain cases. Then, the second study focuses on the probabilistic classification-based density ratio estimation approach to fairness measurement and explores the sensitivity of its outcome to the choice of underlying classifiers. Results demonstrate that the use of different classifiers could impact fairness values, leading to inconsistent measurements in certain circumstances. The third study analyzes alternative density ratio measurement approaches beyond the probabilistic classification-based one. The experimental results indicate concerning inconsistencies among various density ratio estimation-based approaches, raising fundamental questions about their reliability for fairness measurement in regression. To gain deeper insight, the fourth study investigates whether data distributions could contribute to such inconsistencies by generating synthetic datasets with varying distributions. The findings suggest that inconsistencies may indeed arise from the data distribution in certain cases.

Details

1010268
Business indexing term
Title
Evaluation of Machine Learning Fairness in Regression Domain
Number of pages
84
Publication year
2025
Degree date
2025
School code
1283
Source
DAI-B 86/11(E), Dissertation Abstracts International
ISBN
9798314892060
Committee member
Desai, Kevin; Akter, Taslima; Lee, Dongwon
University/institution
The University of Texas at San Antonio
Department
Computer Science
University location
United States -- Texas
Degree
Ph.D.
Source type
Dissertation or Thesis
Language
English
Document type
Dissertation/Thesis
Dissertation/thesis number
32037838
ProQuest document ID
3204642804
Document URL
https://www.proquest.com/dissertations-theses/evaluation-machine-learning-fairness-regression/docview/3204642804/se-2?accountid=208611
Copyright
Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
Database
ProQuest One Academic