Content area

Abstract

Mendelian Randomization (MR) has become a cornerstone approach for inferring causal relationships in epidemiological and genetic studies by leveraging genetic variants as instrumental variables (IV). Despite its popularity, conventional MR analyses, particularly those based on two-stage least squares (TSLS) and conducted within a single sample, face significant methodological challenges. These include selection-induced winner's curse and the pervasive problem of weak instruments and invalid IVs, all of which can undermine the reliability and interpretability of causal effect estimates.

To address these limitations, this dissertation develops a unified and robust MR framework through a sequence of methodological innovations. First, we introduce MR-SPLIT, a novel adaptive sample-splitting and cross-fitting procedure that effectively mitigates biases arising from IV selection and weak instruments in one-sample MR settings. MR-SPLIT employs multiple sample splits to further enhance robustness, demonstrating superior performance in bias reduction, type I error control, and statistical power compared to existing approaches, as validated in extensive simulation studies and real-world data applications. Building on this foundation, we further propose MR-SPLIT+, which integrates best subset selection to accommodate invalid IVs under a relaxed plurality rule. MR-SPLIT+ substantially reduces estimation bias due to invalid instruments while maintaining efficiency and robustness. Simulation results consistently demonstrate that MR-SPLIT+ outperforms contemporary methods, and real-data analyses confirm its practical reliability in complex genetic architectures. Recognizing that causal relationships are often bidirectional or ambiguous, especially within gene expression networks and complex traits, we extend this framework to BiMR-SPLIT+. This method is specifically designed to disentangle bidirectional causality between pairs of traits, even when the underlying IV assumptions are partially violated. Extensive simulation studies and application to Drosophila melanogaster data illustrate that BiMR-SPLIT+ not only recapitulates established biological mechanisms, but also identifies novel candidate genes with potential regulatory roles. This bidirectional MR framework enables more accurate inference of gene-trait relationships and has broad implications for precision medicine.

Collectively, this dissertation presents a cohesive suite of MR methodologies that systematically address weak and invalid IVs, IV selection bias, and bidirectional causality. The resulting toolkit substantially advances the reliability of causal inference in genetic epidemiology and lays the groundwork for future exploration in complex causal networks as large-scale human datasets continue to grow.

Details

1010268
Title
Robust Statistical Methods for Causal Discovery in One-Sample Mendelian Randomization Studies
Author
Number of pages
144
Publication year
2025
Degree date
2025
School code
0128
Source
DAI-B 87/1(E), Dissertation Abstracts International
ISBN
9798288819766
Advisor
Committee member
Xie, Yuying; Huang, Wen; Huebner, Marianne
University/institution
Michigan State University
Department
Statistics - Doctor of Philosophy
University location
United States -- Michigan
Degree
Ph.D.
Source type
Dissertation or Thesis
Language
English
Document type
Dissertation/Thesis
Dissertation/thesis number
32119110
ProQuest document ID
3229986530
Document URL
https://www.proquest.com/dissertations-theses/robust-statistical-methods-causal-discovery-one/docview/3229986530/se-2?accountid=208611
Copyright
Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
Database
ProQuest One Academic