Content area

Abstract

Motivation

The expansion of genetic association data from genome-wide association studies has increased the importance of methodologies like Polygenic Risk Scores (PRS) and Mendelian Randomization (MR) in genetic epidemiology. However, their application is often impeded by complex, multi-step workflows requiring specialized expertise and the use of disparate tools with varying data formatting requirements. Existing solutions are frequently standalone packages or command-line based—largely due to dependencies on tools like PLINK—limiting accessibility for researchers without computational experience. Given Python’s popularity and ease of use, there is a need for an integrated, user-friendly Python toolkit to streamline PRS and MR analyses.

Results

We introduce Genal, a Python package that consolidates SNP-level data handling, cleaning, clumping, PRS computation, and MR analyses into a single, cohesive toolkit. By eliminating the need for multiple R packages and for command-line interaction by wrapping around PLINK, Genal lowers the barrier for medical scientists to perform complex genetic epidemiology studies. Genal draws on concepts from several well-established tools, ensuring that users have access to rigorous statistical techniques in the intuitive Python environment. Additionally, Genal leverages parallel processing for MR methods, including MR-PRESSO, significantly reducing the computational time required for these analyses.

Availability and implementation

The package is available on Pypi (https://pypi.org/project/genal-python/), the code is openly available on Github with a tutorial: https://github.com/CypRiv/genal, and the documentation can be found on readthedocs: https://genal.rtfd.io.

Details

1009240
Title
Genal: a Python toolkit for genetic risk scoring and Mendelian randomization
Author
Rivier, Cyprien A 1   VIAFID ORCID Logo  ; Clocchiatti-Tuozzo, Santiago 1 ; Huo, Shufan 1 ; Torres-Lopez, Victor 1 ; Renedo, Daniela 1 ; Sheth, Kevin N 1 ; Falcone, Guido J 1 ; Acosta, Julian N 2 

 Department of Neurology, Yale School of Medicine , New Haven, CT 06510, United States 
 Department of Biomedical Informatics, Harvard Medical School , Boston, MA 02115, United States 
Publication title
Volume
5
Issue
1
Publication year
2025
Publication date
2025
Publisher
Oxford University Press
Place of publication
Oxford
Country of publication
United Kingdom
Publication subject
e-ISSN
26350041
Source type
Scholarly Journal
Language of publication
English
Document type
Report
Publication history
 
 
Online publication date
2024-12-24
Milestone dates
2024-09-16 (Received); 2024-12-19 (Accepted); 2024-11-13 (Rev-recd); 2025-01-07 (Corrected)
Publication history
 
 
   First posting date
24 Dec 2024
ProQuest document ID
3191363189
Document URL
https://www.proquest.com/scholarly-journals/genal-python-toolkit-genetic-risk-scoring/docview/3191363189/se-2?accountid=208611
Copyright
© The Author(s) 2024. Published by Oxford University Press. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-04-18
Database
ProQuest One Academic