Content area

Abstract

The spatial organization of proteins within eukaryotic cells underlies essential biological processes and can be mapped by identifying nearby proteins using proximity-dependent biotinylation approaches such as BioID. When applied systematically to hundreds of bait proteins, BioID has localized thousands of endogenous proteins in human cells, generating a comprehensive view of subcellular organization. However, the need for large bait sets limits the scalability of BioID for context-dependent spatial profiling across different cell types, states, or perturbations. To address this, we develop a benchmarking framework with multiple complementary metrics to assess how well a given bait subset recapitulates the structure and coverage of a reference BioID dataset. We also introduce GENBAIT, a genetic algorithm-based method that identifies optimized bait subsets predicted to retain maximal spatial information while reducing the total number of baits. Applied to three large BioID datasets, GENBAIT consistently selected subsets representing less than one-third of the original baits while preserving high coverage and network integrity. This flexible, data-driven approach enables intelligent bait selection for targeted, context-specific studies, thereby expanding the accessibility of large-scale subcellular proteome mapping.

Proximity-dependent biotinylation maps protein locations using multiple bait proteins, limiting scalability. Here, the authors present GENBAIT, a computational tool that selects optimal baits to reduce the number of experiments required while preserving subcellular organization.

Details

1009240
Business indexing term
Title
Computational design and evaluation of optimal bait sets for scalable proximity proteomics
Author
Kasmaeifar, Vesal 1 ; Sedighi, Saya 1 ; Gingras, Anne-Claude 1   VIAFID ORCID Logo  ; Campbell, Kieran R. 2   VIAFID ORCID Logo 

 Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Sinai Health, Toronto, ON, Canada (ROR: https://ror.org/044790d95) (GRID: grid.492573.e) (ISNI: 0000 0004 6477 6457); Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada (ROR: https://ror.org/03dbr7087) (GRID: grid.17063.33) (ISNI: 0000 0001 2157 2938) 
 Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Sinai Health, Toronto, ON, Canada (ROR: https://ror.org/044790d95) (GRID: grid.492573.e) (ISNI: 0000 0004 6477 6457); Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada (ROR: https://ror.org/03dbr7087) (GRID: grid.17063.33) (ISNI: 0000 0001 2157 2938); Department of Statistical Sciences, University of Toronto, Toronto, ON, Canada (ROR: https://ror.org/03dbr7087) (GRID: grid.17063.33) (ISNI: 0000 0001 2157 2938); Department of Computer Science, University of Toronto, Toronto, ON, Canada (ROR: https://ror.org/03dbr7087) (GRID: grid.17063.33) (ISNI: 0000 0001 2157 2938); Ontario Institute of Cancer Research, Toronto, ON, Canada (ROR: https://ror.org/043q8yx54) (GRID: grid.419890.d) (ISNI: 0000 0004 0626 690X); Vector Institute, Toronto, ON, Canada (ROR: https://ror.org/03kqdja62) (GRID: grid.494618.6) (ISNI: 0000 0005 0272 1351) 
Publication title
Volume
16
Issue
1
Pages
9333
Number of pages
18
Publication year
2025
Publication date
2025
Section
Article
Publisher
Nature Publishing Group
Place of publication
London
Country of publication
United States
Publication subject
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-10-22
Milestone dates
2025-09-16 (Registration); 2024-10-10 (Received); 2025-09-15 (Accepted)
Publication history
 
 
   First posting date
22 Oct 2025
ProQuest document ID
3264095761
Document URL
https://www.proquest.com/scholarly-journals/computational-design-evaluation-optimal-bait-sets/docview/3264095761/se-2?accountid=208611
Copyright
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-10-23
Database
ProQuest One Academic