Full text

Turn on search term navigation

© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Structure-based virtual screening approaches like molecular docking rely on accurately identifying and precisely calculating binding pockets to efficiently search for potential ligands. In this paper, we introduce GENEOnet, a machine learning model designed for volumetric protein pocket detection that employs Group Equivariant Non-Expansive Operators (GENEOs). These operators simplify model complexity and enable more informed domain knowledge integration by selecting specific physical and chemical properties for each operator to focus on, as well as how they should react. Unlike other methods in this field, GENEOnet has fewer model parameters, resulting in reduced training costs, and offers greater explainability, allowing the parameters to be easily interpreted. GENEOnet processes the empty space within a protein by converting it into a 3D grid of uniform blocks, known as ‘voxels’. It then identifies regions of the grid with an output value above a threshold, thus producing a list of predicted pockets, ranked according to the model’s average output value. Our experimental results show that GENEOnet performs robustly even with small training datasets of 200 proteins and surpasses other established state-of-the-art methods in various metrics. Specifically, GENEOnet’s score indicating the probability that the top-ranked pocket is the correct one is 0.764, compared to 0.702 for P2Rank, the next best performing algorithm on our PDBbind test set. Moreover, a case study considering various ABL1 kinase conformations demonstrates the excellent agreement between GENEOnet’s predictions and experimental sites. GENEOnet is available as a web service at https://geneonet.exscalate.eu, where users can access the pre-trained model for detecting and ranking protein cavities.

Details

Title
GENEOnet: a breakthrough in protein binding pocket detection using group equivariant non-expansive operators
Author
Bocchi, Giovanni 1 ; Frosini, Patrizio 2 ; Micheletti, Alessandra 1 ; Pedretti, Alessandro 3 ; Palermo, Gianluca 4 ; Gadioli, Davide 4 ; Gratteri, Carmen 5 ; Lunghini, Filippo 6 ; Biswas, Akash Deep 6 ; Stouten, Pieter F. W. 7 ; Beccari, Andrea R. 6 ; Fava, Anna 6 ; Talarico, Carmine 6 

 Department of Environmental Science and Policy, Università degli Studi di Milano, Via Celoria 10, 20133, Milano, Italy (ROR: https://ror.org/00wjc7c48) (GRID: grid.4708.b) (ISNI: 0000 0004 1757 2822) 
 Department of Computer Science, University of Pisa, Largo B. Pontecorvo 3, 56127, Pisa, Italy (ROR: https://ror.org/03ad39j10) (GRID: grid.5395.a) (ISNI: 0000 0004 1757 3729) 
 Department of Pharmaceutical Sciences, Università degli Studi di Milano, Via Mangiagalli 25, 20133, Milano, Italy (ROR: https://ror.org/00wjc7c48) (GRID: grid.4708.b) (ISNI: 0000 0004 1757 2822) 
 Department of Electronics, Information and Bioengineering, Politecnico di Milano, Via Ponzio 34/5, 20133, Milano, Italy (ROR: https://ror.org/01nffqt88) (GRID: grid.4643.5) (ISNI: 0000 0004 1937 0327) 
 LIGHT S.c.a.r.l., Via Branze 45, 25123, Brescia, Italy 
 Dompé Farmaceutici S.p.A., Via Tommaso de Amicis 95, 80145, Napoli, Italy 
 Stouten Pharma Consultancy BV, Kempenarestraat 47, 2860, Sint-Katelijne-Waver, Belgium 
Pages
34597
Section
Article
Publication year
2025
Publication date
2025
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3256960060
Copyright
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.