GENEOnet: a breakthrough in protein binding

Full text

Turn on search term navigation

Introduction

Introduction to Exscalate platform

The field of drug discovery is undergoing a significant transformation with advancements in computational methods. The Exscalate platform, developed by Dompé, is at the forefront of this evolution. This high-throughput system allows for virtual screening and drug design, offering a faster and more cost-effective alternative to traditional experimental approaches. Exscalate’s strength lies in its ability to integrate multiple aspects of computational chemistry and biology, such as binding site generation, docking, toxicity prediction, and cheminformatics. By leveraging a unique chemical cartridge, the platform can process vast chemical libraries, analyzing trillions of compounds to identify potential drug candidates with precision^{1, 2–3}. A critical aspect of Exscalate is its capability to detect and analyze protein binding pockets regions on proteins where ligands can form stable interactions. Accurate identification of these sites is crucial in understanding how ligands bind, as their properties significantly impact the affinity and specificity of the interaction. As show in this paper, through the integration of machine learning and sophisticated molecular modeling, Exscalate can identify these critical sites even without clear experimental evidence. The platform’s efficiency is further demonstrated by LiGen, a tool for ultra-high-throughput docking that can process tens of trillions of compounds using advanced GPU technology. This capability showcases the robust computational power of the Exscalate platform and Dompé’s commitment to global health initiatives, such as the pro bono discovery of inhibitors for Covid-19 and Zika viruses^{4, 5–6}. The integration of these sophisticated computational techniques makes the Exscalate platform a significant advancement in drug design, promising to expedite the discovery of new therapeutics while reducing costs and broadening access to innovative treatments across the global healthcare landscape.

Introduction to protein pocket detection

The growing capabilities of computational technology have increasingly enabled in silico techniques to play a significant role in drug discovery and repositioning processes⁷. Various computational approaches⁸, such as structure-based drug design⁹, have emerged as effective tools for predicting binding affinity, identifying interacting substructures^10,11, and understanding the mechanism of action of pharmacologically active agents¹². A precise characterization of the binding pocket is essential for efficient docking calculations¹³. Blind docking analyses that involve the entire target protein tend to be less effective in finding correct ligand poses¹⁴. To overcome this challenge, researchers often rely on co-crystallized ligand-receptor complexes or site-directed mutagenesis studies^15,16 to identify binding sites. However, when such data are not available, accurately characterizing the binding site becomes more difficult, even with an experimentally resolved protein structure¹⁷. In these cases, identifying allosteric/accessory binding sites and assessing their druggability is crucial for optimizing virtual screening campaigns. To address this challenge, many algorithms have been developed to identify and characterize putative binding sites. These tools involve both geometric analysis of empty regions within a protein structure and physicochemical analysis to prioritize and identify the correct binding site(s). Geometric detection methods can be categorized into grid-based and grid-free approaches. Grid-based methods include algorithms that analyze grids of voxels (i.e. the 3D equivalent of pixels) to determine which points are within the protein structure and meet specific geometric and physicochemical requirements. Examples of grid-based methods include POCKET¹⁸, which scans a grid to detect patterns of protein-solvent-protein points in order to locate empty regions surrounded by protein atoms. CAVIAR¹⁹ further enhances this method by computing descriptors to predict binding site “ligandability”. CavVis²⁰, instead, exploits volumes to detect cavities in combination with an analysis of Gaussian surfaces to approximate the solvent-excluded surface. In contrast, grid-free approaches use spherical probes placed on the protein surface and clustered according to representative properties of candidate pockets. Other grid-free methods, such as Fpocket²¹, use alpha spheres to detect local curvatures on the protein surface. Due to recent computational advancements, machine learning and deep learning techniques are now being applied to binding site prioritization. P2Rank²² uses random forests to evaluate the capability of each surface point to bind a ligand. DeepSite²³ and DeepPocket²⁴, instead, employ Convolutional Neural Networks (CNNs).

Introduction to group equivariant non-expansive operators

Recent advancements in deep learning have highlighted the significance of equivariant operators in enhancing transparency, interpretability, and simplifying the training phase of neural networks^{25, 26, 27, 28, 29, 30, 31, 32–33}. Group Equivariant Non-Expansive Operators (GENEOs) are closely related to the concept of eXplainable Artificial Intelligence (XAI), which aims to develop methods that can be understood and trusted by humans^34,35. GENEOs have been introduced as elementary components to build new types of networks (for mathematical definitions and more details refer to^{36, 37–38}) by exploiting the possibility of combining different operators using suitable operations. Operators that process data are often compatible with specific geometric transformations, a property called equivariance. Image blurring is an example of such an operation, for example we can say that image blurring is equivariant with respect to rotations: we obtain the same result if we first blur an image and then rotate it as if we first rotate it and then blur it. Operators often possess additional properties, such as non-expansivity. Non-expansivity, among other benefits, guarantees stability to small perturbations of the input data. For example, if one blurs an image slightly corrupted by white noise, the result should not be very different from the blurring of the uncorrupted image. GENEOs networks combine equivariance and non-expansivity and allows for a kind of “geometric knowledge engineering” of such models enhancing transparency by integrating information into operators that process data while also reducing parameters involved with respect to non-equivariant models.

Structure of the paper

This paper presents an investigation into the application of the GENEO model, specifically GENEOnet, to identify protein pockets. The first section describes the sources of data used in this study (Section “Data sources”). We then introduce GENEOnet (Section “The GENEOnet model”), a network model that leverages empirical knowledge and exploits the equivariance properties of GENEOs to detect protein pockets. The specific problem of pocket detection is well-suited for GENEO-based approaches, as it relies on relevant empirical knowledge regarding binding site preference for lipophilic areas and hydrogen bonding opportunities. Moreover, pocket detection is equivariant to rotations and translations of the space, meaning that rotating or translating a protein does not alter its pockets but only their spatial orientation. Consequently, we define the GENEOnet network to be equivariant with respect to these transformations. The subsequent sections of this paper detail our methodology for training and selecting the GENEOnet model (Sections “Training”–“Model selection”) as well as comparing its performance with state-of-the-art methods (Section “Results”). We evaluate GENEOnet’s efficacy in both pocket identification (Section “Pocket identification and ranking”) volumetric goodness (Section “Overlap analysis”), distance metrics (Section “Distance metrics”), discussing the effects of equivariance and non-expansivity on its performance (Section “Equivariance and non-expansivity combined effect”), also by means of an ablation study (Section “Ablation study”). Moreover, we discuss an experiment regarding the computational times of the various methods (Section “Computational time evaluation”). Finally, a case study concerning kinases (Section “Structural analysis of ABL1 Kinase using GENEOnet”) highlights the coherence of GENEOnet predictions with experimental data. As additional validation, we compare GENEOnet with two other deep learning and generative AI methods on a representative example (Section “GENEOnet, TankBind and AF2BIND comparison on ABL1 Kinase”). Notably, Section “Webservice” presents the GENEOnet webservice. In addition to its technical merits, we highlight several advantages of GENEOnet, including a simpler model structure, fewer unknown parameters, and reduced data requirements for training. Notably, despite its relative simplicity, GENEOnet achieves superior results compared to methods relying on tens of thousands of trainable parameters. Finally, Section “Discussion and conclusions” presents our conclusions and provides a discussion of the implications of this work.

Materials and methods

This section introduces the mathematical model that we use to identify pockets. We also explain how the model was trained together with the datasets used for training, model selection and testing.

Data sources

During the training, validation, and testing of GENEOnet, two data sources were utilized:

The PDBbind v.2020 database³⁹. The PDBbind database provides binding affinity data for protein-ligand complexes deposited in the RCSB PDB⁴⁰. The aim of using the PDBbind database is to provide high-quality datasets for drug design methods. A total of 12,295 protein-ligand complexes were retrieved, we will refer to this set as BIND. This set, as explained in the following, is subdivided into three parts:
- TRAIN, which consists of 200 complexes sampled uniformly at random from the whole BIND. During the model selection phase, 200 sets of size 200 are considered as different TRAINs, eventually choosing the optimal one for deployment. To avoid reserving only part of BIND for sampling TRAIN proteins, we decided to sample them from the entire BIND dataset, this choice generates small intersections between each TRAIN and the sets defined in the next points.
- BINDVAL, which consists of 3073 complexes (approximately 25% of the whole BIND), is used for model selection. Each of the TRAIN sets considered during model selection has an average intersection of 50 proteins with BINDVAL. In particular, the TRAIN version that is chosen as optimal has an intersection of 48 proteins with BINDVAL.
- BINDTEST, which initially consists of 9222 complexes (approximately 75% of the whole BIND), is used for a first comparison of the model with other tools from the state-of-the-art. After training and model selection, the 152 molecules contained in the intersection between the chosen TRAIN and BINDTEST are removed. The final BINDTEST is made of 9070 complexes, and it is disjoint from both BINDVAL and the chosen TRAIN.
The RCSB PDB is the largest resource for experimentally determined biomolecular structures, releasing new data daily. From this data source, 41,519 complexes were retrieved. First of all, we removed all the complexes already contained in BINDTEST, moreover, we also removed all the complexes whose ligands are classified as post-translational modifications (PTMs), which are of small interest from a pharmaceutical perspective. After this, we obtained a set of 33,341 complexes that we will denote as BANK, which is completely disjoint from BINDTEST.

Fig. 1 [Images not available. See PDF.]

Datasets. Visual representation of the different datasets that will be considered. The chosen TRAIN set after model selection has an intersection of 48 proteins with BINDVAL, while BINDTEST, which is made of 6854 complexes, is completely disjoint from both of them. BANK is the largest set consisting of 28382 complexes and it is disjoint from BINDTEST. Proteins with large sequence identity with the training ones were removed from BINDTEST and BANK.

Figure 1 provides a graphical visualization of the relationships between the diverse datasets that we considered. Furthermore, all protein structures were initially preprocessed using the Schrödinger Protein Preparation Wizard (Schrödinger, The Schrödinger Software. 2020). Exclusively for complexes coming from PDBbind (in particular those of BINDTEST), only those protein chains within a certain distance from the ligand were kept to avoid repetitions of the true pocket. This choice will be better detailed in section “Metrics of interest”.

The GENEOnet model

Fig. 2 [Images not available. See PDF.]

Model workflow. The channels , computed from the PDB input file, are fed GENEOs that depend on the shape parameters , this first layer returns the intermediate outputs . These outputs are combined through convex combination with weights to get the final result . To obtain pockets, a thresholding operation with parameter is applied to , producing the binary function , which finally is compared to the ground truth through the loss function.

GENEOnet, whose architecture is depicted in Fig. 2, consists of five steps: a data preparation step, in which the input data are represented as functions defined on a grid of voxels; a GENEO layer, composed by operators which provides a first processing of the input data; a convex combination of the outputs of the GENEO layer, to obtain new GENEOs which provide a second and more refined processing; a thresholding step which gives a spatial prediction of presence or absence of a pocket; an evaluation step of a loss function which compares the output with the ground truth. The loss function is then optimized, using a training set, to identify the unknown parameters. A further step, applied after the training of the model, allows to compute the scoring of the identified pockets, and consequently their ranking. All the steps are described in detail in the following sections.

Data preparation

In the data preparation phase, we represent the protein-ligand complex stored in a PDB file as functions defined in the empty space surrounding the protein, which is discretized using a grid of cubic voxels. For each voxel, we compute approximations of the input functions, or channels . As shown in Table 1, such functions reflect a reasoned selection of geometrical, physical and chemical protein properties that are considered to be relevant for pockets detection by medicinal chemistry experts. Auxiliary software called GENEOprep (see Supplementary Information Section 1) was developed to automate the process of computing such channels. The co-crystallized ligand of a protein will be used in the evaluation step to define the true pocket (i.e. the ground truth function ) for the parameters identification.Table 1

List of potentials that have been used to build GENEOnet.

Name	Type	Notes
Distance	Geometrical	and are coordinates and radius of the nearest atom to the point x.
Gravitational	Geometrical	m(a) is the mass of atom a.
Electrostatic	Physical	q(a) is the partial charge of atom a.
Lipophilic	Chemical	l(a) is the lipophilic coefficient of the atom a if it is negative, 0 otherwise.
Hydrophilic	Chemical	h(a) is the lipophilic coefficient of the atom a if it is positive, 0 otherwise.
Polar	Chemical	p(a) is 1 if atom is polar, 0 otherwise.
HB Acceptor	Chemical	where and are parameters of the specific type of atom.
HB Donor	Chemical	where and are parameters of the specific type of atom, and are angles defined by triples of points involved in the bond.

In some cases, constants have been ignored because of the subsequent normalization. A summation over all the atoms of the protein would be computationally unfeasible, but, since many potentials depend on the inverse of the distance from x, in our computations we neglected atoms too far apart from x thus the sums have been computed only for , where is a suitable neighborhood of x.

GENEO layer

The channels computed in the grid of voxels are then fed to the layer of d GENEOs, , one per channel. Each operator is chosen from a specific parametric family, parametrized by a shape parameter . These families were designed to reflect the a priori knowledge of the experts of medicinal chemistry about the specific role of the corresponding potentials in the pocket identification. We opted for convolutional operators , where are normalized kernels in , symmetric with respect to the origin. This choice ensures that all the operators under consideration are indeed non-expansive and equivariant with respect to translations and rotations of the space. We set the parameters regulating the shape of each kernel, so that, also because of their central symmetry, each convolutional operator depends on a single real parameter only, which regulates the “amplitude” of the kernel itself. For the details about the specific kernels employed, refer to the Supplementary Information, Section 2.

Convex combination

In the fourth step the intermediate GENEO outputs are combined through a convex combination, with weights in order to obtain a composite operator , which is a new GENEO for each choice of the parameters. The output of the convex combination is then normalized to obtain the function , defined from to [0,1]. Here can be read as the likelihood that voxel x belongs to a pocket. The coefficients can be regarded as feature importance scores, highlighting the importance of each channel in the pocket identification and thus providing a useful tool to explain the results the model delivers.

Thresholding

Finally, given a threshold , we get the different pockets returned by the model by taking connected components of the set of voxels where is above . In this way, voxels located inside a pocket are labeled with the sequential number of the connected component they belong to, while they are labeled with 0 if they are not judged to belong to any pocket. To summarize, the model that was described so far has a total of 17 learnable parameters ( , and ).

Evaluation

For each crystallized complex, the ligand has been converted to the binary function that is equal to 1 on the voxels that (possibly partially) overlap with the ligand, and equal to 0 elsewhere. If we call the output of the model after thresholding, then we have to compare it to the ground truth represented by the binary function in order to asses the goodness of the prediction.

Training

In order to learn GENEOnet’s learnable parameters, we choose to optimize a loss function evaluating the volumetric matching of ground-truth and prediction . The loss function, which needs to be maximized, is defined below:

Here denotes the discretized volume, that is the number of voxels labelled with 1 inside the region, is a function equal to 1 on the intersection between the prediction and the true pocket , is a constant function equal to 1. All these functions are defined on the voxelized grid built around the protein. The hyperparameter k ranges in [0, 1]. We found that values in the range [0.01, 0.05] produce similar results, all characterized by a relatively small number of pockets of appropriate size (see the Supplementary Information, Section 3). Essentially, the choice of the loss function mitigates the imbalance in the number of voxels labeled as part of the pocket in the ground truth. The optimization of was performed using Adam optimizer. Random sets of 200 proteins uniformly sampled from BIND were used as training sets. We chose this size since empirical evidence showed that increasing the size of the training set did not significantly impact parameter estimates (see the Supplementary Information, Section 6).

Pocket scoring

In medicinal chemistry, identifying and prioritizing potential ligand-binding pockets is pivotal. This process, essential for streamlining virtual screening, involves evaluating pockets based on their potential to accommodate ligands. Although GENEOnet, as described till now, is able to detect pockets, it does not prioritize them. We can refine this by deriving scores from GENEOnet’s pre-thresholding output. These scores are calculated by averaging the function across voxels within each pocket and adjusting for pocket volume to prevent bias towards smaller pockets. This scoring yields a prioritized set of pockets, facilitating focused and efficient subsequent analyses.

Metrics of interest

To compare and select different models, we are interested in metrics that express the ability of a model to assign the highest score to the pocket that matches the true one. First of all, we say that a predicted pocket matches the true pocket if it has the largest overlap with the ground truth. By overlap, we mean the ratio between the discretized volume of the intersection and the volume of the true pocket. If no predicted pocket has an intersection with the true one, we say that the method failed on that protein. In this manner, when provided with a dataset of proteins, we can calculate a sequence of coefficients for . Fixing n as the protein-dependent number of true pockets, is the fraction of proteins whose ligand is identified within the first n-th predicted pockets. On the other hand, for are the proportions of proteins whose ligand is identified by the -th predicted pocket. Moreover, we can also consider cumulative sums of these proportions, we generate another sequence of coefficients for . We have that while if then represents the fraction of proteins whose true pocket has been successfully recognized within the first -th predicted pockets. See the Supplementary Information, Section 4, for the formal definitions of the and coefficients. In this way, different methods can be compared as follows: if a method shows higher for all then it is the best method. In an optimal situation, we would like to have a model with and for every . We want to remark that in the case each protein has exactly one true pocket (as in the case of BINDTEST due to the applied preprocessing), we can safely consider , since for every protein.

We also consider additional metrics that measure the distance between the predicted pocket centroid and the ground truth. In particular, DCA/PPC is the minimal distance between the ground truth and the centroid of the prediction; on the other hand, DCC is the distance between the centroid of the ground truth and that of the prediction. In literature²⁴, usually predictions with DCA/DCC values lower than 4Å are considered successful, and the fractions of successes at different thresholds are usually plotted. Although reporting the results for such metrics, we notice that, differently from the overlap-related ones, DCA and, more importantly, DCC do not take into account the shapes of the ground truth (which may be smaller than the actual pocket) and the predictions. Thus, they shouldn’t be trusted too much in the presence of non-convex pockets or pockets much larger than the ligand; they are well suited to evaluate small and ellipsoid-like pockets.

Model selection

To assess the reliability of the estimation process and determine the most accurate model for ranking pockets, the loss function L was optimized 200 times with different TRAIN sets, all of size 200, each time starting from the same initial guess for the model parameters. For each trained model, BINDVAL was used to calculate the coefficients; such results are reported in Supplementary Information, Section 7. Finally, the model having the highest coefficient on BINDVAL was chosen for deployment. The TRAIN set corresponding to such a model, as already stated in section “Data sources”, has a small intersection of size 48 with BINDVAL. We consider this intersection to be negligible, due to the relative size compared to BINDVAL and the fact that BINDVAL is only used to select the optimal version of GENEOnet from the 200 trained instances, not for comparison with other models. The chosen version of TRAIN will also be used in the following in the ablation study described in section “Ablation study”. Moreover, the optimal parameter values of the selected model and additional details about their interpretation can be found in Supplementary Information, Section 5.

Software and hardware details

The implementation of GENEOnet was achieved through the integration of two distinct software components. The first component comprises a C library, dedicated to computing the potential functions listed in Table 1, while the second module is written in Python. The Python code initiates the computation process by invoking the GENEOprep auxiliary software and the C library via a Cython extension, which enables the calculation of protein potentials. Subsequently, the resulting potentials are passed as input into the GENEOnet network, whose architecture was developed using PyTorch. In terms of computational efficiency, we report that running the optimization algorithm for 50 epochs on the TRAIN dataset results in a processing time of approximately six minutes when performed on a laptop with an NVIDIA GeForce RTX 3060 GPU. In contrast, the same task takes approximately 40 minutes to complete using only the CPU on the same laptop having an 8-core Intel^® Core^TM i7-10870H processor.

Results

This Section features the outcome of the experiments designed to evaluate the performance of GENEOnet, including benchmarking against the following state-of-the-art methods that could be accessed by us: Fpocket, P2Rank, DeepPocket, CAVIAR, CavVis.

Equivariance and non-expansivity combined effect

Fig. 3 [Images not available. See PDF.]

Global view of the prediction: each predicted pocket is shown in a different color and is labeled with its calculated score. In the lower left-hand region is the pocket that correctly identifies the true pocket in which the ligand is located. The ligand is slightly illuminated for better visualization.

Figure 3 shows GENEOnet output for protein (PDB ID 2QWE). This protein has four symmetrical units, resulting in four replicas of the true pocket thus we have . GENEOnet correctly identifies these symmetrical pockets and assigns them high scores (third to sixth top ranked. The fifth predicted pocket matches the ground truth, thus this would contribute to ). This is because results on similar, but differently oriented units, are guaranteed to be similar by the combined actions of equivariance and non-expansivity, moreover, we could expect such a result actually even before running the algorithm. This example highlights the positive effects of equivariance and non-expansivity, effects that are extremely beneficial in determining the robustness and trustworthiness of GENEOnet as already studied in^41,42. The ablation study of section “Ablation study”, instead, will focus on removing such properties in order to evaluate their impact on model training.

Pocket identification and ranking

Fig. 4 [Images not available. See PDF.]

coefficients. Bar chart of the coefficients computed on (a) BINDTEST and (b) BANK for the different methods. On both datasets GENEOnet has the highest value of .

The goal of this experiment is to compare different methods in their ability to accurately identify pockets that match the true one and to assign them high scores. Firstly, we report estimates of coefficients computed on BINDTEST in Fig. 4a. Secondly, the experiment was repeated on BANK and the results are shown in Fig. 4b. Numerical estimates of the coefficients can be found in Supplementary Information, Section 10.

Overlap analysis

After testing the scoring capabilities of the methods, we also compared the ability to identify and to rank on top, pockets that match the true one with high overlap. We computed the distributions of overlaps between the true pocket and the top ranked pockets for the compared methods (if the method does not hit the true pocket within the top ranked ones, the overlap is set to 0.0), again first for BIND and then for BANK datasets. Figure 5 shows the distributions of the overlaps using violin plots. The peak of the estimated density in correspondence with zero is related to the proportion of failures within the predicted pockets. The box plots inside the violins allow for the comparison of the quartiles of the overlap distribution as well.

Fig. 5 [Images not available. See PDF.]

Violin plots of the distributions of the overlaps between the ground truth and the predicted pockets within the top-ranked. The peak of the estimated density in correspondence of 0 highlights the number of failures within the predicted pockets.

Distance metrics

We present here the results of our analysis, which detail the percentages of proteins exhibiting DCA/DCC values below a range of thresholds extending from 4Å to 20Å as computed using both BINDTEST and BANK datasets. The reported DCA and DCC values are the minimum among the three top-ranked pockets for each method, consistent with the overlap analysis presented in the preceding section. Figure 6a–b show the curves for DCA, while Fig. 6c–d show those for DCC.

Fig. 6 [Images not available. See PDF.]

DCA and DCC curves. For each threshold value on the horizontal axis, the proportion of proteins with a DCA/DCC value (the best among the three top-ranked pockets) lower than the threshold is plotted. For the two metrics, the desired model is the one that exhibits the steepest ascent, showing larger success proportions at lower thresholds.

Ablation study

To further prove the key importance of equivariance and non-expansivity, we designed an ablation study to compare GENEOnet with other models sharing similar architectures but missing either one or both of the two properties. We will consider three models:

An equivariant and non-expansive model (GENEOnet, here E-NE for short).
A non-equivariant and non-expansive model (NE-NE for short).
A non-equivariant and potentially expansive model (NE-E for short).

For the NE-NE and NE-E models, we replaced the convolutional operators with rotationally invariant kernels of GENEOnet with general convolutional operators, featuring fully learnable kernels, identical in size to those used by GENEOnet. Additionally, for model NE-NE, we normalized the kernels using the norm, consistently with the approach employed by GENEOnet. All three models were trained using the TRAIN set chosen for GENEOnet, with initial parameter and hyperparameter values maintained at their original settings. For this ablation study, an additional test set was generated through sampling 200 proteins with sequence identity less than 80% relative to those in the TRAIN set. This test set was exclusively employed to evaluate potential overfitting of the models.

Fig. 7 [Images not available. See PDF.]

Ablation study: The training and test loss curves are plotted for the three models in the ablation study. Model NE-E is not really able to minimize the loss and learn effectively; model NE-NE learns well on the training set, but it clearly overfits, while model E-NE (GENEOnet) is the only one able to learn effectively while preventing overfitting.

Figure 7 illustrates the evolution of the logarithm of the loss function (1) during training epochs for each model, as indicated by continuous lines representing training loss and dashed lines representing test loss. Notably, model NE-E failed to learn effectively, minimizing the loss, while model NE-NE outperformed GENEOnet in terms of training results but was unable to avoid overfitting. Conversely, model E-NE (GENEOnet) demonstrated the capacity for learning effectively while preventing overfitting of the training data. These findings provide further evidence supporting the importance of equivariance and non-expansivity in achieving an accurate yet parsimonious model. Supplementary Information Section 9 presents additional figures comparing the predictions of each model on one training and one test example.

Computational time evaluation

To provide a more comprehensive evaluation of GENEOnet’s performance, an additional experiment was conducted to compare the computational costs of the models. Noting that grid-based models, such as GENEOnet, may incur higher computational overhead and larger memory requirements compared to models that exploit only the protein structure, we sought to investigate this aspect further. To do so, we followed this protocol: for each of the 10 protein size classes , where comprises proteins with a number of atoms ranging from 1000j to , we selected the first 100 representative proteins in BIND. Subsequently, for each protein, we ran the methods, this time focusing solely on the computational times.

Fig. 8 [Images not available. See PDF.]

Analysis of computational times. Panel (a) shows the distributions of the base-10 logarithm of the total computational times for the considered methods and for the different protein size classes. Panel (b) shows a comparison of inference times, once the protein potentials are computed, for GENEOnet and four Random Forest models trained to use GENEOnet potentials.

Figure 8a depicts the estimated distributions of the computational times for the different methods: as foreseeable, Fpocket is the fastest method while CavVis is the second fastest, among the ML approaches; instead, GENEOnet is the fastest method (par with CAVIAR) up to class . For larger proteins, DeepPocket and CAVIAR become faster but are still comparable to GENEOnet in terms of order of magnitude. Regarding P2Rank, the computational times are almost constant with respect to the protein size; this is likely due to the long P2Rank initialization process, while the actual inference is considerably faster. Anyway, among the considered methods, DeepPocket is the one closest to GENEOnet in terms of mechanism and architecture; thus, this analysis shows that GENEOnet speed is better or comparable to DeepPocket on the considered sample. Furthermore, we acknowledge that for GENEOnet, the phase having the highest computational load is the phase of computation of the potentials of Table 1. Although we developed a dedicated C library for this task, the cost of computing the potentials constitutes the large majority of GENEOnet’s computational time. Once the potentials are computed, the inference part of the network is quite fast, as evidenced by Fig. 8b. The plot shows the estimated distributions of the inference computational times for GENEOnet and four Random Forest models that use the same potentials of GENEOnet to generate a prediction for each voxel. The inference time of GENEOnet is lower than all the considered Random Forests. Additional details regarding the models and why GENEOnet’s inference cannot be replaced by a traditional ML approach like Random Forest are provided in n Information Section 9.

Structural analysis of ABL1 Kinase using GENEOnet

The performance of GENEOnet was evaluated on a case study concerning multiple X-ray structures of ABL1, both from human and mouse models, in their active and inactive conformations. For the active conformation, only complexes with Type 1 ligands were considered. The structures were aligned and mapped. This case study aimed to compare the pocket identified by GENEOnet with the experimental one, obtained as the space occupied by each ligand in a given structure. As shown in Fig. 9, GENEOnet exhibited excellent results in pocket prediction, capturing all the space occupied by experimental ligands. Moreover, in the active conformation (Fig. 9c), we observed that Type 1 inhibitors and ATP occupy only a portion of the super pocket predicted by GENEOnet. Notably, there is another unexplored region within this binding site that could be useful for enhancing ligand selectivity. Furthermore, in the inactive conformation, the predicted pocket included all experimental ligands, reinforcing the accuracy of GENEOnet in binding pocket prediction.

Fig. 9 [Images not available. See PDF.]

GENEOnet predictions for ABL1 kinase. (a,d) front and (b,e) back view of ABL1 aligned structures in the active and inactive conformations mapped with GENEOnet. (c–f) aligned pockets predicted by GENEOnet.

GENEOnet, TankBind and AF2BIND comparison on ABL1 Kinase

We have chosen to compare GENEOnet qualitatively with two other state-of-the-art deep learning and generative AI methods: TankBind⁴³ and AF2BIND⁴⁴. However, neither of these methods was designed to produce volumetric predictions, making a comprehensive comparison akin to those presented in previous sections impractical. TankBind is a method that predicts the conformation of a protein-ligand complex given the two structures, utilizing a Graph Neural Network (GNN) architecture to assess the binding affinity of the ligand within specific functional blocks extracted via P2Rank. As such, TankBind is not a ligand-agnostic approach and was initially compared to docking algorithms rather than pocket finders. Nevertheless, we deemed it worthy of qualitative comparison with GENEOnet due to its use of P2Rank, a method previously evaluated in our benchmarks, for prioritizing areas of the protein to be processed. AF2BIND, on the other hand, leverages AlphaFold2⁴⁵ pair features to predict the probability that each residue will contact a small-molecule ligand, given a target protein structure. Specifically, it employs a logistic regression model to assign binding probabilities P(bind) to individual residues. We selected the protein PDB ID 6HD6, an example of the ABL1 Kinase that was previously discussed in section “Structural analysis of ABL1 Kinase using GENEOnet” of our case study, as a test structure for evaluating GENEOnet, TankBind, and AF2BIND. The sequence identity of this protein is less than 30.5% with any of the proteins in the TRAIN set used to train GENEOnet, whereas we are uncertain whether it may be present in the training sets of the other two methods. Given these considerations, we applied each of the three methods to the 6HD6 structure predicted by AlphaFold2 and present a comparison of their results in Fig. 10.

Fig. 10 [Images not available. See PDF.]

Comparative analysis of GENEOnet, TankBind, and AF2BIND on the protein PDB ID 6HD6. Specifically panels (a) and (b) display the following elements: experimental ligands STI (located in the middle) and FYH (situated at the lower left), which are depicted in green; TankBind-placed ligands, shown in magenta; residues with a predicted probability of binding P(bind) greater than 0.5, as determined by AF2BIND, represented in blue; and GENEOnet’s first two predicted pockets (orange clouds), which scored 0.863 (middle) and 0.690 (lower left), respectively.

An examination of Fig. 10 reveals that GENEOnet successfully identifies both pockets containing the ligands, as well as ranking them among the top two predicted sites. This outcome was anticipated for the central pocket relative to STI, given our analysis in section “Structural analysis of ABL1 Kinase using GENEOnet”. Notably, GENEOnet’s performance is confirmed in identifying the pocket hosting FYH. In contrast, TankBind correctly places both ligands near their experimental binding sites; however, it is worth noting that this method relies on access to the ligand structures for its predictions. In contrast, both GENEOnet and AF2BIND operate as ligand-agnostic predictors. AF2BIND generates a probability of ligandability for each protein residue using logistic regression, which is the primary output of this method. Unfortunately, the authors do not provide a procedure for deriving a finite number of predicted pockets from this outcome. This uncertainty could impact drug design, as it requires accurate selection of the optimal druggable binding site. To facilitate a comparison between AF2BIND and GENEOnet, we have highlighted residues with a probability of ligandability P(bind) greater than 0.5, a common choice with logistic regression classification. Under these conditions, AF2BIND successfully identifies the STI pocket, albeit with an additional region that is larger than the one predicted by GENEOnet. Conversely, AF2BIND fails to identify the FYH pocket, although it can be detected if the threshold for P(bind) is lowered to approximately 0.3. This adjustment, however, results in a further enlargement of the STI pocket. In conclusion, our analysis indicates that TankBind produces the most precise predictions, albeit under the condition that both ligand structures are available. GENEOnet successfully identifies both pockets and ranks them among the top two predicted sites. AF2BIND, while capable of identifying both pockets at a low threshold for P(bind), is unable to detect the second pocket relative to FYH when using larger thresholds.

Webservice

GENEOnet webservice has been developed to be freely accessible to the scientific community. Figure 11a shows GENEOnet homepage, the “use it” option allows for the submission of the PDB code of the protein of interest (Fig. 11b). By submitting the code, the protein is retrieved from the Protein Data Bank along with every annotation available (Fig. 11c). After submitting the structure, protein pockets identification is performed via GENEOnet and findings are returned in the results table (Fig. 11d). Pockets are described in terms of druggability score, number of hydrogen bond acceptor (HBA) and hydrogen bond donor (HBD) atoms, lipophilicity, and polarity. Measures of the radius and pocket center are also provided. The “small” flag in results table shown in Fig. 11d is marked with an “X” if the pocket is considered small by GENEOnet.

Fig. 11 [Images not available. See PDF.]

Webservice snapshots.

Discussion and conclusions

In terms of pocket identification accuracy, GENEOnet outperforms all other methods evaluated in the comparison on both the BINDTEST and BANK datasets. Notably, when considering the coefficient on BINDTEST, our results indicate that GENEOnet achieves a value of 0.929, implying that in approximately 93% of cases, the correct pocket can be identified by selecting the top three ranked pockets. All other methods, instead, reach values of below 0.9. Similarly, for BANK, GENEOnet has the highest value of equal to 0.875. GENEOnet shows a higher number of failures on BANK than BINDTEST, however this should not be considered a problem firstly because all methods have very low failure rates, secondly it is essential to acknowledge that GENEOnet’s final model was chosen to maximise on BINDVAL, in fact, as seen in Fig. 5, when considering only the pockets ranked first, GENEOnet again outperforms the other methods by having the lowest number of failures. On the other hand, DeepPocket/Fpocket, which exhibit the overall lowest failure rate on both datasets, have significantly high failure rates when considering top-ranked pockets. Therefore, in the context of favoring recognitions within the top-ranked pockets, GENEOnet remains the best-performing method, even when considering failures. Regarding the overlap analysis, the experimental results in Fig. 5 show that GENEOnet has one of the most skewed distributions, favouring high overlaps. Additionally, as expected, GENEOnet’s distribution has the smallest number of zero overlap cases (in comparison to DeepPocket/Fpocket, which have similarly skewed distributions). The evaluation of DCA and DCC distances using GENEOnet reveals that it does not outperform other methods, such as P2Rank, DeepPocket, and CAVIAR. However, when considering DCA specifically, GENEOnet’s performance improves significantly and quickly with increasing success thresholds. In contrast, the growth in performance for DCC is slower, and the other methods are only surpassed at larger thresholds. Taking these results into account, we make two key observations. Firstly, it appears that DCA is somewhat insensitive to the ligand’s shape, as its calculation involves determining the minimal distance from the ligand to the centroid of the prediction. In contrast, DCC does not consider either the ligand or pocket shapes, merely computing the distance between the ligand centroid and the centroid of the prediction. Consequently, this metric may be less suitable for evaluating non-convex or larger pockets where the ligand occupies only a portion of the predicted space. From a molecular docking perspective, having slightly larger pockets relative to smaller ones with respect to the ligand is generally desirable. Therefore, while metrics like DCA and DCC are widely reported in the literature, they may not provide valuable insights into the evaluation of pocket finder algorithms that prioritize volumetric inclusion over strict adherence to ground truth (i.e., the cocrystallized ligand). Secondly, these findings can serve as a starting point for improving GENEOnet by incorporating subcavity identification capabilities, similar to those already implemented in CAVIAR. For example, this could be achieved through stricter criteria for identifying predicted units rather than simply considering connected components, as is currently done. We believe that further investigation into this approach may yield beneficial results and warrant future study. Another potential way forward involves the development of a hybrid approach combining several models in a mixture-of-experts approach. In this research work, the primary objective was to identify and select the optimal model based on different performance metrics through a systematic model selection methodology. Future work could involve exploring the integration of distinct GENEOnet models, each of which has been optimized to evaluate different but complementary aspects of the outcome (ranking, overlap, failures, etc.), thus capitalizing on their respective strengths to achieve better predictions and mitigating their weaknesses. As for the identification of subcavities, this deserves further study. In conclusion, results obtained in all the experiments confirm that GENEOnet is able to find and assign high scores to the most likely pockets for a given protein, also with a high overlap. It performs better than the other state-of-the-art models according to many metrics of interest. In cases where GENEOnet does not excel in terms of specific metrics, there are two possible explanations. Firstly, it may be that further refinements to the method itself are warranted, as evidenced by limitations associated with DCA/DCC results and possible subcavity identification. Alternatively, GENEOnet’s performance is on a par with other methods, such as when evaluating computational efficiency for larger proteins. Beyond solely assessing GENEOnet’s performance, this framework also possesses several additional properties that are relevant to its utility and interpretability. Firstly, GENEOnet can incorporate prior knowledge, for example, regarding the significance of lipophilicity properties in protein structure prediction. Furthermore, due to its equivariance, it exhibits insensitivity to irrelevant geometric factors such as the precise location and orientation of the protein, thereby enhancing its robustness. Additionally, empirical evidence from previous studies ^41,42 has demonstrated that GENEOnet is resilient to minor conformational changes in protein structure. In conclusion, this framework relies on a relatively small number of learnable parameters (only 17), which can be efficiently identified using minimal training data. In fact, as shown by the ablation study, compared to GENEOnet’s non-equivariant and expansive counterpart, GENEOnet requires less training data to converge. Finally, GENEOnet is an interpretable model by design.

Acknowledgements

Images of molecular structures have been generated using SAMSON: Integrative molecular design https://www.samson-connect.net. Computational resources were partially provided by the INDACO core facility for HPC at Università degli Studi di Milano.

Author contributions

G.B., P.F., A.M., A.P. conceived the idea; G.B. wrote the codes; G.B., A.P., F.L., C.G., G.P., D.G. processed the data; G.B., G.P., D.G. ran the experiments; C.G. and A.D.B. developed the case study; A.F. developed the webservice; A.R.B., C.T., P.F.W.S. supervised the experiments and acted as coordinators. All authors wrote and reviewed the manuscript.

Funding

Dompé Farmaceutici S.p.A. funded this project. Additionally, P. Frosini has been partially supported by INdAM-GNSAGA, and G. Bocchi and A. Micheletti by INdAM-GNAMPA.

Data availability

Protein data are derived from the following resources available in the public domain: PDBbind v2020—http://pdbbind.org.cn/ and RCSB Protein Data Bank—https://www.rcsb.org/ GENEOnet website, developed to ensure the scientific community can freely access the tool, is available at the address: https://geneonet.exscalate.eu. The website is based on LAMP software stack (Linux, Apache, MariaDB, PHP). The front-end interface is built on bootstrap 5, jQuery, and HTML5 doctype.

Declarations

Competing interests

A.R. Beccari, A. Fava, A.D. Biswas, F. Lunghini and C. Talarico are employees of Dompé Farmaceutici S.p.A.

References

Lunghini

Fava

<article-title>ProfhEX: AI-based platform for small molecules liability profiling

J. Cheminformatics202315160

1:CAS:528:DC%2BB3sXhtF2mtbzL

10.1186/s13321-023-00728-6

Manelfi

<article-title>“DompeKeys”: A set of novel substructure-based descriptors for efficient chemical space mapping, development and structural interpretation of machine learning models, and indexing of large databases

J. Cheminformatics202416121

10.1186/s13321-024-00813-4

Zian

Iaconis

<article-title>The efficiency of high-throughput screening (HTS) and in-silico data analysis during medical emergencies: Identification of effective antiviral 3CLpro inhibitors

Antivir. Res.2025237

1:CAS:528:DC%2BB2MXkvF2ht7c%3D

10.1016/j.antiviral.2025.106119

39978553

106119

Schimunek

Seidl

<article-title>A community effort in SARS-CoV-2 drug discovery

Mol. Inform.2024431

1:CAS:528:DC%2BB3sXisVSlsLbP

10.1002/minf.202300262

37833243

e202300262

Vistoli

Manelfi

<article-title>MEDIATE-Molecular DockIng at homE: Turning collaborative simulations into therapeutic solutions

Expert Opin. Drug Discov.202318821833

1:CAS:528:DC%2BB3sXhsVKjtrnO

10.1080/17460441.2023.2221025

37424369

12404243

Gadioli

Vitali

<article-title>EXSCALATE: An extreme-scale virtual screening platform for drug discovery targeting polypharmacology to fight SARS-CoV-2

IEEE Trans. Emerg. Topics Comput.202311170181

10.1109/TETC.2022.3187134

Shi

Chen

<article-title>A review of recent developments and progress in computational drug repositioning

Curr. Pharm. Design20202630593068

1:CAS:528:DC%2BB3cXhslKju73F

10.2174/1381612826666200116145559

Brogi

<article-title>Computational approaches for drug discovery

Molecules201924173061

1:CAS:528:DC%2BC1MXit1WmsbvN

10.3390/molecules24173061

31443558

6749237

Anderson

<article-title>The process of structure-based drug design

Chem. Biol.200310787797

1:CAS:528:DC%2BD3sXnslSqs7w%3D

10.1016/j.chembiol.2003.09.002

14522049

10.

Crisman

Sisay

<article-title>Ligand-target interaction-based weighting of substructures for virtual screening

J. Chem. Inf. Model.20084819551964

1:CAS:528:DC%2BD1cXhtFKqsrfN

10.1021/ci800229q

18821751

11.

Thafar

Bin Raies

<article-title>Comparison study of computational prediction tools for drug-target binding affinities

Front. Chem.20197782

2019FrCh....7..782T

1:CAS:528:DC%2BB3cXhtFOrs7nL

10.3389/fchem.2019.00782

31824921

6879652

12.

Barreca

Iraci

<article-title>Induced-fit docking approach provides insight into the binding mode and mechanism of action of HIV-1 integrase inhibitors

ChemMedChem2009414461456

1:CAS:528:DC%2BD1MXhtVKgtb%2FI

10.1002/cmdc.200900166

19544345

13.

Ghersi

Sanchez

<article-title>Improving accuracy and efficiency of blind protein-ligand docking by focusing on predicted binding sites

Proteins200974417424

1:CAS:528:DC%2BD1MXlsFSkug%3D%3D

10.1002/prot.22154

18636505

2610246

14.

Torres

PHM

Sodero

ACR

<article-title>Key topics in molecular docking for drug design

Int. J. Mol. Sci.2019204574

1:CAS:528:DC%2BB3cXovFSmsrk%3D

10.3390/ijms20184574

31540192

6769580

15.

Bachman, J. Site-directed mutagenesis. In Laboratory Methods in Enzymology: DNA, vol. 529 of Methods in Enzymology, 241–248 (2013).

16.

Mueller

<article-title>Guidelines for the successful generation of protein-ligand complex crystals

Acta Crystallogr. Sect. D-Struct. Biol.2017737992

2017AcCrD..73...79M

10.1107/S2059798316020271

17.

Klebe

<article-title>Virtual ligand screening: Strategies, perspectives and limitations

Drug Discov. Today200611580594

1:CAS:528:DC%2BD28XlvFGqtLo%3D

10.1016/j.drudis.2006.05.012

16793526

7108249

18.

Levitt

Banaszak

<article-title>POCKET—A computer-graphics method for identifying and displaying protein cavities and their surrounding amino-acids

J. Mol. Graph.199210229234

1:CAS:528:DyaK3sXhsVaqs7k%3D

10.1016/0263-7855(92)80074-N

1476996

19.

Marchand

J. R

Pirard

<article-title>CAVIAR: A method for automatic cavity detection, description and decomposition into subcavities

J. Comput.-Aided Mol. Des2021356737750

2021JCAMD..35..737M

1:CAS:528:DC%2BB3MXhtF2ls77K

10.1007/s10822-021-00390-w

34050420

20.

Simoes

TMC

Gomes

AJP

<article-title>CavVis-a field-of-view geometric algorithm for protein cavity detection

J. Chem. Inf. Model.201959786796

1:CAS:528:DC%2BC1MXotVSjsw%3D%3D

10.1021/acs.jcim.8b00572

30629446

21.

Le Guilloux

Schmidtke

<article-title>Fpocket: An open source platform for ligand pocket detection

BMC Bioinformatics2009101168

10.1186/1471-2105-10-168

19486540

2700099

22.

Krivak

Hoksza

<article-title>P2Rank: Machine learning based tool for rapid and accurate prediction of ligand binding sites from protein structure

J. Cheminformatics201810139

10.1186/s13321-018-0285-8

23.

Jimenez

Doerr

<article-title>DeepSite: Protein-binding site predictor using 3D-convolutional neural networks

Bioinformatics20173330363042

1:CAS:528:DC%2BC1cXhvFGju7nN

10.1093/bioinformatics/btx350

28575181

24.

Aggarwal

Gupta

<article-title>DeepPocket: Ligand binding site detection and segmentation using 3D convolutional neural networks

J. Chem Inf. Model.20226250695079

1:CAS:528:DC%2BB3MXhslegsLzK

10.1021/acs.jcim.1c00799

34374539

25.

Anselmi

Rosasco

<article-title>On invariance and selectivity in representation learning

Inf. Inference20165134158

3516856

26.

Bengio

Courville

<article-title>Representation learning: A review and new perspectives

IEEE Trans. Pattern Anal. Mach. Intell.20133517981828

2013ITPAM..35.1798B

10.1109/TPAMI.2013.50

23787338

27.

Anselmi

Evangelopoulos

<article-title>Symmetry-adapted representation learning

Pattern Recognit.201986201208

2019PatRe..86..201A

10.1016/j.patcog.2018.07.025

28.

Cohen, T. S. & Welling, M. Group equivariant convolutional networks. In Proceedings of the International Conference on Machine Learning, Vol. 48 (2016).

29.

Mallat

<article-title>Group invariant scattering

Commun. Pure Appl. Math.20126513311398

2957703

10.1002/cpa.21413

30.

Mallat

<article-title>Understanding deep convolutional networks

Philos. Trans. R. Soc. A-Math. Phys. Eng. Sci.2016374206520150203

2016RSPTA.37450203M

10.1098/rsta.2015.0203

31.

Worrall, D. E. et al. Harmonic Networks: Deep Translation and Rotation Equivariance. In Proceedings of CVPR, Vol. 2017, 7168–7177 (2017).

32.

Zhang, C. et al. Discriminative template learning in group-convolutional networks for invariant speech representations. In Proceedings of INTERSPEECH, Vol. 2015, 3229–3233 (2015).

33.

Micheletti

<article-title>A weighted test to detect the presence of a major change point in non-stationary Markov chains

Stat. Methods Appl.202029899912

4174691

10.1007/s10260-020-00510-0

34.

Rudin

<article-title>Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead

Nat. Mach. Intell.20191206215

10.1038/s42256-019-0048-x

35603010

9122117

35.

Carrieri

Haiminen

<article-title>Explainable AI reveals changes in skin microbiome composition linked to phenotypic differences

Sci. Rep.20211114565

2021NatSR..11.4565C

1:CAS:528:DC%2BB3MXlvV2ms7o%3D

10.1038/s41598-021-83922-6

33633172

7907326

36.

Bergomi

Frosini

<article-title>Towards a topological-geometrical theory of group equivariant non-expansive operators for data analysis and machine learning

Nat. Mach. Intell.20191423433

10.1038/s42256-019-0087-3

37.

Bocchi

Botteghi

<article-title>On the finite representation of linear group equivariant operators via permutant measures

Ann. Math. Artif. Intell.202391465487

4627271

10.1007/s10472-022-09830-1

38.

Bocchi

Ferri

Frosini

<article-title>A novel approach to graph distinction through GENEOs and permutants

Sci. Rep.2025156259

2025NatSR..15.6259B

1:CAS:528:DC%2BB2MXkslWltL4%3D

10.1038/s41598-025-90152-7

39979336

11842813

39.

Liu

<article-title>Forging the basis for developing protein-ligand interaction scoring functions

Accounts Chem. Res.201750302309

1:CAS:528:DC%2BC2sXit12qsLc%3D

10.1021/acs.accounts.6b00491

40.

Berman

Henrick

<article-title>Announcing the worldwide protein data bank

Nat. Struct. Biol.200310980

1:CAS:528:DC%2BD3sXptFOmsbY%3D

10.1038/nsb1203-980

14634627

41.

Bocchi, G., Frosini, P. et al. A geometric XAI approach to protein pocket detection. In Joint Proceedings of the xAI 2024 Late-breaking Work, Demos and Doctoral Consortium co-located with the 2nd World Conference on eXplainable Artificial Intelligence (xAI-2024), Valletta, Malta, July 17–19, 2024, vol. 3793, 217–224 (2024).

42.

Bocchi

Frosini

<article-title>GENEOnet: Statistical analysis supporting explainability and trustworthiness

Statistics20255910371062

4929721

10.1080/02331888.2025.2478203

43.

Lu, W. et al. TANKBind: Trigonometry-aware neural networks for drug-protein binding structure prediction. In Advances in Neural Information Processing Systems 35 (NeurIPS 2022), Advances in Neural Information Processing Systems (2022).

44.

Gazizov, A., Sergey, O. & Nicholas, P. AF2BIND: Prediction of protein-peptide and protein-ligand binding sites using AlphaFold. In Protein Science, Vol. 32 (2023).

45.

Jumper

<article-title>Highly accurate protein structure prediction with AlphaFold

Nature2021596583589

2021Natur.596..583J

1:CAS:528:DC%2BB3MXhvVaktrrL

10.1038/s41586-021-03819-2

34265844

8371605

Supplementary Information

The online version contains supplementary material available at https://doi.org/10.1038/s41598-025-18132-5.

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1. Lunghini, F; Fava, A et al. ProfhEX: AI-based platform for small molecules liability profiling. J. Cheminformatics; 2023; 15, 1 60.1:CAS:528:DC%2BB3sXhtF2mtbzL [DOI: https://dx.doi.org/10.1186/s13321-023-00728-6]

2. Manelfi, C et al. “DompeKeys”: A set of novel substructure-based descriptors for efficient chemical space mapping, development and structural interpretation of machine learning models, and indexing of large databases. J. Cheminformatics; 2024; 16, 1 21. [DOI: https://dx.doi.org/10.1186/s13321-024-00813-4]

3. Zian, D; Iaconis, D et al. The efficiency of high-throughput screening (HTS) and in-silico data analysis during medical emergencies: Identification of effective antiviral 3CLpro inhibitors. Antivir. Res.; 2025; 237, 1:CAS:528:DC%2BB2MXkvF2ht7c%3D [DOI: https://dx.doi.org/10.1016/j.antiviral.2025.106119] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/39978553]106119.

4. Schimunek, J; Seidl, P et al. A community effort in SARS-CoV-2 drug discovery. Mol. Inform.; 2024; 43, 11:CAS:528:DC%2BB3sXisVSlsLbP [DOI: https://dx.doi.org/10.1002/minf.202300262] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/37833243]e202300262.

5. Vistoli, G; Manelfi, C et al. MEDIATE-Molecular DockIng at homE: Turning collaborative simulations into therapeutic solutions. Expert Opin. Drug Discov.; 2023; 18, pp. 821-833.1:CAS:528:DC%2BB3sXhsVKjtrnO [DOI: https://dx.doi.org/10.1080/17460441.2023.2221025] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/37424369][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12404243]

6. Gadioli, D; Vitali, E et al. EXSCALATE: An extreme-scale virtual screening platform for drug discovery targeting polypharmacology to fight SARS-CoV-2. IEEE Trans. Emerg. Topics Comput.; 2023; 11, pp. 170-181. [DOI: https://dx.doi.org/10.1109/TETC.2022.3187134]

7. Shi, W; Chen, X et al. A review of recent developments and progress in computational drug repositioning. Curr. Pharm. Design; 2020; 26, pp. 3059-3068.1:CAS:528:DC%2BB3cXhslKju73F [DOI: https://dx.doi.org/10.2174/1381612826666200116145559]

8. Brogi, S. Computational approaches for drug discovery. Molecules; 2019; 24, 17 3061.1:CAS:528:DC%2BC1MXit1WmsbvN [DOI: https://dx.doi.org/10.3390/molecules24173061] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31443558][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6749237]

9. Anderson, A. The process of structure-based drug design. Chem. Biol.; 2003; 10, pp. 787-797.1:CAS:528:DC%2BD3sXnslSqs7w%3D [DOI: https://dx.doi.org/10.1016/j.chembiol.2003.09.002] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/14522049]

10. Crisman, TJ; Sisay, MT et al. Ligand-target interaction-based weighting of substructures for virtual screening. J. Chem. Inf. Model.; 2008; 48, pp. 1955-1964.1:CAS:528:DC%2BD1cXhtFKqsrfN [DOI: https://dx.doi.org/10.1021/ci800229q] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/18821751]

11. Thafar, M; Bin Raies, A et al. Comparison study of computational prediction tools for drug-target binding affinities. Front. Chem.; 2019; 7, 782.2019FrCh..7.782T1:CAS:528:DC%2BB3cXhtFOrs7nL [DOI: https://dx.doi.org/10.3389/fchem.2019.00782] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31824921][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6879652]

12. Barreca, ML; Iraci, N et al. Induced-fit docking approach provides insight into the binding mode and mechanism of action of HIV-1 integrase inhibitors. ChemMedChem; 2009; 4, pp. 1446-1456.1:CAS:528:DC%2BD1MXhtVKgtb%2FI [DOI: https://dx.doi.org/10.1002/cmdc.200900166] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/19544345]

13. Ghersi, D; Sanchez, R. Improving accuracy and efficiency of blind protein-ligand docking by focusing on predicted binding sites. Proteins; 2009; 74, pp. 417-424.1:CAS:528:DC%2BD1MXlsFSkug%3D%3D [DOI: https://dx.doi.org/10.1002/prot.22154] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/18636505][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2610246]

14. Torres, PHM; Sodero, ACR et al. Key topics in molecular docking for drug design. Int. J. Mol. Sci.; 2019; 20, 4574.1:CAS:528:DC%2BB3cXovFSmsrk%3D [DOI: https://dx.doi.org/10.3390/ijms20184574] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31540192][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6769580]

15. Bachman, J. Site-directed mutagenesis. In Laboratory Methods in Enzymology: DNA, vol. 529 of Methods in Enzymology, 241–248 (2013).

16. Mueller, I. Guidelines for the successful generation of protein-ligand complex crystals. Acta Crystallogr. Sect. D-Struct. Biol.; 2017; 73, pp. 79-92.2017AcCrD.73..79M [DOI: https://dx.doi.org/10.1107/S2059798316020271]

17. Klebe, G. Virtual ligand screening: Strategies, perspectives and limitations. Drug Discov. Today; 2006; 11, pp. 580-594.1:CAS:528:DC%2BD28XlvFGqtLo%3D [DOI: https://dx.doi.org/10.1016/j.drudis.2006.05.012] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/16793526][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7108249]

18. Levitt, D; Banaszak, L. POCKET—A computer-graphics method for identifying and displaying protein cavities and their surrounding amino-acids. J. Mol. Graph.; 1992; 10, pp. 229-234.1:CAS:528:DyaK3sXhsVaqs7k%3D [DOI: https://dx.doi.org/10.1016/0263-7855(92)80074-N] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/1476996]

19. Marchand, J. R; Pirard, B et al. CAVIAR: A method for automatic cavity detection, description and decomposition into subcavities. J. Comput.-Aided Mol. Des; 2021; 35, 6 pp. 737-750.2021JCAMD.35.737M1:CAS:528:DC%2BB3MXhtF2ls77K [DOI: https://dx.doi.org/10.1007/s10822-021-00390-w] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34050420]

20. Simoes, TMC; Gomes, AJP. CavVis-a field-of-view geometric algorithm for protein cavity detection. J. Chem. Inf. Model.; 2019; 59, pp. 786-796.1:CAS:528:DC%2BC1MXotVSjsw%3D%3D [DOI: https://dx.doi.org/10.1021/acs.jcim.8b00572] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30629446]

21. Le Guilloux, V; Schmidtke, P et al. Fpocket: An open source platform for ligand pocket detection. BMC Bioinformatics; 2009; 10, 1 168. [DOI: https://dx.doi.org/10.1186/1471-2105-10-168] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/19486540][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2700099]

22. Krivak, R; Hoksza, D. P2Rank: Machine learning based tool for rapid and accurate prediction of ligand binding sites from protein structure. J. Cheminformatics; 2018; 10, 1 39. [DOI: https://dx.doi.org/10.1186/s13321-018-0285-8]

23. Jimenez, J; Doerr, S et al. DeepSite: Protein-binding site predictor using 3D-convolutional neural networks. Bioinformatics; 2017; 33, pp. 3036-3042.1:CAS:528:DC%2BC1cXhvFGju7nN [DOI: https://dx.doi.org/10.1093/bioinformatics/btx350] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/28575181]

24. Aggarwal, R; Gupta, A et al. DeepPocket: Ligand binding site detection and segmentation using 3D convolutional neural networks. J. Chem Inf. Model.; 2022; 62, pp. 5069-5079.1:CAS:528:DC%2BB3MXhslegsLzK [DOI: https://dx.doi.org/10.1021/acs.jcim.1c00799] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34374539]

25. Anselmi, F; Rosasco, L et al. On invariance and selectivity in representation learning. Inf. Inference; 2016; 5, pp. 134-158.3516856

26. Bengio, Y; Courville, A et al. Representation learning: A review and new perspectives. IEEE Trans. Pattern Anal. Mach. Intell.; 2013; 35, pp. 1798-1828.2013ITPAM.35.1798B [DOI: https://dx.doi.org/10.1109/TPAMI.2013.50] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/23787338]

27. Anselmi, F; Evangelopoulos, G et al. Symmetry-adapted representation learning. Pattern Recognit.; 2019; 86, pp. 201-208.2019PatRe.86.201A [DOI: https://dx.doi.org/10.1016/j.patcog.2018.07.025]

28. Cohen, T. S. & Welling, M. Group equivariant convolutional networks. In Proceedings of the International Conference on Machine Learning, Vol. 48 (2016).

29. Mallat, S. Group invariant scattering. Commun. Pure Appl. Math.; 2012; 65, pp. 1331-1398.2957703 [DOI: https://dx.doi.org/10.1002/cpa.21413]

30. Mallat, S. Understanding deep convolutional networks. Philos. Trans. R. Soc. A-Math. Phys. Eng. Sci.; 2016; 374, 2065 20150203.2016RSPTA.37450203M [DOI: https://dx.doi.org/10.1098/rsta.2015.0203]

31. Worrall, D. E. et al. Harmonic Networks: Deep Translation and Rotation Equivariance. In Proceedings of CVPR, Vol. 2017, 7168–7177 (2017).

32. Zhang, C. et al. Discriminative template learning in group-convolutional networks for invariant speech representations. In Proceedings of INTERSPEECH, Vol. 2015, 3229–3233 (2015).

33. Micheletti, A et al. <article-title>A weighted test to detect the presence of a major change point in non-stationary Markov chains

Stat. Methods Appl.; 2020; 29, pp. 899-912.4174691 [DOI: https://dx.doi.org/10.1007/s10260-020-00510-0]

Word count: 9128

Show less

© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Translate

Structure-based virtual screening approaches like molecular docking rely on accurately identifying and precisely calculating binding pockets to efficiently search for potential ligands. In this paper, we introduce GENEOnet, a machine learning model designed for volumetric protein pocket detection that employs Group Equivariant Non-Expansive Operators (GENEOs). These operators simplify model complexity and enable more informed domain knowledge integration by selecting specific physical and chemical properties for each operator to focus on, as well as how they should react. Unlike other methods in this field, GENEOnet has fewer model parameters, resulting in reduced training costs, and offers greater explainability, allowing the parameters to be easily interpreted. GENEOnet processes the empty space within a protein by converting it into a 3D grid of uniform blocks, known as ‘voxels’. It then identifies regions of the grid with an output value above a threshold, thus producing a list of predicted pockets, ranked according to the model’s average output value. Our experimental results show that GENEOnet performs robustly even with small training datasets of 200 proteins and surpasses other established state-of-the-art methods in various metrics. Specifically, GENEOnet’s score indicating the probability that the top-ranked pocket is the correct one is 0.764, compared to 0.702 for P2Rank, the next best performing algorithm on our PDBbind test set. Moreover, a case study considering various ABL1 kinase conformations demonstrates the excellent agreement between GENEOnet’s predictions and experimental sites. GENEOnet is available as a web service at https://geneonet.exscalate.eu, where users can access the pre-trained model for detecting and ranking protein cavities.

Details

Title

GENEOnet: a breakthrough in protein binding pocket detection using group equivariant non-expansive operators

Author

Bocchi, Giovanni¹; Frosini, Patrizio²; Micheletti, Alessandra¹; Pedretti, Alessandro³; Palermo, Gianluca⁴; Gadioli, Davide⁴; Gratteri, Carmen⁵; Lunghini, Filippo⁶; Biswas, Akash Deep⁶; Stouten, Pieter F. W.⁷; Beccari, Andrea R.⁶; Fava, Anna⁶; Talarico, Carmine⁶

¹ Department of Environmental Science and Policy, Università degli Studi di Milano, Via Celoria 10, 20133, Milano, Italy (ROR: https://ror.org/00wjc7c48) (GRID: grid.4708.b) (ISNI: 0000 0004 1757 2822)
² Department of Computer Science, University of Pisa, Largo B. Pontecorvo 3, 56127, Pisa, Italy (ROR: https://ror.org/03ad39j10) (GRID: grid.5395.a) (ISNI: 0000 0004 1757 3729)
³ Department of Pharmaceutical Sciences, Università degli Studi di Milano, Via Mangiagalli 25, 20133, Milano, Italy (ROR: https://ror.org/00wjc7c48) (GRID: grid.4708.b) (ISNI: 0000 0004 1757 2822)
⁴ Department of Electronics, Information and Bioengineering, Politecnico di Milano, Via Ponzio 34/5, 20133, Milano, Italy (ROR: https://ror.org/01nffqt88) (GRID: grid.4643.5) (ISNI: 0000 0004 1937 0327)
⁵ LIGHT S.c.a.r.l., Via Branze 45, 25123, Brescia, Italy
⁶ Dompé Farmaceutici S.p.A., Via Tommaso de Amicis 95, 80145, Napoli, Italy
⁷ Stouten Pharma Consultancy BV, Kempenarestraat 47, 2860, Sint-Katelijne-Waver, Belgium

Pages

34597

Section

Article

Publication year

2025

Publication date

2025

Publisher

Nature Publishing Group

e-ISSN

20452322

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1038/s41598-025-18132-5

ProQuest document ID

3256960060

GENEOnet: a breakthrough in protein binding pocket detection using group equivariant non-expansive operators

Jump to:

Full text

Abstract

Details

Suggested sources