1. Introduction
ABC (ATP Binding Cassette) proteins play crucial roles in diverse biological functions from bacteria to man [1,2]. Importantly, human ABC transmembrane proteins are involved in several pathophysiological processes [1,3,4]. Mutations can affect their folding, assembly, trafficking, plasma membrane stability, and function, leading to decreased functional expression [5,6]. Therefore, understanding the effect of mutations on their structure and dynamics is highly important. Several family members extrude xenobiotics (e.g., toxic molecules and therapeutic drugs) from the cell, thus, learning the substrate recognition mechanism of these multidrug transporters can improve drug development and may prevent unwanted drug interactions [2,7]. Consequently, determining the atomic level 3D structure of ABC proteins is a widely researched area [8,9,10,11,12]. For example, various steps of structure-based drug design, such as selecting the appropriate protein target, understanding its pathological mechanism at the molecular level, and developing a new therapeutic substance or redesigning a drug require the knowledge of the target protein structure at the atomic level [13].
The major protein structure determination method was X-ray crystallography for several decades [14]. This technique requires vast resources, especially in the case of transmembrane proteins, which are the target of a large fraction of prescription drugs [15,16]. The examined protein needs to be isolated and purified in a large quantity and the process of crystallization requires a trial-and-error approach [17]. Furthermore, it is rare to obtain structural information on the membrane environment from the crystal. In recent years, cryo-electron microscopy (cryo-EM) emerged and became the ultimate structure determination method for transmembrane proteins [18]. The cryo-EM maps may also include information about the membrane environment [19]. Despite these recent advances in the structure determination of transmembrane proteins, they only make up ~5% of all the protein structures in the Protein Data Bank, while close to 50% of prescription drugs target transmembrane proteins [15,16].
Researchers utilized enormous resources in the last decades to predict protein structures from sequence information [20]. In recent years, an algorithm based on deep learning, AlphaFold2, emerged with remarkable accuracy in this task [21]. Its high quality and speed of structure prediction resulted in a vast database of predicted structures, the AlphaFold Protein Structure Database, which currently holds close to 1,000,000 protein structures [22]. AlphaFold2 (AF2) and AlphaFold-Multimer are serving now as ultimate tools for learning protein structures that would be hard to resolve experimentally, such as protein complexes and transmembrane (TM) proteins. Although AF2 was not specifically trained to predict transmembrane proteins, we have recently demonstrated on ABC proteins that this novel deep learning method is able to provide high-quality structures of transmembrane proteins [23]. RoseTTAFold [24] and trRosetta [25] have also been published as high-accuracy structure predictors, based on various deep learning approaches.
ABC proteins are an excellent group of proteins to study AF2 performance on TM proteins [23] since their TM domains are not conserved because of their heterogeneous transport functions. Therefore, the structures of their transmembrane domains exhibit various scaffolds that currently enable transmembrane ABC structures to be clustered into nine distinct structural classes (Figure S1) [26]. In contrast, the nucleotide-binding domains (NBDs) of ABC proteins are highly conserved and contain Walker sequences and the ABC signature or fingerprint [8]. Notably, an ATP binding site is formed by the Walker A and B from one NBD and signature from a second NBD. Therefore, the functional form of a TM ABC protein includes two NBDs and TMDs. These domains can be encoded in one polypeptide chain in the case of full transporters, in two chains in the case of half transporters working in dimeric forms, or in four chains [27,28]. The structures of ABC proteins can be observed in various conformations [8,29]. For example, proteins with Pgp-like structures (Figure 1) exhibit a widely open conformation towards the intracellular space, capable of substrate binding, in the absence of ATP. When two ATP molecules are bound, the NBDs close and form a tight interaction, and the extracellular ends of TM helices open to provide a dissociation site towards the extracellular space. These conformations are associated with the steps of the alternating access mechanism [9,30]. Other types of proposed mechanisms for ABC proteins include the peristaltic or credit card swipe mechanisms. For transporters operating with these mechanisms, the aforementioned two distinct conformations cannot be distinguished so promptly [8,31,32].
Although various structures of numerous ABC proteins were determined in the last years using cryo-electron microscopy, 25 of the 44 physiologically and pathologically important human ABC transmembrane proteins lack experimental structures in the PDB. The AlphaFold DB, co-developed by EMBL-EBI and DeepMind [22], supplemented the set of resolved transporters, albeit only full-length transporters, since only single-chain structures are available from this database. While the significant increase in the number of reliable structures for transmembrane proteins is a welcome change, the fact that the structures have to be collected from multiple data providers poses a potential challenge to performing comparative computational and experimental studies. Although both PDB and AF2 structures can be searched with a structure for homologous ones using the DALI server [33], it cannot reliably distinguish different ABC TM folds (
2. Results
2.1. Connections between Transmembrane ABC Proteins Structures and Pfam Profiles
Running our pipeline yielded 325 PDB structures (as of 29 December 2021). We selected the top-scoring TMD matches of our Pfam searches as the base for classification (Figure 2). We found only three incorrect biological unit files (3wme, 3wmf, and 3wmg), which contained only half of the functional unit of an ABC transporter (1 NBD and 1 TMD). Instead of numbered structural classes [26], we use names related to the most known representative member of a class (e.g., Pgp-like and BtuCD-like) (Figure S1), since we find this not only more informative than numbers, but the recently renumbered classes cause confusions in publications of the last decades (e.g., type I ABC exporter in the old system corresponds to type IV transporter structure in the recently proposed system [26]). In addition, the numbering strongly suggests an order of the structural families (e.g., an ordering based on the evolutionary level of TM domains) but that is not the case.
The investigation of sequence and structure conservation led to important and interesting information regarding this diverse protein family. The HMM search showed matches in five Pfam clans (Figure 2). The ABC_membrane clan includes four PFAM families, exhibiting Pgp-like structures. Of these four families, most of the existing experimental structures belong to the ABC_membrane family. The ABC_membrane2 family contains ABCD [37,38] and a mycobacterial ABC transporter, Rv1819c [39] structures, while no experimental structure is associated with the ABC_membrane3 family. However, trRosetta and AlphaFold2 predictions also suggested a Pgp-like architecture for members of this family (e.g.,
The ABC-2 clan includes proteins that exhibit, or potentially exhibit ABCG2-like structures. Most of the related experimental structures belong to the ABC2_membrane and ABC2_membrane_3 families including ABCG and ABCA proteins, respectively. The ABC2_membrane Pfam entry matches the first five TM helices of ABCG proteins. The exclusion of TM6 is most likely caused by the non-conserved extracellular loop between TM5 and TM6. TM1 was also not matched in most ABCA TMD by the ABC2_membrane_3 profile, which involves a portion of their large extracellular loop (Figure S3). Experimental ABCG2-like structures (6xjh and 6xji) in the ABC2_membrane_2 family were determined for the same protein, PmtCD, exporting bacterial toxins [40]. The remaining families involve trRosetta or AlphaFold2 computational models. Most of the sequences of these proteins incorporate transmembrane domains and some other domains (e.g., Sodium/calcium exchanger, AAA, Dihydrodipicolinate synthetase, and Phosphotransferase enzyme in the CcmB and DUF3533 Pfam families), which are not characteristic for ABC proteins. These proteins may or may not function without NBDs and are good targets for studying the evolution of ABC proteins. In contrast to other ABC-2 clan members, the Pfam profile of ABC2_membrane_7, PDR_CDR, and YitT_membrane (PDB ID: 3hlu) does not match membrane domains, but soluble regions instead. In addition, the AF2-predicted TMD of YitT (e.g.,
The BPD_transp_1 clan includes four families with diverse structures (Figure 2 and Figure S1). Three Pfam families include proteins with experimental structures belonging to different structural classes (MacB-, Bce-, LptFG-, and MalFG-like). Notably, the known Bce-like structures (PDB IDs: 7tcg and 7tch [41]) involve two transmembrane domains with MacB-like and Bce-like folds on a single polypeptide chain. The fourth family, DUF1430, is defined by a soluble domain. In these proteins, cytosolic domains likely prohibit the binding of NBDs to the TMDs, thus they may not be considered real ABC proteins.
Members of both the ECF_trnsprt family (Gx_transp clan) and the CbiQ family (no parent clan) exhibit an EcfT-like structure. The FecCD and ABC-3 families in the Membrane_trans clan possess BtuCD-like structures.
2.2. Conformational States of ABC Protein Structures
Two major conformational states of ABC proteins, called inward-facing and outward-facing, can be distinguished [9,30]. Although this categorization is slightly simplified, since a lot of inward-facing transporters are in an occluded or some intermediate state, we aimed to group the structures focusing on these two states for the sake of automation. Since the outward-facing state is related to the ATP-induced association of the two NBDs, theoretically we could divide the structures into these two conformational states based on the presence or absence of nucleotides in the structure. However, there are several structures with bound nucleotides (e.g., 6z5u, 7oj8, 7ojh, and 7oz1), exhibiting separated NBDs (Figure S4). For example, in the 6z5u structure, the non-hydrolyzable ATP analogue, AppNHpANP, was not sufficient to trigger the transition from bottom-open to bottom-closed state [42]. As another example, a recent ABCG2 structure was determined under turnover conditions and the presence of ATP did not drive a complete closure of the two NBDs (Figure S4) [43].
Since the presence or absence of nucleotides proved to be an inefficient way of categorizing structures, we then tried to do so by calculating the RMSD between each individual structure and two typical reference structures representing either inward- or outward-facing conformations of the structural family the protein belongs to (Table S1). All the structures and their labels were manually validated after classification. We found a higher number of mislabeled structures only in the Pgp-like family, where 18 out of 118 inward-facing structures were classified as outward-facing. These RMSD calculations also uncovered some flaws in currently used structural alignment tools demonstrated in Figure S5.
We have developed 3D vector-based measures, conftors, to describe the relative orientation of domains and highlight structural differences in ABC protein structures [44]. Since the inward- and outward-facing conformation correlates with the level of NBD association (Figure 1 and Figure S1), we defined |conftor(WA/SIG)|, the distance between the Walker A in one of the NBDs and the signature motif from the other NBD to classify the conformations (Figure 1). For this definition, two residues from the Walker A (GXXXXGKT) and the signature (LSGGQ) sequences were selected based on their spatial location. Since these Lys and Gly are located at helix ends and not in loops, noisy fluctuations in the distance between their Cα atoms were prevented. These positions were identified by aligning the query NBDs to a reference NBD (Pgp NBD1 from 6c0v) using the TM-align algorithm and not by simple sequence search, since there are many ABC proteins with degenerate, non-conserved ATP-binding sites (e.g., CFTR/ABCC7 possesses a degenerate signature motif [45,46]). The distribution of the distances between Walker A and signature calculated for all structures (Figure 3) indicated a cutoff value of 14 Å for distinguishing bottom-closed and bottom-open conformations. Indeed, visual inspection of structures with|conftor(WA/SIG)| = 14 ± 2 Å confirmed this value as a rational cutoff. We aimed to identify the source of very large NBD separation, thus we compared the distance values of X-ray and cryo-EM structures. Large NBD distances were observed in structures of both datasets (Figure S6).
In spite of the above limitations, we found this type of structural classification of inward-facing and outward-facing categories important, since the opening level and the dynamic alterations in the opening affect the substrate access to the central binding pocket [47]. This is clearly evident in the case of some Pgp-like structures, which exhibit an ATP-bound conformation with a well-defined, large, outward open cavity (Figure S7) [48]. In contrast, the difference in the TMDs of ABCG structures is subtle between the apo and ATP-bound states (Figure S7). Although the TM helices are reorganized upon ATP binding, no large opening on their extracellular end is observed. It is also important to note that the static ABC structures cover different parts of a continuous space of possible conformations. This is highly prominent in the case of inward-facing conformations, which exhibit a wide range of opening levels (between 14 and 86 Å) and also various conformational states of the two NBDs relative to each other. We highlight two examples associated with this issue. First, ABCG2 (PDB ID: 6vxf) has an additional conformation named inward-facing-closed [49]. In this ATP-bound conformation, the NBDs are separated, but the intracellular ends of some TM helices are somewhat more closed than in the inward-facing conformation (Figure S7). Although the difference between TM helix ends is subtle, it makes the large drug binding cavity in the TM region inaccessible from the intracellular space. However, since the overall conformation is almost identical to the inward-facing ABCG2 structures (RMSD = 1.736 Å) we classified this structure as inward-facing. Second, the NBDs in ECF proteins show a closed conformation both in the ATP-bound (5d3m) and apo structures (4huq, 4rfs, 4hzu, 5jsz, 5x3x, 5x41, 6fnp, 6zg3, 7nnt, and 7nnu), which is most likely caused by their special TMDs and an unconventional transport mechanism [50,51,52,53]. In other types of ABC transporters, each TM domain possesses at least one coupling helix, responsible for transmitting the conformational changes from the NBDs to TMDs. In ECF transporters, the EcfT TM domain provides two long coupling helices, contacting with the two NBD/ATPase domains and stabilizing the complex [52]. This structural feature contributes to keeping the NBDs and EcfT in close proximity to each other during the dissociation of the substrate binding EcfS TM domain and probably prohibits a large separation of NBDs even in the absence of ATP. Most of the WA/SIG distance values (19 out of 2 × 11) of ECF transporters are between 14 and 19 Å and three are slightly below 14 Å.
2.3. ABC Protein Structures Predicted by AlphaFold2
The AlphaFold Protein Structure Database contains predicted structures for most of the protein sequences in the UniProt database. We collected ABC protein structures from this AF database following the same procedure as in the case of the experimentally determined structures from the PDB, namely by using Pfam searches. However, since the AlphaFold DB only contains monomeric structures, we gathered only full transporters, which encoded two NBDs and two TMDs within a single polypeptide chain. As a result, the hits belonged only to Pgp-like and ABCG2-like structural classes. While the open structures were overrepresented in the AF-prediction of 21 proteomes (Figure 4), the numbers of open/closed conformations are more balanced in the prediction of the SwissProt dataset (101 vs. 414 and 163 vs. 136 in the Pgp-like and ABCG2-like families). It is important to note that the bottom-closed, ATP-bound conformation may represent a short-lived state, resulting in a lower number of ATP-bound structures in the experimental dataset. Since these experimental structures served as the AF training set, they possibly biased the AF-predictions toward bottom-open conformations.
We noticed that there is no available experimental or AlphaFold-predicted structural data for some human dimeric ABC proteins. Since we found it critical to generate the functional forms of human ABC protein structures that can facilitate both learning their structure/function relationship and drug development, we predicted dimeric human ABC structures. Previously, we have shown that AlphaFold2 provided high-quality structures for both homodimeric and heterodimeric ABC proteins [23]. However, AlphaFold-Multimer has recently been released, and it has a significantly better performance on heterodimeric complexes when compared to the original AF2 [36]. Using AF-Multimer, we performed test multimer runs for ABC dimers with known structures (e.g., ABCG5/ABCG8, PDB IDs: 5do7, 7jr7, 7r87-7r8b) that yielded high-quality structures (Table S2). Then we performed predictions for proteins with no available structures in the PDB or which had longer unresolved regions (Table S2). For example, several ABCB family members possess an additional N-terminal TM domain (TMD0) which is not resolved or exhibits low resolution (Figure S8). Interestingly, in our AlphaFold predictions, there were no contacts between the core and TMD0, but the additional TMD0 helices were rationally oriented according to a putative membrane bilayer.
One of the most exciting tasks was to test if AF-Multimer was able to build the mitoSUR complex. Sulfonylurea receptors (SUR1/ABCC8 and SUR2/ABCC9) are octamers of four SURs and four inward-rectifying potassium Kir channels (Figure 5) [54,55]. Similar to ABCC8 and ABCC9, ABCB8 was found to regulate potassium current [56]. ABCB8 functions in the inner membrane of mitochondria and mitoK (alternative name: Coiled-coil domain-containing protein 51, CCDC51) were identified as its potassium channel partner [56]. In order to build the complex structure, we submitted eight copies of the half transporter ABCB8 and four copies of mitoK sequences to AF-Multimer. No reasonable structures were built. We also predicted the structure of four mitoK proteins without ABCB8. A complex was built, in which the pore was delineated by coiled-coil regions and the putative transmembrane helices were not aligned to match a lipid bilayer (Figure 5). Thus, the overall structure did not meet expectations of what a potassium channel should look like. Known potassium channels, even with low sequence similarity, exhibit two TM helices and a reentrant loop between these TM regions, of which the loop provides the selectivity filter with a characteristic amino acid pattern (TMxTVGYG) [57,58]. We could not find any similar potassium selectivity pattern in mitoK using less stringent regular expression patterns. There are two possible explanations for this phenomenon. On one hand, mitoK may be an additional regulatory factor for the potassium current regulated by ABCB8 and not the potassium channel itself. This seems unlikely, since the ABCB8/mitoK purified and reconstituted from bacteria exhibited potassium currents [56], and it is difficult to imagine this complex carrying a third protein (a potassium channel) during that complicated process. On the other hand, mitoK may represent a novel type of potassium channel, but AF-Multimer was not able to produce a meaningful prediction, since no previously known structure resembled this fold in its learning set. This is also unlikely in the light of our earlier study, demonstrating that AF2 was able to predict novel transmembrane folds not included in the learning and template sets [23].
2.4. The ABC3D Database
Since it is not trivial to access a list or a sublist of ABC proteins neither from the PDB nor from the AlphaFold Structural Database, we created a web application that contains the currently available full-length ABC structures (
One structural class or human subfamily can be selected by a single click, as well as structures with a specific conformation. Using the main Select button, more complex searches can be performed. Selections can be narrowed by not only structural class and conformation, but also by taxonomy, gene and protein names, and release dates. Each structure can also be selected or unselected individually. Three-dimensional structures can be visualized in the browser using Mol* [59] which helps the selection process. All structures are aligned to their corresponding reference structure which helps in comparison.
Selected structures and associated data can be downloaded as pdb files and a tab-separated tsv format, respectively. mmCif files are also available for bulk download. All of the files are packed into a zipped file. Users can also opt to download only the metadata file. The main Download menu makes getting pre-assembled files for each structural class possible.
2.5. Novel ABC Structures Accessible through the 3D-Beacons Network
In order to increase the visibility of our AF2-predicted ABC structures, we expose them through the 3D-Beacon Network (
3. Discussion
With the growing number of available structures in 3D databases, there is an increasing need for simple accession of several or all structures of a specific protein family. We developed a web application providing this search functionality for transmembrane ABC proteins and our approach can serve as a prototype for accessing bulk structural data per protein families. Based on user feedback, it will be improved and standardized to build a general framework for accessing the 3D structural data of any protein family. The focus of the further developments will be on the high-level automation of various steps (e.g., classification, structure collection, alignment). In this study, we automated the categorization of open/closed structures that was based on our manual curation of 325 experimental structures. This categorization has a limitation of hiding a finer grained clustering of structures reflecting also on occluded or various intermediate states. The automatic detection of these states requires challenging text and 3D data mining methods and will be addressed in future studies. However, the experimental conditions during structure determination may influence the conformationally flexible ABC protein structures [60,61], thus correct classification. It also remained unresolved for the static ABC structure sets, and how to address protein dynamics [60]. Extraction of protein motion using molecular dynamics simulations with the collected ABC structures is highly resource intensive [47,62]. Simplified approaches (e.g., normal mode analysis based on various network models [63,64]) may provide useful information on the dynamics of various conformers.
In addition to existing ABC structures, we exposed novel, AlphaFold-generated, dimeric structures of half ABC transporters through the implementation of the 3D-Beacons Client (
Our results also highlighted some issues associated with the increasing number of available structures of proteins and protein complexes. First, characteristic conserved regions and structures should be identified more precisely than current approaches allow. Deep learning methods may allow domain identification with higher accuracy than an HMM profile search [65]. Second, high performance methods with novel and automatic logical processes are required to analyze and compare the structures of protein complexes efficiently (Figure S5). Our structure predictions strengthen our previous results [23] that AF models of TM proteins can be as high quality as that of soluble proteins. AlphaFold-Multimer also performs outstandingly for multimers, including TM ABC proteins. It provides higher performance for homomeric interfaces than for heteromeric interfaces which is presumably because MSA of homomeric proteins readily encodes evolutionary information about the complex interfaces, while this information for heteromeric interfaces is more limited and harder to access [36]. However, careful investigation of individual structures is required to help assess the model quality and also to detect hints for interesting and important biological questions, such as the case of the mitoSUR/mitoK complex (Figure 5). This example emphasizes various current limitations of AF ABC models. Besides the single chain predictions of otherwise multimeric proteins deposited to the AlphaFold Structural Database, the models do not involve ions, glycosylation, inhibitors, and substrate molecules that will be likely solved in the near future [66,67]. The predicted structures are also static and lack information on protein dynamics, except for regions with low pLDDT scores, indicating highly dynamic disordered parts [21]. However, the spatial information associated with these regions cannot be used in the case of many ABC proteins, since these regions possess pure predictive values and unrealistically cross the lipid bilayer region in some predicted structures [23]. In order to alleviate these issues for human ABC proteins, we made the set of human ABC protein structures complete with dimeric AlphaFold predictions, where unrealistic disordered regions were removed, which opens the way to merge and improve our ABC mutation database (
4. Methods and Materials
4.1. Databases and Associated Software
Experimental structures and associated sequences (release on 29 December 2021) were downloaded from RCSB. AlphaFold predicted structures of 21 proteomes were retrieved from the AlphaFold Structure Database in July 2021 [22]. These sets were used for analysis in this study, except for a few cases, which were raised after updating our database with current experimental structures and the AF structures covering proteins in SwissProt.
ABC PFAM entries were identified at
4.2. Identification and Classification of TM ABC Proteins
Entries containing at least one TM Pfam hit and at least two NBD Pfam hits in the PDB and AF dataset sequences were selected and classified as transmembrane ABC proteins. Based on this comprehensive collection, corresponding biological unit files (.pdb1) and structural prediction files (.pdb) were downloaded from the PDB and AlphaFold Structural Database, respectively. The downloaded structures were then classified into structural families based on both Pfam hits and structural similarity to selected reference structures, listed in Table S1. From the sequence search results, PDB entries that included at least two NBDs and one TMD in the biological unit file (encoded in one or more chains) were selected as functional ABC protein structures. Then, TM-align was performed aligning every single structure to the reference structures, using the TM-score for characterizing structural similarity and to assure the reliability of the classification [72]. We considered two structures belonging to the same structural class if the TM-score resulting from their alignment was above 0.6, e.g., the Bce-like family emerged as a MacB Pfam hit with low structural similarity to the MacB reference structure (TM-score of 0.49).
To further classify the structures into inward-facing and inward-closed conformations, we used the distance between the Cα atom of the terminating amino acid of the Walker A helix (GXXXXGKT) in one of the NBDs and the signature helix (LSGGQ) in the other NBD. Since the signature sequence can be degenerate and cannot be recognized by sequence search, we identified these positions based on structural alignments with the Pgp NBD1 from 6c0v, using TM-align [72]. Structures were analyzed using MDAnalysis 2.0.0. [73] and NumPy 1.19.4. [74].
4.3. A Pipeline for Automatic Updates
In order to keep our collection updated, a Python script checks the sequence files deposited in PDBe and AlphaFold DB weekly. In case of changes, the aforementioned steps are executed automatically on the sequence of new entries. If the algorithm finds two NBD Pfam hits and no TMD hit in the sequences of the chains, we manually check the corresponding structure to avoid missing structures with novel transmembrane folds. Additional data needed for classification and the determination of conformational states is collected from cif files and UniProt [75]. The collected information is stored in a PostgreSQL database version 12.11 (
4.4. Web Applications
Both the 3dbeacon and the abc3d web applications are based on FastAPI 0.71.0 (
4.5. Running AlphaFold2
AlphaFold initial release and v2.0.0 were downloaded from github and installed as described (
4.6. Data Visualization
Molecular visualization was performed using PyMOL (The PyMOL Molecular Graphics System, Version 2.4.0 Schrödinger, LLC). Graphs were generated using Python 3.7.’s matplotlib 3.5.2. library [78].
Conceptualization: H.T. and T.H.; Methodology: E.S., I.S., S.N., M.V. and T.H.; Software: E.S., I.S., S.N., M.V. and T.H.; Formal Analysis: H.T., E.S. and T.H.; Writing—Original Draft Preparation: E.S. and T.H.; Writing—Review & Editing, all authors; Funding Acquisition, T.H. All authors have read and agreed to the published version of the manuscript.
Not applicable.
Not applicable.
Web applications are available at
We thank the Wigner Scientific Computing Laboratory (WSCLAB, the former Wigner GPU Laboratory) for computational resources.
The authors declare no conflict of interest.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure 1. Inward-facing ((a), PDB ID: 7a69) and inward-closed ((b), PDB ID: 6c0v) structures of ABCB1/MDR1/Pgp. Light and dark blue colors: TM domains; pink and hot pink: NBDs; green and pale green: the last residue in the Walker A and ABC signature helices, respectively; black: the conftor/distance between the selected amino acids from Walker A and signature.
Figure 2. Pfam and structural classes of ABC proteins. Pfam clans (rounded boxes) with ABC protein hits are depicted. Those Pfam families (boxes), which include proteins with experimental structures, are circled, colored, and labeled in bold. The labels were selected based on a widely known member of the structural family (e.g., Pgp-like). PF02588/YitT_membrane is in italic and with a dotted outline since it likely does not involve ABC family members.
Figure 3. Grouping experimental structures based on the length of conftor(WA/SIG). The distribution of distance values of all collected ABC protein structures.
Figure 4. Grouping AF2 structures based on the length of conftor(WA/SIG). The distribution of distance values of all collected AF2-predicted, full ABC structures.
Figure 5. AF2 prediction of the mitoK tetramer. (a) The central tetramer of the Kir6.2 potassium channel from the octamer structure with SUR1 (PDB ID: 6jb1). The four chains are colored differently and spheres indicate transmembrane regions. (b) The Kir6.2 tetramer predicted by AF-Multimer exhibit high pLDDT scores (blue and turquoise regions). The RMSD calculated for TMDs was 0.8 Å when compared to its experimental structure. (c) The plot of predicted align errors (PAE) indicates low values, thus a predicted structure with high quality. (d) The AF-predicted structure of mitoK, colored by chains, does not resemble a potassium channel architecture. Spheres indicate transmembrane regions. (e) The yellow and orange colors of the same structures correspond to poorly predicted regions with low pLDDT scores. (f) The high values of the PAE plot also indicate unreliable structure prediction for the mitoK tetramer.
Supplementary Materials
The supporting information can be downloaded at:
References
1. George, A.M. ABC Transporters—40 Years on; Springer: Cham, Switzerland, 2016; ISBN 978-3-319-23476-2
2. Szakács, G.; Váradi, A.; Ozvegy-Laczka, C.; Sarkadi, B. The Role of ABC Transporters in Drug Absorption, Distribution, Metabolism, Excretion and Toxicity (ADME-Tox). Drug Discov. Today; 2008; 13, pp. 379-393. [DOI: https://dx.doi.org/10.1016/j.drudis.2007.12.010] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/18468555]
3. Csanády, L.; Vergani, P.; Gadsby, D.C. Structure, Gating, and Regulation of the Cftr Anion Channel. Physiol. Rev.; 2019; 99, pp. 707-738. [DOI: https://dx.doi.org/10.1152/physrev.00007.2018] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30516439]
4. Borst, P.; Váradi, A.; van de Wetering, K. PXE, a Mysterious Inborn Error Clarified. Trends Biochem. Sci.; 2019; 44, pp. 125-140. [DOI: https://dx.doi.org/10.1016/j.tibs.2018.10.005] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30446375]
5. Veit, G.; Avramescu, R.G.; Chiang, A.N.; Houck, S.A.; Cai, Z.; Peters, K.W.; Hong, J.S.; Pollard, H.B.; Guggino, W.B.; Balch, W.E. et al. From CFTR Biology toward Combinatorial Pharmacotherapy: Expanded Classification of Cystic Fibrosis Mutations. Mol. Biol. Cell; 2016; 27, pp. 424-433. [DOI: https://dx.doi.org/10.1091/mbc.e14-04-0935] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/26823392]
6. Sarkadi, B.; Homolya, L.; Hegedűs, T. The ABCG2/BCRP transporter and its variants—From structure to pathology. FEBS Lett.; 2020; 594, pp. 4012-4034. [DOI: https://dx.doi.org/10.1002/1873-3468.13947]
7. Liu, X. Transporter-Mediated Drug-Drug Interactions and Their Significance. Adv. Exp. Med. Biol.; 2019; 1141, pp. 241-291. [DOI: https://dx.doi.org/10.1007/978-981-13-7647-4_5]
8. Locher, K.P. Mechanistic Diversity in ATP-Binding Cassette (ABC) Transporters. Nat. Struct. Mol. Biol.; 2016; 23, pp. 487-493. [DOI: https://dx.doi.org/10.1038/nsmb.3216]
9. Srikant, S.; Gaudet, R. Mechanics and Pharmacology of Substrate Selection and Transport by Eukaryotic ABC Exporters. Nat. Struct. Mol. Biol.; 2019; 26, pp. 792-801. [DOI: https://dx.doi.org/10.1038/s41594-019-0280-4]
10. Lewinson, O.; Orelle, C.; Seeger, M.A. Structures of ABC Transporters: Handle with Care. FEBS Lett.; 2020; 594, pp. 3799-3814. [DOI: https://dx.doi.org/10.1002/1873-3468.13966]
11. Moraes, I.; Evans, G.; Sanchez-Weatherby, J.; Newstead, S.; Stewart, P.D.S. Membrane Protein Structure Determination—The next Generation. Biochim. Biophys. Acta; 2014; 1838, pp. 78-87. [DOI: https://dx.doi.org/10.1016/j.bbamem.2013.07.010]
12. Varga, J.K.; E Tusnády, G. The TMCrys server for supporting crystallization of transmembrane proteins. Bioinformatics; 2019; 35, pp. 4203-4204. [DOI: https://dx.doi.org/10.1093/bioinformatics/btz108] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30793168]
13. Batool, M.; Ahmad, B.; Choi, S. A Structure-Based Drug Discovery Paradigm. Int. J. Mol. Sci.; 2019; 20, 2783. [DOI: https://dx.doi.org/10.3390/ijms20112783] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31174387]
14. Shoemaker, S.C.; Ando, N. X-rays in the Cryo-Electron Microscopy Era: Structural Biology’s Dynamic Future. Biochemistry; 2018; 57, pp. 277-285. [DOI: https://dx.doi.org/10.1021/acs.biochem.7b01031]
15. Yin, H.; Flynn, A.D. Drugging Membrane Protein Interactions. Annu. Rev. Biomed. Eng.; 2016; 18, pp. 51-76. [DOI: https://dx.doi.org/10.1146/annurev-bioeng-092115-025322] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/26863923]
16. Santos, R.; Ursu, O.; Gaulton, A.; Bento, A.P.; Donadi, R.S.; Bologa, C.G.; Karlsson, A.; Al-Lazikani, B.; Hersey, A.; Oprea, T.I. et al. A Comprehensive Map of Molecular Drug Targets. Nat. Rev. Drug Discov.; 2017; 16, pp. 19-34. [DOI: https://dx.doi.org/10.1038/nrd.2016.230] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/27910877]
17. McPherson, A.; Gavira, J.A. Introduction to Protein Crystallization. Acta Crystallogr. Sect. F Struct. Biol. Commun.; 2014; 70, pp. 2-20. [DOI: https://dx.doi.org/10.1107/S2053230X13033141]
18. Brown, C.J.; Trieber, C.; Overduin, M. Structural Biology of Endogenous Membrane Protein Assemblies in Native Nanodiscs. Curr. Opin. Struct. Biol.; 2021; 69, pp. 70-77. [DOI: https://dx.doi.org/10.1016/j.sbi.2021.03.008]
19. Farkas, B.; Csizmadia, G.; Katona, E.; Tusnády, G.E.; Hegedűs, T. MemBlob Database and Server for Identifying Transmembrane Regions Using Cryo-EM Maps. Bioinformatics; 2020; 36, pp. 2595-2598. [DOI: https://dx.doi.org/10.1093/bioinformatics/btz539]
20. Pereira, J.; Simpkin, A.J.; Hartmann, M.D.; Rigden, D.J.; Keegan, R.M.; Lupas, A.N. High-accuracy protein structure prediction in CASP14. Proteins Struct. Funct. Bioinform.; 2021; 89, pp. 1687-1699. [DOI: https://dx.doi.org/10.1002/prot.26171]
21. Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A. et al. Highly accurate protein structure prediction with AlphaFold. Nature; 2021; 596, pp. 583-589. [DOI: https://dx.doi.org/10.1038/s41586-021-03819-2]
22. Varadi, M.; Anyango, S.; Deshpande, M.; Nair, S.; Natassia, C.; Yordanova, G.; Yuan, D.; Stroe, O.; Wood, G.; Laydon, A. et al. AlphaFold Protein Structure Database: Massively Expanding the Structural Coverage of Protein-Sequence Space with High-Accuracy Models. Nucleic Acids Res.; 2022; 50, pp. D439-D444. [DOI: https://dx.doi.org/10.1093/nar/gkab1061] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34791371]
23. Hegedűs, T.; Geisler, M.; Lukács, G.L.; Farkas, B. Ins and Outs of AlphaFold2 Transmembrane Protein Structure Predictions. Cell. Mol. Life Sci.; 2022; 79, 73. [DOI: https://dx.doi.org/10.1007/s00018-021-04112-1] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35034173]
24. Humphreys, I.R.; Pei, J.; Baek, M.; Krishnakumar, A.; Anishchenko, I.; Ovchinnikov, S.; Zhang, J.; Ness, T.J.; Banjade, S.; Bagde, S.R. et al. Computed structures of core eukaryotic protein complexes. Science; 2021; 374, eabm4805. [DOI: https://dx.doi.org/10.1126/science.abm4805] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34762488]
25. Du, Z.; Su, H.; Wang, W.; Ye, L.; Wei, H.; Peng, Z.; Anishchenko, I.; Baker, D.; Yang, J. The TrRosetta Server for Fast and Accurate Protein Structure Prediction. Nat. Protoc.; 2021; 16, pp. 5634-5651. [DOI: https://dx.doi.org/10.1038/s41596-021-00628-9] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34759384]
26. Thomas, C.; Aller, S.G.; Beis, K.; Carpenter, E.P.; Chang, G.; Chen, L.; Dassa, E.; Dean, M.; Duong Van Hoa, F.; Ekiert, D. et al. Structural and Functional Diversity Calls for a New Classification of ABC Transporters. FEBS Lett.; 2020; 594, pp. 3767-3775. [DOI: https://dx.doi.org/10.1002/1873-3468.13935]
27. Decottignies, A.; Goffeau, A. Complete Inventory of the Yeast ABC Proteins. Nat. Genet.; 1997; 15, pp. 137-145. [DOI: https://dx.doi.org/10.1038/ng0297-137]
28. Klein, I.; Sarkadi, B.; Váradi, A. An Inventory of the Human ABC Proteins. Biochim. Biophys. Acta BBA—Biomembr.; 1999; 1461, pp. 237-262. [DOI: https://dx.doi.org/10.1016/S0005-2736(99)00161-3]
29. Frank, G.A.; Shukla, S.; Rao, P.; Borgnia, M.J.; Bartesaghi, A.; Merk, A.; Mobin, A.; Esser, L.; Earl, L.A.; Gottesman, M.M. et al. Cryo-EM Analysis of the Conformational Landscape of Human P-Glycoprotein (ABCB1) During Its Catalytic Cycle. Mol. Pharmacol.; 2016; 90, pp. 35-41. [DOI: https://dx.doi.org/10.1124/mol.116.104190]
30. Beis, K. Structural Basis for the Mechanism of ABC Transporters. Biochem. Soc. Trans.; 2015; 43, pp. 889-893. [DOI: https://dx.doi.org/10.1042/BST20150047]
31. Prescher, M.; Bonus, M.; Stindt, J.; Keitel-Anselmino, V.; Smits, S.H.; Gohlke, H.; Schmitt, L. Evidence for a credit-card-swipe mechanism in the human PC floppase ABCB4. Structure; 2021; 29, pp. 1144-1155. [DOI: https://dx.doi.org/10.1016/j.str.2021.05.013]
32. Ford, R.C.; Beis, K. Learning the ABCs One at a Time: Structure and Mechanism of ABC Transporters. Biochem. Soc. Trans.; 2019; 47, pp. 23-36. [DOI: https://dx.doi.org/10.1042/BST20180147]
33. DALI and the Persistence of Protein Shape—PubMed. Available online: https://pubmed.ncbi.nlm.nih.gov/31606894/ (accessed on 2 June 2022).
34. Fox, N.K.; Brenner, S.E.; Chandonia, J.-M. SCOPe: Structural Classification of Proteins—Extended, Integrating SCOP and ASTRAL Data and Classification of New Structures. Nucleic Acids Res.; 2014; 42, pp. D304-D309. [DOI: https://dx.doi.org/10.1093/nar/gkt1240] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/24304899]
35. Burley, S.K.; Bhikadiya, C.; Bi, C.; Bittrich, S.; Chen, L.; Crichlow, G.V.; Christie, C.H.; Dalenberg, K.; Di Costanzo, L.; Duarte, J.M. et al. RCSB Protein Data Bank: Powerful New Tools for Exploring 3D Structures of Biological Macromolecules for Basic and Applied Research and Education in Fundamental Biology, Biomedicine, Biotechnology, Bioengineering and Energy Sciences. Nucleic Acids Res.; 2021; 49, pp. D437-D451. [DOI: https://dx.doi.org/10.1093/nar/gkaa1038] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33211854]
36. Evans, R.; O’Neill, M.; Pritzel, A.; Antropova, N.; Senior, A.W.; Green, T.; Žídek, A.; Bates, R.; Blackwell, S.; Yim, J. et al. Protein Complex Prediction with AlphaFold-Multimer. bioRxiv; 2021; [DOI: https://dx.doi.org/10.1101/2021.10.04.463034]
37. Xu, D.; Feng, Z.; Hou, W.-T.; Jiang, Y.-L.; Wang, L.; Sun, L.; Zhou, C.-Z.; Chen, Y. Cryo-EM Structure of Human Lysosomal Cobalamin Exporter ABCD4. Cell Res.; 2019; 29, pp. 1039-1041. [DOI: https://dx.doi.org/10.1038/s41422-019-0222-z]
38. Wang, R.; Qin, Y.; Li, X. Structural Basis of Acyl-CoA Transport across the Peroxisomal Membrane by Human ABCD1. Cell Res.; 2022; 32, pp. 214-217. [DOI: https://dx.doi.org/10.1038/s41422-021-00585-8]
39. Rempel, S.; Gati, C.; Nijland, M.; Thangaratnarajah, C.; Karyolaimos, A.; de Gier, J.W.; Guskov, A.; Slotboom, D.J. A Mycobacterial ABC Transporter Mediates the Uptake of Hydrophilic Compounds. Nature; 2020; 580, pp. 409-412. [DOI: https://dx.doi.org/10.1038/s41586-020-2072-8]
40. Ghilarov, D.; Inaba-Inoue, S.; Stepien, P.; Qu, F.; Michalczyk, E.; Pakosz, Z.; Nomura, N.; Ogasawara, S.; Walker, G.C.; Rebuffat, S. et al. Structural Insight into the Staphylococcus Aureus ATP-Driven Exporter of Virulent Peptide Toxins. Sci. Adv.; 2021; 7, eabj5363. [DOI: https://dx.doi.org/10.1126/sciadv.abj5363]
41. George, N.L.; Schilmiller, A.L.; Orlando, B.J. Conformational Snapshots of the Bacitracin Sensing and Resistance Transporter BceAB. Proc. Natl. Acad. Sci. USA; 2022; 119, e2123268119. [DOI: https://dx.doi.org/10.1073/pnas.2123268119]
42. Mann, D.; Fan, J.; Somboon, K.; Farrell, D.P.; Muenks, A.; Tzokov, S.B.; DiMaio, F.; Khalid, S.; Miller, S.I.; Bergeron, J.R.C. Structure and Lipid Dynamics in the Maintenance of Lipid Asymmetry Inner Membrane Complex of A. Baumannii. Commun. Biol.; 2021; 4, 817. [DOI: https://dx.doi.org/10.1038/s42003-021-02318-4]
43. Yu, Q.; Ni, D.; Kowal, J.; Manolaridis, I.; Jackson, S.M.; Stahlberg, H.; Locher, K.P. Structures of ABCG2 under Turnover Conditions Reveal a Key Step in the Drug Transport Mechanism. Nat. Commun.; 2021; 12, 4376. [DOI: https://dx.doi.org/10.1038/s41467-021-24651-2] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34282134]
44. Csizmadia, G.; Farkas, B.; Spagina, Z.; Tordai, H.; Hegedűs, T. Quantitative Comparison of ABC Membrane Protein Type I Exporter Structures in a Standardized Way. Comput. Struct. Biotechnol. J.; 2018; 16, pp. 396-403. [DOI: https://dx.doi.org/10.1016/j.csbj.2018.10.008] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30425800]
45. Szabó, K.; Szakács, G.; Hegeds, T.; Sarkadi, B. Nucleotide Occlusion in the Human Cystic Fibrosis Transmembrane Conductance Regulator. Different Patterns in the Two Nucleotide Binding Domains. J. Biol. Chem.; 1999; 274, pp. 12209-12212. [DOI: https://dx.doi.org/10.1074/jbc.274.18.12209]
46. Yeh, H.-I.; Yu, Y.-C.; Kuo, P.-L.; Tsai, C.-K.; Huang, H.-T.; Hwang, T.-C. Functional Stability of CFTR Depends on Tight Binding of ATP at Its Degenerate ATP-Binding Site. J. Physiol.; 2021; 599, pp. 4625-4642. [DOI: https://dx.doi.org/10.1113/JP281933] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34411298]
47. Nagy, T.; Tóth, Á.; Telbisz, Á.; Sarkadi, B.; Tordai, H.; Tordai, A.; Hegedűs, T. The transport pathway in the ABCG2 protein and its regulation revealed by molecular dynamics simulations. Cell. Mol. Life Sci.; 2020; 78, pp. 2329-2339. [DOI: https://dx.doi.org/10.1007/s00018-020-03651-3] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32979053]
48. Stockner, T.; de Vries, S.J.; Bonvin, A.M.J.J.; Ecker, G.F.; Chiba, P. Data-Driven Homology Modelling of P-Glycoprotein in the ATP-Bound State Indicates Flexibility of the Transmembrane Domains. FEBS J.; 2009; 276, pp. 964-972. [DOI: https://dx.doi.org/10.1111/j.1742-4658.2008.06832.x] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/19215299]
49. Orlando, B.J.; Liao, M. ABCG2 Transports Anticancer Drugs via a Closed-to-Open Switch. Nat. Commun.; 2020; 11, 2264. [DOI: https://dx.doi.org/10.1038/s41467-020-16155-2]
50. Zhang, M.; Bao, Z.; Zhao, Q.; Guo, H.; Xu, K.; Wang, C.; Zhang, P. Structure of a Pantothenate Transporter and Implications for ECF Module Sharing and Energy Coupling of Group II ECF Transporters. Proc. Natl. Acad. Sci. USA; 2014; 111, pp. 18560-18565. [DOI: https://dx.doi.org/10.1073/pnas.1412246112]
51. Bao, Z.; Qi, X.; Hong, S.; Xu, K.; He, F.; Zhang, M.; Chen, J.; Chao, D.; Zhao, W.; Li, D. et al. Structure and Mechanism of a Group-I Cobalt Energy Coupling Factor Transporter. Cell Res.; 2017; 27, pp. 675-687. [DOI: https://dx.doi.org/10.1038/cr.2017.38]
52. Rempel, S.; Stanek, W.K.; Slotboom, D.J. ECF-Type ATP-Binding Cassette Transporters. Annu. Rev. Biochem.; 2019; 88, pp. 551-576. [DOI: https://dx.doi.org/10.1146/annurev-biochem-013118-111705]
53. Setyawati, I.; Stanek, W.K.; Majsnerowska, M.; Swier, L.J.Y.M.; Pardon, E.; Steyaert, J.; Guskov, A.; Slotboom, D.J. In Vitro Reconstitution of Dynamically Interacting Integral Membrane Subunits of Energy-Coupling Factor Transporters. eLife; 2020; 9, e64389. [DOI: https://dx.doi.org/10.7554/eLife.64389] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33350937]
54. Bryan, J.; Muñoz, A.; Zhang, X.; Düfer, M.; Drews, G.; Krippeit-Drews, P.; Aguilar-Bryan, L. ABCC8 and ABCC9: ABC Transporters That Regulate K+ Channels. Pflügers Arch. -Eur. J. Physiol.; 2007; 453, pp. 703-718. [DOI: https://dx.doi.org/10.1007/s00424-006-0116-z] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/16897043]
55. Wu, J.-X.; Ding, D.; Wang, M.; Kang, Y.; Zeng, X.; Chen, L. Ligand Binding and Conformational Changes of SUR1 Subunit in Pancreatic ATP-Sensitive Potassium Channels. Protein Cell; 2018; 9, pp. 553-567. [DOI: https://dx.doi.org/10.1007/s13238-018-0530-y] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/29594720]
56. Paggio, A.; Checchetto, V.; Campo, A.; Menabò, R.; Di Marco, G.; Di Lisa, F.; Szabo, I.; Rizzuto, R.; De Stefani, D. Identification of an ATP-Sensitive Potassium Channel in Mitochondria. Nature; 2019; 572, pp. 609-613. [DOI: https://dx.doi.org/10.1038/s41586-019-1498-3] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31435016]
57. Heginbotham, L.; Lu, Z.; Abramson, T.; MacKinnon, R. Mutations in the K+ Channel Signature Sequence. Biophys. J.; 1994; 66, pp. 1061-1067. [DOI: https://dx.doi.org/10.1016/S0006-3495(94)80887-2]
58. Doyle, D.A.; Morais Cabral, J.; Pfuetzner, R.A.; Kuo, A.; Gulbis, J.M.; Cohen, S.L.; Chait, B.T.; MacKinnon, R. The Structure of the Potassium Channel: Molecular Basis of K+ Conduction and Selectivity. Science; 1998; 280, pp. 69-77. [DOI: https://dx.doi.org/10.1126/science.280.5360.69]
59. Sehnal, D.; Bittrich, S.; Deshpande, M.; Svobodová, R.; Berka, K.; Bazgier, V.; Velankar, S.; Burley, S.K.; Koča, J.; Rose, A.S. Mol* Viewer: Modern Web App for 3D Visualization and Analysis of Large Biomolecular Structures. Nucleic Acids Res.; 2021; 49, pp. W431-W437. [DOI: https://dx.doi.org/10.1093/nar/gkab314]
60. Jones, P.M.; George, A.M. Is the Emperor Wearing Shorts? The Published Structures of ABC Transporters. FEBS Lett.; 2020; 594, pp. 3790-3798. [DOI: https://dx.doi.org/10.1002/1873-3468.13941]
61. Neumann, J.; Rose-Sperling, D.; Hellmich, U.A. Diverse Relations between ABC Transporters and Lipids: An Overview. Biochim. Biophys. Acta BBA—Biomembr.; 2017; 1859, pp. 605-618. [DOI: https://dx.doi.org/10.1016/j.bbamem.2016.09.023]
62. Farkas, B.; Tordai, H.; Padányi, R.; Tordai, A.; Gera, J.; Paragi, G.; Hegedűs, T. Discovering the chloride pathway in the CFTR channel. Cell. Mol. Life Sci.; 2019; 77, pp. 765-778. [DOI: https://dx.doi.org/10.1007/s00018-019-03211-4]
63. Hoffmann, A.; Grudinin, S. NOLB: Nonlinear Rigid Block Normal-Mode Analysis Method. J. Chem. Theory Comput.; 2017; 13, pp. 2123-2134. [DOI: https://dx.doi.org/10.1021/acs.jctc.7b00197] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/28379696]
64. Zhang, S.; Krieger, J.M.; Zhang, Y.; Kaya, C.; Kaynak, B.; Mikulska-Ruminska, K.; Doruker, P.; Li, H.; Bahar, I. ProDy 2.0: Increased scale and scope after 10 years of protein dynamics modelling with Python. Bioinformatics; 2021; 37, pp. 3657-3659. [DOI: https://dx.doi.org/10.1093/bioinformatics/btab187] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33822884]
65. Zheng, W.; Zhou, X.; Wuyun, Q.; Pearce, R.; Li, Y.; Zhang, Y. FUpred: Detecting Protein Domains through Deep-Learning-Based Contact Map Prediction. Bioinformatics; 2020; 36, pp. 3749-3757. [DOI: https://dx.doi.org/10.1093/bioinformatics/btaa217] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32227201]
66. Hekkelman, M.L.; de Vries, I.; Joosten, R.P.; Perrakis, A. AlphaFill: Enriching the AlphaFold Models with Ligands and Co-Factors. bioRxiv; 2021; [DOI: https://dx.doi.org/10.1101/2021.11.26.470110]
67. Bagdonas, H.; Fogarty, C.A.; Fadda, E.; Agirre, J. The Case for Post-Predictional Modifications in the AlphaFold Protein Structure Database. Nat. Struct. Mol. Biol.; 2021; 28, pp. 869-870. [DOI: https://dx.doi.org/10.1038/s41594-021-00680-9]
68. Gyimesi, G.; Borsodi, D.; Sarankó, H.; Tordai, H.; Sarkadi, B.; Hegedűs, T. ABCMdb: A Database for the Comparative Analysis of Protein Mutations in ABC Transporters, and a Potential Framework for a General Application. Hum. Mutat.; 2012; 33, pp. 1547-1556. [DOI: https://dx.doi.org/10.1002/humu.22138]
69. Tordai, H.; Jakab, K.; Gyimesi, G.; András, K.; Brózik, A.; Sarkadi, B.; Hegedus, T. ABCMdb Reloaded: Updates on Mutations in ATP Binding Cassette Proteins. Database; 2017; 2017, bax023. [DOI: https://dx.doi.org/10.1093/database/bax023]
70. Eddy, S.R. Accelerated Profile HMM Searches. PLoS Comput. Biol.; 2011; 7, e1002195. [DOI: https://dx.doi.org/10.1371/journal.pcbi.1002195]
71. Cock, P.J.A.; Antao, T.; Chang, J.T.; Chapman, B.A.; Cox, C.J.; Dalke, A.; Friedberg, I.; Hamelryck, T.; Kauff, F.; Wilczynski, B. et al. Biopython: Freely Available Python Tools for Computational Molecular Biology and Bioinformatics. Bioinformatics; 2009; 25, pp. 1422-1423. [DOI: https://dx.doi.org/10.1093/bioinformatics/btp163]
72. Zhang, Y.; Skolnick, J. TM-Align: A Protein Structure Alignment Algorithm Based on the TM-Score. Nucleic Acids Res.; 2005; 33, pp. 2302-2309. [DOI: https://dx.doi.org/10.1093/nar/gki524]
73. Michaud-Agrawal, N.; Denning, E.J.; Woolf, T.B.; Beckstein, O. MDAnalysis: A Toolkit for the Analysis of Molecular Dynamics Simulations. J. Comput. Chem.; 2011; 32, pp. 2319-2327. [DOI: https://dx.doi.org/10.1002/jcc.21787] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/21500218]
74. Harris, C.R.; Millman, K.J.; van der Walt, S.J.; Gommers, R.; Virtanen, P.; Cournapeau, D.; Wieser, E.; Taylor, J.; Berg, S.; Smith, N.J. et al. Array Programming with NumPy. Nature; 2020; 585, pp. 357-362. [DOI: https://dx.doi.org/10.1038/s41586-020-2649-2] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32939066]
75. UniProt Consortium UniProt: The Universal Protein Knowledgebase in 2021. Nucleic Acids Res.; 2021; 49, pp. D480-D489. [DOI: https://dx.doi.org/10.1093/nar/gkaa1100] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33237286]
76. Wojdyr, M. GEMMI: A Library for Structural Biology. J. Open Source Softw.; 2022; 7, 4200. [DOI: https://dx.doi.org/10.21105/joss.04200]
77. Bayer, M. SQLAlchemy. The Architecture of Open Source Applications Volume II: Structure, Scale, and a Few More Fearless Hacks; Brown, A.; Wilson, G. 2012; Available online: https://aosabook.org (accessed on 10 March 2022).
78. Hunter, J.D. Matplotlib: A 2D Graphics Environment. Comput. Sci. Eng.; 2007; 9, pp. 90-95. [DOI: https://dx.doi.org/10.1109/MCSE.2007.55]
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
The number of unique transmembrane (TM) protein structures doubled in the last four years, which can be attributed to the revolution of cryo-electron microscopy. In addition, AlphaFold2 (AF2) also provided a large number of predicted structures with high quality. However, if a specific protein family is the subject of a study, collecting the structures of the family members is highly challenging in spite of existing general and protein domain-specific databases. Here, we demonstrate this and assess the applicability and usability of automatic collection and presentation of protein structures via the ABC protein superfamily. Our pipeline identifies and classifies transmembrane ABC protein structures using the PFAM search and also aims to determine their conformational states based on special geometric measures, conftors. Since the AlphaFold database contains structure predictions only for single polypeptide chains, we performed AF2-Multimer predictions for human ABC half transporters functioning as dimers. Our AF2 predictions warn of possibly ambiguous interpretation of some biochemical data regarding interaction partners and call for further experiments and experimental structure determination. We made our predicted ABC protein structures available through a web application, and we joined the 3D-Beacons Network to reach the broader scientific community through platforms such as PDBe-KB.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details





1 Department of Biophysics and Radiation Biology, Semmelweis University, 1085 Budapest, Hungary
2 Department of Biophysics and Radiation Biology, Semmelweis University, 1085 Budapest, Hungary; Faculty of Electrical Engineering and Informatics, Budapest University of Technology and Economics, 1111 Budapest, Hungary; Wigner Research Centre for Physics, 1121 Budapest, Hungary
3 Department of Structural and Molecular Biology, University College London, London WC1E 6BT, UK
4 European Bioinformatics Institute, European Molecular Biology Laboratory, Hinxton CB10 1SD, UK
5 Department of Biophysics and Radiation Biology, Semmelweis University, 1085 Budapest, Hungary; ELKH-SE Biophysical Virology Research Group, Eötvös Loránd Research Network, 1052 Budapest, Hungary