Introduction
The biological function of proteins is directly linked to dynamic changes of their structures. Conformational flexibilities, heterogeneities, and polymorphisms are known to enable interactions among biomolecules, promote promiscuity with different binding partners, and are essential for enzymatic activity (Tompa and Fuxreiter, 2008; Hensen et al., 2012). This is most evident for motor proteins such as myosin or dynamin, where cyclic structural changes are crucial for their function. Thus, for a mechanistic molecular understanding of biological processes, the structure and the associated dynamics of the key components need to be characterized in great detail, ideally on a single-molecule level (Lerner et al., 2018; Lerner et al., 2021).
While NMR spectroscopy is an excellent tool to map conformationally excited states and intermediates (Neudecker et al., 2012) the determination of dynamic biomolecular structures of large systems is extremely challenging. To-date no individual technique fully maps structures and dynamics on all time scales and on a length scale necessary to understand large molecular systems. Thus, multiple experimental techniques need to be combined to probe different aspects and unveil structures of large multi-domain proteins (Felekyan et al., 2012; Kilic et al., 2018; Lerner et al., 2021). Here, we present and apply a framework that integrates short and long-range distances with shape information amended by time-resolved spectroscopy and molecular dynamics simulations for dynamic structures. Our framework identifies functional elements as building blocks, and balances experimental information in a meta-analysis to generate integrative dynamic structures with a small number of informative distances.
We apply our framework to study molecular mechanisms of a guanylate binding protein (GBP), a class of soluble proteins that belong to the dynamin superfamily and to the class of interferon-γ induced effector molecules (Praefcke and McMahon, 2004). GBPs are important for innate cell-autonomous immunity in mammals. GBPs form supramolecular complexes during infection and are recognized for their immune activity against a wide range of intracellular pathogens such as viruses (Anderson et al., 1999; Itsui et al., 2009), and bacteria (Kim et al., 2011; MacMicking, 2012; Li et al., 2017). Noteworthy, a GBP in mice translocates from the cytosol to endomembranes and attacks the plasma membrane of eukaryotic cellular parasites by the controlled formation of productive and supramolecular complexes (Kravets et al., 2016). As a prime example for a GBP, we study the human GBP1 (hGBP1). hGBP1 shows nucleotide-dependent dimerization (Ghosh et al., 2006), and the formation of supramolecular structures promoted by GTPase activity (Shydlovskyi et al., 2017). X-ray crystallography on the full-length hGBP1 revealed a folded and fully structured protein with the typical architecture of a dynamin superfamily member. hGBP1 consists of a large GTPase domain (LG domain), an alpha-helical middle domain, and an elongated, also purely alpha-helical, effector domain comprising the helices α12 and α13, with a length of 120 Å (Prakash et al., 2000; Figure 1A). X-ray crystallography (Ghosh et al., 2006) and biochemical experiments (Ince et al., 2017) identified the LG domains as interface for GTP induced homo-dimerization. Like for other membrane-associated dynamins that form tubular shaped assemblies to fuse or divide membranes in cells (Faelber et al., 2012; Reubold et al., 2015), cylindrical and tubular structures have been observed for hGBP1 (Shydlovskyi et al., 2017). For hGBP1 neither molecular structures of these tubules nor precursor structures in solution that could inform on the assembly pathway are known (Cui et al., 2021). Previous FRET and DEER experiments on hGBP1 dimers identified two conformers. In the dominant dimer, the two C-terminal α13 helices associate (Vöpel et al., 2014). This is in line with live-cell experiments that highlight the relevance of helix α13 for the immune response (Tietzel et al., 2009; Li et al., 2017; Piro et al., 2017). Previously, we identified monomeric and dimeric forms of farnesylated hGBP1 by SEC-SAXS and ultracentrifugation (Figure 1—figure supplement 1). These experiments lead to the hypothesis that specific intramolecular interactions stabilize the GTPase and act as a safety mechanism preventing hGBP1 dimerization (Lorenz et al., 2020). Here, to unravel the conformational changes necessary for the formation of a fully bridged dimer (b-hGBP1:L)2 (Figure 1B), we study non-farnesylated hGBP1, where nucleotide ligands L (GTP) are bound and the effector domains
Figure 1.
DEER and FRET distance network that probes structural arrangement of the human guanylate binding protein 1 (hGBP1) and potential dimerization pathways.
(A) The network is shown on top of the crystal structure (hGBP1, PDB-ID: 1DG3). hGBP1 consists of three domains: the LG domain (blue), a middle domain (gray) and the helices α12/13 (green/orange). The amino acids highlighted by the labels were used to attach spin-labels and fluorophores for DEER-EPR and FRET experiments, respectively. Magenta and black lines represent the DEER- and FRET-pairs, respectively. In hGBP1 the C-terminus is post-translationally modified and farnesylated for insertion into membranes (red). (B) Potential different pathways for the formation of a functional hGBP1 homodimer where the substrate binding LG domains and the helix α13 associate. The association of the helix α13 requires flexibility (red arrows). This flexibility could be induced at different stages of a dimerization pathway.
Figure 1—figure supplement 1.
Structural knowledge on hGBP1.
Crystallographic and model-free structural knowledge on hGBP1 is highlighted by solid contour lines of the domains. Model-based and indirect structural information is highlighted by dashed contour lines of the domains. (A) In the crystallographic monomer (PDB-ID: 1DG3) the LG- (blue), middle- (gray), helix α12 (green), and helix α13 (orange) of hGBP1 adopt an extended conformation where helix α12/13 attach to one side of the LG-domain (Prakash et al., 2000). (B) In crystallographic dimer data of the LG-domain (2BC9, 2B92, 2B8W) and in a crystallographic dimer of the full-length protein (PDB-ID: 1F5N) require helix α13 to be located at diastral sites of the LG:LG domain dimer (C) FRET and DEER experiments on hGBP1 verified crystallographic LG:LG domain interaction (solid lines) and identified two dimer conformations. In the minor (10% population) conformation helix α13 were separated. In the major conformation (90% population) two α13 helices associate (Vöpel et al., 2014). (D) A crystallographic structure on farnesylated hGBP1. The farnesyl anchor (red) attached to helix α13 extends along helix α12 and binds between α12 and α13. The farnesylated hGBP1 and the non-farnesylated hGBP1 crystal structures are largely consistent (RMSD 1.4 Å). (E) SEC-SAXS experiments on farnesylated hGBP1 identified a monomer and (F) two dimer conformations (Lorenz et al., 2020).
By probing hGBP1’s flexibility (red arrows, Figure 1B), we discern the different dimerization paths (black arrow, Figure 1B). Either the flexibility is substrate independent (
Experimentally, we map the motions of the monomeric non-farnesylated hGBP1 in the absence and in the presence of the non-hydrolysable ligand GDP-AlFx, corroborated by GTP control experiments. By combining experimental information through integrative modeling, we also resolve hGBP1 structures that explain the molecular prerequisites for dimerization. To generate structures, we use information from small-angle X-ray scattering (SAXS), electron paramagnetic resonance (EPR) spectroscopy by site-directed spin labeling (Klare and Steinhoff, 2009), ensemble and single-molecule fluorescence spectroscopy (Hellenkamp et al., 2018). smFRET and DEER independently yield distance restraints for modeling, the former with the advantage of being a single-molecule technique that can be applied under ambient conditions, whereas the latter uses a single type of label that is smaller compared to FRET labels, simplifying treatment of the label for modeling purposes. For dynamic information, we apply neutron spin-echo spectroscopy (NSE) and filtered fluorescence correlation spectroscopy (fFCS) (Felekyan et al., 2012; Lerner et al., 2021). We resolve structures of two new conformational states by integrative modeling and mapped hGBP1’s kinetics from nanoseconds to milliseconds. Interrogating conformational dynamics by a network of 12 FRET pairs (Figure 1A), we generate a temporal spectrum of hGBP1’s internal motions. Finally, we discuss potential implications of the detected protein flexibility and conformers controlling the formation of multimers via an opening like a pocketknife. This allows us to understand the mechanisms excreting the function of this large multi-domain system, that is, the programmed and controlled oligomerization.
Results
Experimental equilibrium distributions
We performed DEER, FRET and SAXS experiments to probe short distances, long distances, and molecular shapes, respectively. For the DEER and FRET experiments, we used engineered non-farnesylated hGBP1 cysteine variants (Figure 1A) labeled with MTSSL (R1) as spin label and with Alexa488-Alexa647 as FRET pair (Förster radius
Figure 2.
Probing the structure of hGBP1 in solution experimentally.
The left panels illustrate the characteristic properties probed by the experiments: (A) small angle X-ray scattering (SAXS), (B) double electron-electron resonance spectroscopy (DEER), and (C) Förster resonance energy transfer spectroscopy (FRET). In general, all middle panels display representations of the experimental data (dark yellow curves). The right panels show model-free analysis (red). Predicted experimental data based on a full-length X-ray crystal structure (PDB-ID: 1DG3) are shown in blue. To the top of the experimental curves, either data noise weighted, w.res., or unweighted residuals, res., are shown (middle panels). DEER and FRET experiments sense distances between labels that are flexibly coupled to specific labeling sites (exemplified for the double cysteine variant Q344C/A496C). The time-dependent responses of the sample (middle) inform on the inter-label distance distributions (right panels). The recovered distance distributions are compared to structural models by simulating the spatial distribution of the labels around their attachment point (left panels). The spatial distributions of the MTSSL-labels (B, left), as well as the donor and acceptor dye (C, left), are shown in magenta, green, and red, respectively. All distances resolved by EPR and FRET are compiled in Appendix 1—table 1 (A) Left: In SAXS the scattered intensity
Figure 2—figure supplement 1.
Small-angle X-ray scattering measurements on the nucleotide-free hGBP1.
(A) Measured SAXS data of hGBP1 at different protein concentrations. The scattering curves are not normalized by the protein concentration. (B) Structure factor of the 29.9 mg/mL solution extracted from the SAXS data. The structure factor is obtained by the background corrected SAXS curves at highest concentration scaled through division by the form factor (empty circles). The fitted structure factors according to the Percus-Yevik structure factor include the correction for the protein asymmetry factor
Figure 2—figure supplement 2.
DEER-spectroscopy on a network of MTSSL spin-labeled pairs of the hGBP1 resolves pairwise inter-label distance distributions.
(A) At the top, the network of spin-labeled hGBP1 is shown superposed to a crystal structure of hGBP1. A rotamer library analysis (RLA) simulates for the crystal structure (PDB-ID: 1DG3), the FRET major state (M1), the minor state (M2) inter-spin distance distributions. To the left, experimental background corrected DEER-traces and simulated DEER-traces based on a RLA of different structural models; to the right, inter-spin distance distribution as determined by Tikhonov regularization of the experimental DEER-trace. (B) Parametrization of the EPR-MTSSL label for accessible volume calculations. Top the distance distributions for the spin-pair N18C/Q577C of the hGBP1 crystal structure (PDB-ID: 1DG3) as calculated by the MTSSL-Wizard (Hagelueken et al., 2012) is overlaid by the distance distribution as calculated by accessible volume calculations with the parameter set as provided below. For visual comparison, the rotamers are overlaid with the accessible volume calculated for the labeling position N18C. To parameterize the MTSSL label, we used the variant N18C/Q577C as reference and optimized the simulated linker-length, the label-radius and the linker-width until the distance distribution as determined by the AV-calculations agrees best with the distance distributions as determined by the MTSSL-Wizard (Hagelueken et al., 2012) and MMM (Polyhach, Bordignon et al.). The best agreement was found using a linker-length of 8.5 Å, a linker-width of 4.5 Å and a label-radius of 4.0 Å. All rigid body dockings were performed using this parameter set.
Figure 2—figure supplement 3.
Quality controls for labeling based methods.
(A) Assessment of the labeling on the protein activity by comparison of the GTPase activity (left) and the self-oligomerization of hGBP1 (right). The effect of labeling on GTPase activity of hGBP1 as measured by the specific activities of 1µM single cysteine hGBP1 mutants at 25°C, either unlabeled or modified by Alexa488 or MTSSL at their free cysteines. Specific activities of the hGBP1 variants labeled by Alexa488 and Alexa647. The effect of the labels on the oligomerization of hGBP1 was assessed by size exclusion chromatography of 20 µM of unlabeled (top, right) and double labelled hGBP1 Cys9 (bottom, right) in the presence of 150–200 µM GppNHp or GDP AlFx or in the absence of any nucleotide. (B) The fluorescence properties of the dye were studied by time-resolved anisotropies and dynamic quenching. The donor Alexa488 was predominantly freely rotating. This is highlighted by the fast-initial decay of the time-resolved anisotropy,
DEER and FRET experiments on engineered non-farnesylated hGBP1 cysteine variants probed distances between specific labeling sites (Figure 1A) - exemplified for the dual cysteine variant Q344C/A496C (Figure 2B and C). The inter-spin distances recovered for Q344C/A496C by a model free DEER analysis are clearly shifted by ~2.5 Å towards shorter distances compared to the distances simulated for an X-ray structure of non-farnesylated hGBP1 (PDB-ID: 1DG3) using a rotamer library analysis (RLA) approach (Polyhach et al., 2011; Figure 2B, right). This shows that the protein exhibits conformations, where the spin-labels come closer to each other than suggested by the crystal structure. Overall, the experimental inter-spin distributions,
FRET experiments using ensemble time-correlated single photon counting (eTCSPC) recovered inter-fluorophore distance distributions,
We simulate the positional distribution of the dyes by their accessible volume (AV) (Cai et al., 2007; Muschielok et al., 2008; Sindbert et al., 2011) to compare structures and FRET experiments (Sindbert et al., 2011; Kalinin et al., 2012). In the comparison we considered uncertainty estimates of the experimental distances (Appendix 2) and accounted for interactions of the dyes with the protein by the accessible contact volume (ACV) (Dimura et al., 2016). The fraction of dyes in an ACV was calibrated by time-resolved anisotropy experiments (Figure 2—figure supplement 3B, Appendix 1—table 1). Moreover, the anisotropy was used to estimate uncertainties using experimental informed orientation factor distributions (Dale et al., 1979). The dyes are only weakly quenched to an extent that is expected for their local environment validating the used model of a mobile dye (Appendix 1—table 2). In this case, the approximation showed to be accurate (Peulen et al., 2017). Activity assays show that the dyes and the mutations only weakly affect the protein function (Appendix 2, Figure 2—figure supplement 3A). This provides compelling evidence that the distances can be used for structural interpretations.
Distances of M1 agree better with the full length X-ray structure than M2 (Figure 2C, right, Appendix 1—table 1) - the sum of uncertainty weighted squared deviations, , for M1 is significantly smaller than for M2 (
To sum up, EPR-DEER at cryogenic temperatures detected small deviations to the crystal structure. SAXS and FRET detected clear deviations at room temperature. To describe the FRET data at least two states are necessary, which are not detected in the DEER experiments most likely due to re-equilibration of the two conformations during sample freezing. Temperature-dependent measurements revealed that these states are also populated at higher physiological temperatures (Appendix 2, Figure 2—figure supplement 3C).
Identification and quantification of molecular kinetics
The distance information of the SAXS, DEER, and FRET experiments provides evidence for a motion of the middle-domain, the LG-domain, and α12/13. We probe the global and the inter-domain motion by single-molecule (sm)FRET experiments with Multiparameter Fluorescence Detection (MFD) and Neutron Spin Echo (NSE) experiments (Sisamakis et al., 2010; Biehl et al., 2011). The NSE experiments are most sensitive up to a correlation time of 200 ns. The filtered fluorescence correlation spectroscopy (fFCS) of our MFD data is most sensitive from sub-microseconds to milliseconds. Thus, by combining NSE with MFD-fFCS, we effectively probe for conformational dynamics from nano- to milliseconds.
An analysis of the NSE data is visualized in Figure 3A, which displays the effective diffusion coefficient
Figure 3.
Conformational dynamics of hGBP1 studied by neutron spin echo (NSE), single molecule (sm) FRET with multi-parameter fluorescence detection (MFD), and molecular dynamics (MD) simulations.
(A) Effective diffusion coefficients of hGBP1,
Figure 3—figure supplement 1.
Neutron spin echo spectroscopy (NSE) on the hGBP1 resolves internal dynamics on the nanosecond timescale.
(A) Intermediate scattering function as measured by NSE with fits according to the rigid body models. The numbers to the right show the respective wave-vectors from top down with
Figure 3—figure supplement 2.
Single-molecule fluorescence measurements.
Multi-parameter fluorescence detection histograms of different variants of Alexa488 and Alexa647 labeled human guanylate binding protein 1. The dashed blue lines are either static FRET-lines (see Materials and methods) considering linker broadening (top panels) or Perrin-equations for a dye with two rotational correlation times (Appendix 2, Equations 21 and 22).
Figure 3—figure supplement 3.
Sub-ensemble fluorescence decays of single-molecule FRET measurements on different FRET-labeled (Alexa488, Alexa647) variant of hGBP1.
Considering the distinct fluorescence species and acquired data, the fluorescence decays of the 12 samples are display in the following colors:
Figure 3—figure supplement 4.
Global analysis of filtered fluorescence correlation spectroscopy of FRET-labeled variants for the hGBP1 probing its internal dynamics from µs to ms.
The model function (red) line is a global (single) model for all depicted fFCS curves. In the MFD histograms high FRET, H, and low FRET, L, species were identified to generate a variant specific set of filters, that are compiled in Appendix 1—table 6. These filters were used to calculate two species cross-correlation functions,
Figure 3—figure supplement 5.
Single-molecule fluorescence measurements under dimer conditions and in the presence of nucleotides.
(A) Multiparameter single-molecule fluorescence measurements of a set of comparable hGBP1 variants that is weakly affected in their GTP hydrolysis to different extent by the introduced mutations and labels. The labeling positions N18C and Q254C are on opposing sites of the molecule. LP and UP refer to labeled protein and unlabeled protein, respectively. In the presence of UP (10 µM) and GDP-AlFx (100 µM) hGBP1 forms a dimer and undergoes significant conformational changes. These conformational changes were detected for the variants with weakly (N18C/Q577C) and variants stronger affected in their GTP hydrolysis (Q254C/Q540C) & (N18C/Q577C). The mutation Q577C has for the labeled and the unlabeled hGBP1 no effect on the specific activity. The mutation Q540C affects GTP hydrolysis activity of the labeled and the unlabeled hGBP1 equally strong. The mutation Q254C affects the GTP hydrolysis activity only the presence of a dye. (B) Control measurements of the variant N18C/Q577C to study the effect of GTP binding on hGBP1. Under single-molecule conditions (pM concentration) the intensity-based FRET indicators (FD/FA and the FRET efficiency, E) are independent on the presence of the substrate GTP. In the presence of GTP and hGBP1 at micromolar concentration oligomerization / dimerization occurs and hGBP1 undergoes a conformational change highlighted by a change of the FRET indicators.
To cover sub-µs to ms dynamics, we performed MFD smFRET experiments on freely diffusing molecules. We determine for every molecule the average fluorescence lifetime of the donor, 〈
To quantify the dynamics, we performed filtered FCS (fFCS) and jointly analyze all species cross-correlation functions (
We found for hGBP1 in solution a conformationally heterogeneous ensemble that can be approximated by conformers M1 and M2 (TCSPC), no significant shape changes of non-farnesylated hGBP1 on a timescale up to 200 ns (NSE), and complex kinetics spanning the µs-range mainly associated to α12/13 and the middle domain (fFCS) that is unaffected by a nucleotide analog as a substrate. Based on the distance and the dynamic information, we propose a complex motion of α12/13 relative to the LG and the middle domain and additional intermediate conformational states, captured by fFCS through their kinetic fingerprint.
Essential motions determined by molecular dynamics simulations
We performed molecular dynamics (MD) simulations without experimental restraints to (
To sum up, the MD simulations cover timescales of a few microseconds, show potential directions of motions, and identified a pivot point between the LG and the middle domain. In agreement with NSE on the simulation timescale, the overall shape is mainly conserved, and large conformational changes are rare events. The helices α12/13 were mobile and exhibited a limited ‘rolling’ motion along the LG and middle domain that could connect the conformers
Experimentally guided structural modeling
Altogether, the SAXS, NSE, EPR, and FRET measurements give a unified and consistent view on hGBP1 conformational dynamics. As recommended in Lerner et al., 2021, the assessment of sample properties and function (effects of mutations and labeling on enzyme properties, temperature effects of the conformational equilibrium), the estimation of the uncertainty of the determined interlabel distances and the consistency check between distinct measurement methods in Appendix 2 provide confidence that our data are suitable for quantitative integrative structural modeling.
Thus, we used the obtained structural experimental information, the kinetics, the MD simulations, and the prior structural information provided by existing crystal structures to create new structures for M1 and M2 by experimentally guided structural modeling. Previously, we developed approaches for integrative modeling using FRET data (Sindbert et al., 2011; Kalinin et al., 2012) that could successfully resolve three short-lived conformational states of proteins in two benchmark studies: (1) For a large GTPase with synthetic simulated data (Dimura et al., 2016) and (2) the enzyme Lysozyme (T4L) of the bacteriophage T4 with experimental ensemble and single-molecule data (Dimura et al., 2020; Sanabria et al., 2020). Here, we extended this framework to incorporate DEER and SAXS data. The framework combines the experimental data in meta-analysis via their information content. A detailed description of our integrative modeling can be found in Materials and methods and Appendix 3.
In a nutshell, we generate quantitative structures in three major steps (Figure 4A): (
Figure 4.
Integrative modeling workflow and structure validation.
A detailed description and the used data can be found in Appendix 3. (A) The workflow combines rigid body docking (RBD), structural refinements, and molecular dynamics (MD) simulations. Rigid bodies (RBs) are identified by MD simulations and principal components analysis (PCA) (Materials and methods). (B) RBD representation of hGBP1: LG-domain (blue), the middle domain (gray), helix α12 (green), helix α13 (orange). The numbers correspond to the RB amino acid ranges. The crosses mark the FRET (black) and the EPR (magenta) labeling positions. The RBD considers the label distribution illustrated for a FRET pair by semi-transparent green (donor) and red (acceptor) surfaces. (C) Left: outline of (Appendix 3, Equation 27) and (Appendix 3, Equation 28) for all (M1, M2) pairs of structures (left). Confidence levels of the meta-analysis (Materials and methods, Equation 20) that discriminates (M1, M2) pairs (right). Red and dark yellow regions correspond to p-values smaller than 0.68 and 0.95, respectively. (D) Experimental validation of the best pair of structures. Comparison of experimental (for DEER and for FRET ) and modeled label distances (for DEER and FRET ). Specific symbols display label distances for label pairs with distinct (▲) and equal (●) values for M1 and M2, respectively (see Appendix 1—table 1). For SAXS the scattering curve (black line) of the structure pair (M1, M2) is compared to the experimental data (orange line) by the weighted residuals to the top. (E) The standard deviation, SD, of the pairwise Cα-Cα distance of the experimental ensemble with a p-value <0.68 (lower triangles) highlights the structural uncertainty. normalized by the computed by the experimental uncertainty validates the structures. (F) Root mean square fluctuations (RMSF) of the Cα atoms of structures with a p-value <0.68 are displayed for the globally aligned ensemble.
Figure 4—figure supplement 1.
Analysis of molecular dynamics simulations, conformational space, identification of flexible regions, ensemble selection.
(A) Root mean squared deviation (RMSD)
Figure 4—figure supplement 2.
Analysis of structure generation and discrimination.
(A) Cumulative probability α that for a given F-value a proposed structural model is significantly worse than the best-found structural model. The experimental degrees of freedom (dofd,SAXS) for SAXS was varied from 11 to 24 taking the values 11, 12, 13, 14, 15, 16, 17, 18, 20, 22, 24 with colors varying from blue to yellow. The cumulative probabilities were calculated using the best model as a reference () and an estimate of dofm ~10 for the degrees of freedom of the model (Appendix 3, Equation 29). The dofd,SAXS was varied to assess the influence of the relative weights of DEER, FRET and SAXS (Materials and methods, Equation 20). (B) Fraction of discriminated structures
We generate new structures (Figure 4A, steps 4–5) by sampling the conformational space of a coarse grained (cg) hGBP1 representation using FRET and DEER restraints. The representation (Figure 4B) is based on an order-parameter based rigidity analysis (Figure 4—figure supplement 1D), knowledge on the individual domains (Low and Löwe, 2010; Chen et al., 2017). It can reproduce the motion of the MD simulations (Figure 3F). For maximum parsimony, the DEER, FRET, and SAXS data were described by pairs of the structures (M1, M2) ranked by their agreement with SAXS, and DEER, FRET using and , respectively (Figure 4C; Appendix 3). The pair best agreeing with SAXS has a middle domain kinked towards the LG domain. A SAXS ensemble analysis revealed species population fractions for M1 between ~0.1–0.7 (Figure 4—figure supplement 1E, p-value = 0.68). A meta-analysis by Fisher’s method jointly scores pairs of structures considering all available data (Figure 4A, step 6b) and estimates for the effective degrees of freedom (dof) of the representation and the experiments (Figure 4C). A stability test demonstrates that varying the dofs has a minor influence on the structure (Figure 4—figure supplement 2A). A combined p-value of 0.68 discriminates 95% of all (M1, M2) pairs (Figure 4C, red area; Figure 4—figure supplement 2B) leaving models with average RMSDs of 11.2 Å and 14.5 Å for M1 and M2, respectively. The uncertainties are largest for α12/13 (Figure 4E). The pair of structures are validated for DEER and FRET comparing experimental and modeled average distances (Figure 4D
The standard deviation of pairwise Cα distances, , reveals alignment free regions of low and high variability (Figure 4E, lower triangles). To check if the variability exceeds the expectances based on experimental precision, is normalized by computing a weighted (normalized) precision, (Figure 4E upper triangles). The reference is the precision of "ideal and perfect" model ensembles, determined using the experimental uncertainties under the assumption, that the best experimental determined model is the ground truth. For M1, this procedure yields a distribution for the weighted precision of the recovered structural models that fluctuates around unity, the theoretical optimum (Figure 4E, left). The weighted precision for M2 close to the C-terminus (end of helix α12 and α13) is lower than expected (Figure 4E, right), presumably due to granularity of the model or systematic experimental errors. The heterogeneity of the structural ensembles judged by their root-mean-squared-fluctuations (RMSF) is in the expected range of ~7 and~9 Å, for M1 and M2 respectively (Figure 4F). We deposited the conformational ensemble with all meta data at the prototype archiving system PDB-Dev with the ID: PDBDEV_00000088.
To visualize differences among the structural models, we aligned the selected conformers to the LG domain. This demonstrates that in M1 and M2 α12/13 binds at two distinct regions of the LG domain (Figure 5A, red spheres). In M1, α12/13 binds to the same side of the LG domain as in the known crystal structure (PDB-ID: 1DG3). In M2, α12/13 binds to the opposing side of the LG domain. In a global alignment of the M1 and M2 structures, the best representatives of the ensembles visualize the transition between M1 and M2. A rearrangement of residues 306–312 results in a rotation of the middle domain around a pivot point (Figure 5B, cyan circle) and describes the experimental data. The relocation of α12/13 agrees well with global motions identified by PCA of the MD simulations (Hamelberg et al., 2004). In the transition from M1 to M2 α12/13 ‘rolls’ along the LG domain, while the middle domain rotates and kinks towards the LG domain. M1 is comparable to the crystal structure except for a kink of the middle towards the LG domain, the movement of α12/13 stops on the opposite side of the LG domain.
Figure 5.
Selected conformers and corresponding dimer models based on integrative modeling structures using of DEER, FRET, and SAXS data.
(A) All structures for M1 and M2 were aligned to the LG domain and are represented by orange and gray dots, indicating the Cα atoms of the amino acids F565 and T481, respectively. The structures best agreeing with all experiments are shown as cartoon representation (ribbon presentation see Figure 5—figure supplement 1). Non-rejected structures (p-value = 0.68, Figure 4—figure supplement 1E) represented by red spheres. The ensemble has been at deposited at PDB-Dev with the ID: PDBDEV_00000088. (B) Global alignment of all selected structures (p-value = 0.68). In the center, the structures best representing the average of the selected ensembles are shown. The transition from M1 to M2 (average correlation times 10–150 µs) can be described by a rotation around the region connecting the LG with the ligand binding site (magenta cross) and the middle domain (cyan circle). (C) Potential hGBP1:hGBP1 dimer structures constructed by superposing the head-to-head interface of the LG domain (PDB-ID: 2B92) to the full-length crystal structure (1DG3). The LG and middle domain are colored in blue and gray, respectively. Helices α12 and α13 are colored in green and orange, respectively.
Figure 5—figure supplement 1.
Selected conformers models based on integrative modeling structures using of DEER, FRET, and SAXS data.
(A) The pair of structural models (M1, M2) selected from the structural ensemble generated by DEER- and FRET-measurements best corresponding to the SAXS scattering data is shown in a cartoon representation. Both structural models are aligned to the LG domain.
Discussion
In non-farnesylated hGBP1 we found two conformations (M1 and M2), determined corresponding structures by integrative modeling, and mapped the M1/M2 exchange dynamics by NSE spectroscopy and fFCS. NSE showed no shape changes on the ns-timescale up to 200 ns. fFCS on a network of FRET-pairs revealed considerable dynamics on slower time scales (2–300 µs, Figure 3). The distribution of dynamics over such a wide range is indicative of a frustrated/rugged potential energy landscape with several substates and multiple kinetic barriers. Structural models for M1 and M2 based on SAXS, DEER, and FRET data revealed that the middle domain kinks towards the LG domain and that the helices α12/13 are bound on opposite sides of the LG domain. Notably, largest relative changes in DA distances are correlated with the fastest relaxation time (Figure 3D, Appendix 1—table 6). These findings are self-consistent, as the conformational transition from M1 to M2 and vice versa is complex and may cause a distribution of relaxation times, indicating a rough energy landscape with several intermediates, and the dynamics is mainly associated to α12/13. Analogous to protein folding, where (Chung et al., 2012) monitored the transition from the unfolded to the folded state and defined a transition path time, it would be intriguing to define an effective time for the conformational transition from M1 to M2. The conformational transition time would be a convolute of all observed relaxation times (Figure 3, Appendix 1—table 6) that is expected to be in the sub-millisecond time range. To sum up, the experiments can be described by two conformational states separated by a rugged energy landscape, resulting in slow transition invisible on the NSE timescale. The smFRET measurements demonstrate that this transition is an intrinsic property of non-farnesylated hGBP1 that does not depend on the presence of substrate (pathway (i) in Figure 1B).
To understand the functional relevance of M1 and M2, various observations and existing experimental information on farnesylated hGBP1 must be considered. We previously speculated that the farnesyl anchor acts as a ‘safety latch’ that attaches α12/13 to the LG domain. Nevertheless, we identified monomeric as well as dimeric forms of farnesylated and non-farnesylated hGBP1 by SEC-SAXS that both require large structural rearrangements (Lorenz et al., 2020). Thus, the dimerization, as the first step in oligomerization of hGBP1, is a feature that demands flexibility of the structure as deduced from major structural rearrangements described so far (Vöpel et al., 2014; Ince et al., 2017; Shydlovskyi et al., 2017). In particular, large movements of the LG, the middle domain and helices α12/13 against each other are required to establish the elongated building blocks of the polymer (Shydlovskyi et al., 2017). It is also most conceivable that multiple dynamically interchanging configurations of the sub-domains need to be sampled to assemble the highly ordered protein. Dynamins and farnesylated hGBP1 form highly ordered oligomers (Shydlovskyi et al., 2017) requiring at least two binding sites. We previously showed that non-farnesylated hGBP1 forms dimers via the LG domains (in a head-to-head manner)
Structure-wise, we found that the middle domain is kinked towards the LG domain as found for other dynamins (Low and Löwe, 2010; Chen et al., 2017). Moreover, our data supports two conformations with distinct binding sites of helix α12/13 that can be explained by major rearrangements of the region connecting the middle and the LG domain. Prakash et al., 2000 described already the interconnecting region of LG and middle domain, which comprise residues 279–310 including a small β-sheet and α-helix 6. The packing of helix α6 (residues 291–306) against α1/β1 of the LG domain and against helix α7 of the middle domain was hypothesized to stabilize the relative location of LG and middle domain against each other. Most intriguingly, the Sau group reported on the importance of helix α6 for full catalytic activity of hGBP1 and for oligomer formation. They could also clearly establish the relationship between oligomer formation and defensive activity against hepatitis C virus showing that impairing catalytic activity and oligomer formation by mutations leads also to a decreased antiviral activity (Pandita et al., 2016). These observations support our conclusions as to the importance of the movements around the pivot point located close to α-helix 6. Similar movements have been reported for other dynamin-like proteins, where the GTPase domain rearranges with respect to the middle domain along the catalytic cycle (Faelber et al., 2012; Kalia et al., 2018; Cui et al., 2021).
Previous data revealed two hGBP1 dimer conformations. In the major populated D2 conformation two α13 helices dimerize while in the minor D1 conformation helix α13 are separated (Kravets et al., 2012; Vöpel et al., 2014; Kravets et al., 2016; Shydlovskyi et al., 2017). Our new findings in this work lead to a common model which describes the reaction pathway of hGBP1 from a monomer to the formation of mesoscale droplets in vitro and living cells (Figure 6). We found that M1 is the prevailing conformation in solution. Thus, even though hGBP1 is flexible it likely first dimerizes via the LG domain to form a stable D2 dimer. All structural requirements for this multi-step conformational rearrangement for positioning the two interaction sites and defining the molecular polarity are already predefined in the monomeric hGBP1 molecule. In the absence of substrate and other GBP molecules, hGBP1 adopts at least two distinct conformational states. Upon addition of GTP, the LG domain can bind to another protomer, whilst the conformational dynamics appear to remain unchanged (Vöpel et al., 2014) which agrees with our current findings. When two GTP-bound hGBP1s associate, a head-to-head dimer is formed either in a M1:M1, M2:M2 or a M1:M2 configuration. As the M1:M2 dimer has a higher stability, the α13 helices of the two subunits associate and the equilibrium is shifted towards the M1:M2 dimers (Vöpel et al., 2014).
Figure 6.
Potential oligomerization pathways of the human guanylate binding protein 1 (hGBP1) summarizing current experimental findings (Kravets et al., 2012, Vöpel et al., 2014; Kravets et al., 2016; Shydlovskyi et al., 2017).
In the presence and absence of a nucleotide, hGBP1 is in a conformational exchange with a Pivot point between LG and middle domain resulting in at least two conformational states M1 and M2 with a correlation time of 2–300 µs. Binding of a nucleotide to the LG domain activates hGBP1 for dimerization. After hGBP1 dimerization via the LG domains conformational changes of the middle domains and the helices α12/13 lead to an association of both helices α13. The species fractions for respective populations are given as numbers on top of the wells of a schematic energy landscape (black line). The substrate GTP lowers the activation barrier (red line). Under turn-over of GTP, farnesylated hGBP1 further self-assembles to form highly ordered, micelle-like polymers.
Figure 6 highlights the capability of hGBP1 to form networks during phase separation. Notably, hGBP1 shares these features with other proteins that also undergo phase separation. As observed in this work, conformational flexibility, multivalent interactions and amphiphilicity were reported as important factors for phase separation (Banani et al., 2017; Cui et al., 2021). Moreover, directionality is introduced because hGBP1’s interaction sites have distinct affinities that define the polarity of the formed molecular assembly. The high affinities of LG domains ensure formation of a dimeric encounter complex already at low concentrations in the first step. The conformational flexibility of hGBP1’s effector domain promotes the second key step for multimerization - the association of helices α13 that makes the dimer amphiphilic.
In a more general view, our results on hGBP1 demonstrate that the exchange between distinct protein conformations is usually encoded in its design (pathway
In a broader perspective, this work and further experimental studies Hellenkamp et al., 2017; Borgia et al., 2018; Lerner et al., 2018; Dimura et al., 2020; Sanabria et al., 2020; Lerner et al., 2021; Agam et al., 2022, Hamilton et al., 2022 demonstrate the great capabilities of integrative label-based studies in combination with other experimental techniques to resolve the structure and dynamics of proteins under native conditions. This information on the promiscuous nature of proteins can contribute to shape a dynamic view on these macromolecules that links structural states and conformational dynamics with function. In case of hGBP1, the intrinsic flexibility is crucial for oligomerization (Figure 1 and Figure 6). Moreover, the obtained knowledge paves the way toward dynamic structural biology where structural models and kinetic information can be archived and disseminated in databases such as the prototype archiving system PDB-Dev (Berman et al., 2019).
Materials and methods
Protein expression and labeling
Expression and purification
SAXS experiments were performed on native non-farnesylated hGBP1 variants. Cysteine non-farnesylated variants for EPR and fluorescence experiments are based on cysteine-free hGBP1 (C12A/ C82A/ C225S/ C235A/ C270A/ C311S/ C396A/ C407S/ C589S) and were constructed in a pQE80L vector (Qiagen, Germany) following the instructions of the QuikChange site-directed mutagenesis kit (Stratagene, USA) according to Vöpel et al., 2009; Vöpel et al., 2014. Neither amino acid positions in direct proximity to the nucleotide binding pocket nor inside the G domain dimerization interface nor charged amino acids on the protein surface were taken into consideration for labeling (Tsodikov et al., 2002). All chosen positions had an accessible surface area (ASA) value higher than 60 Å2. Previously, these mutations were shown to only weakly affect non-farnesylated hGBP1’s function (Vöpel et al., 2009; Vöpel et al., 2014). New cysteines were introduced at various positions of interest (N18C, Q254C, Q344C, T481C, A496C, Q525C, V540C, Q577C). The GTPase activity of the labeled and unlabeled non-farnesylated hGBP1 variants was quantified by an assay as previously described (Kunzelmann et al., 2006) (Appendix 2). The mutagenesis was verified by DNA sequencing with a 3130xl sequencer (Applied Biosystems, USA). hGBP1 was expressed in BL21-CodonPlus(DE3)-RIL (Supplier Agilent) and purified following the protocol described previously (Praefcke et al., 1999). A Cobalt-NTA-Superflow was used for affinity chromatography. No glycerol was added to any buffer as it did not make any detectable differences. To not interfere with the following labeling reactions, the storage buffer did not contain DTT or DTE. Protein concentrations were determined by absorption at 280 nm according Gill and Hippel using an extinction coefficient of 45,400 M–1 cm–1. Tests of enzyme activity and function demonstrate that the effect of mutations and labeling on non-farnesylated hGBP1’s function is small (Appendix 2).
Protein labeling
FRET labeling was performed in two steps. To start the first labeling reaction, a solution with a hGBP1 concentration 100–300 µM in labeling buffer containing 50 mM Tris-HCl (pH 7.4), 5 mM MgCl2, 250 mM NaCl was gently mixed with a 1.5-fold molar excess of Alexa647. After 1 hour incubation on ice, the unbound dye was removed using a HiPrep 26/20 S25 desalting column (GE Healthcare, Germany) with a flow rate of 0.5 ml/min. After this first labeling step, double, single, and unlabeled proteins were separated based on the charge difference introduced by the coupled dyes using anion exchange chromatography on a ResourceQ column (GE Healthcare, Germany) and a salt gradient running from 0 to 500 mM NaCl over 120 ml at a pH of 7.4 and flow rate of 2.0 ml/min. The peaks in the elugram were analyzed for their degree of labeling (dol) by measuring their absorption by UV/Vis spectroscopy at a wavelength of 280 nm and 651 nm. The fraction with the highest, single-acceptor labeled protein amount was labeled with a fourfold molar excess of Alexa488 C5 maleimide (Alexa488). The unreacted dye was separated as described for the first labeling step. Finally, the degrees of labeling (dol) for both dyes were determined (usually 70–100% for each dye). The dol was determined by absorption using 71,000 M–1 cm–1 and 265,000 M–1 cm–1 as extinction coefficients for Alexa488 and Alexa647, respectively. The labeled proteins were aliquoted into buffer containing 50 mM Tris-HCl (pH 7.9), 5 mM MgCl2, 2 mM DTT, shock-frozen in liquid nitrogen and stored at –80 °C. We determined a Förster radius
The spin labeling reactions were conducted at 4 °C for 3 hr using an 8-fold excess of (1-Oxyl-2,2,5,5-tetramethylpyrroline-3-methyl) methanethiosulfonate (MTSSL) as a spin label (Enzo Life Sciences GmbH, Germany). The reaction was performed in 50 mM Tris, 5 mM MgCl2 dissolved in D2O at pH 7.4. Unbound spin labels were removed with Zeba Spin Desalting Columns (Thermo Fisher Scientific GmbH, Germany) equilibrated with 50 mM Tris, 5 mM MgCl2 dissolved in D2O at pH 7.4. Concentrations were determined as described before. Labeling efficiencies were determined by double integration of CW room temperature (RT) EPR spectra by comparison of the EPR samples to samples of known concentration. In all cases, the labeling efficiencies were ~90–100%.
Small angle X-ray scattering
SAXS experiments were performed on the beamlines X33 at the Doris III storage ring, DESY and at the BM29, ESRF (Pernot et al., 2013) using X-ray wavelengths of 1.5 Å and 1 Å, respectively. On BM29 a size exclusion column (Superdex 200 10/300 GL, GE Healthcare) was coupled to the SAXS beamline (SEC-SAXS). The scattering vector
SAXS allows determining the shape and low-resolution structure of proteins in solution by the measured scattering intensity
SAXS data was analyzed using the ATSAS software package (Petoukhov et al., 2012). Theoretical scattering curves of the crystallographic and the simulated structures of the monomer were calculated and fitted to the experimental SAXS curves using the computer program CRYSOL. The distance distribution function
Pulse EPR (DEER) experiments
Experiments were performed and are described by Vöpel et al., 2014. Briefly, experiments were carried out at X-band frequencies (~9.4 GHz) with a Bruker Elexsys 580 spectrometer equipped with a split-ring resonator (Bruker Flexline ER 4118X-MS3) in a continuous flow helium cryostat (CF935; Oxford Instruments) controlled by an Oxford Intelligent Temperature Controller ITC 503S adjusted to stabilize a sample temperature of 50 K. Sample conditions for the EPR experiments were 100 µM protein in 100 mM NaCl, 50 mM Tris-HCl, 5 mM MgCl2, pH 7.4 dissolved in D2O with 12.5% (v/v) glycerol-d8. DEER inter spin-distance measurements were performed using the four-pulse DEER sequence (Martin et al., 1998; Pannier et al., 2000):
(1)
with observer pulse (
The DEER data was analyzed using the software DeerAnalysis which implements a Tikhonov regularization (Jeschke et al., 2006). Background correction of the DEER signal dipolar evolution function (2)
was performed assuming an isotropic distribution of the spin-labeled hGBP1 molecules in frozen solution that is described by
(3)
Briefly, the resulting form factor is modulated with the dipolar frequency
(4)
that is proportional to the cube of the inverse of the inter-spin distance (5)
from a given distance distribution was calculated by means of a kernel function
(6)
with for nitroxide spin labels. The optimum (7)
The regularization parameter was varied to find the best compromise between smoothness, that is, the suppression of artifacts introduced by noise, and resolution of . The optimum regularization parameter was determined by the L-curve criterion, where the logarithm of the smoothness of is plotted against the logarithm of the mean square deviation , allowing to choose the distance distribution with maximum smoothness representing a good fit to the experimental data.
Theoretical inter spin label distance distributions for MTSSL spin labels attached to structural models have been calculated using the rotamer library analysis (RLA) implemented in the freely available software MMM (Polyhach et al., 2011).
Neutron spin echo
Neutron spin echo (NSE) was measured on IN15 at the Institut Laue-Langevin, Grenoble, France. The NSE data were described by rigid body diffusion of non-farnesylated hGBP1 to detect intra-molecular dynamics. Four incident neutron wavelengths with 8, 10, and 12.2, and 17.5 Å were used. The buffer composition for NSE experiments was 50 mM TRIS, 5 mM MgCl2, 150 mM NaCl at pD 7.9 in heavy water (99.9 atom % D). The protein concentration was 30 mg/mL. The measured NSE spectra are shown in Figure 3—figure supplement 1. Effective diffusion coefficients
The rigid body diffusion (8)
where is the 6x6 diffusion tensor, which was calculated using the HYDROPRO program (Ortega et al., 2011).
The full NSE spectra were described by rigid body diffusion and internal protein dynamics according to Inoue et al., 2010:
(9)
with
where
The parameters
Dynamic light scattering was measured on a Zetasizer Nano ZS instrument (Malvern Instruments, Malvern, United Kingdom) in D2O buffer identical to that used in the NSE experiment. Autocorrelation functions were analyzed by the CONTIN like algorithm (Provencher, 1982) to obtain the translational diffusion coefficient
Fluorescence spectroscopy
Ensemble and single-molecule FRET experiments were performed at room temperature in 50 mM Tris-HCl buffer (pH 7.4) containing 5 mM MgCl2 and 150 mM NaCl. All ensemble measurements were performed at concentrations of labeled protein of approximately 200 nM. The single-molecule (sm) measurements were performed at concentrations of labeled protein of approximately 20 pM to assure that only single-molecules were detected. All sm MFD-measurements probing the hGBP1 apo state were performed under two conditions: (
Ensemble fluorescence time-correlated single-photon-counting (TCSPC) measurements of the donor fluorescence decay histograms were either performed on an IBH-5000U (HORIBA Jobin Yvon IBH Ltd., UK) equipped with a 470 nm diode laser LDH-P-C 470 (Picoquant GmbH, Germany) operated at 8 MHz or on a EasyTau300 (PicoQuant, Germany) equipped with an R3809U-50 MCP-PMT detector (Hamamatsu) and a BDL-SMN 465 nm diode laser (Becker & Hickl, Germany) operated at 20 MHz. The donor fluorescence was detected at an emission wavelength of 520 nm using a slit-width that resulted in a spectral resolution of 16 nm in the emission path of the machines. A cut-off filter (495 nm) in the detection path additionally reduced the contribution of the scattered light. All measurements were conducted at room temperature under magic-angle conditions. Typically, 14·106–20·106 photons were recorded at TAC channel-width of 14.1 ps (IBH-5000U) or 8 ps (EasyTau300). When needed, the analysis considers differential non-linearities of the instruments by multiplying the model function with a smoothed and normalized instrument response of uncorrelated room light. The fits cover the full instrument response function (IRF) and 99.9% of the total fluorescence. The IRFs had typically FWHM of 254 ps (IBH-5000U) or 85 ps (PicoQuant EasyTau300).
Single-molecule fluorescence spectroscopy data was acquired on a custom MFD setup with polarized excitation and detection in the ‘green’ and ‘red’ detection channels (Sisamakis et al., 2010). Briefly, a beam of linearly polarized pulsed argon-ion laser (Sabre, Coherent) was used to excite freely diffusing molecules through a corrected Olympus objective (UPLAPO 60X, 1.2 NA collar (0.17)). The laser was operated at 496 nm and 73.5 MHz. An excitation power of 120 µW at the objective has been used during experiments. The fluorescence light was collected through the same objective and spatially filtered by a 100 µm pinhole which defines an effective confocal detection volume of ~3 fl. A polarizing beam-splitter divided the collected fluorescence light into its parallel and perpendicular components. Next, the fluorescence light passed a dichroic beam splitter that defines a ‘green’ and ‘red’ wavelength range (below and above 595 nm, respectively). After passing through band pass filters (AHF, HQ 520/35 and HQ 720/150) single photons were detected by two ‘green’ (either τ-SPADs, PicoQuant, Germany or MPD-SPADs, Micro Photon Devices, Italy) and two ‘red’ detectors (APD SPCM-AQR-14, Perkin Elmer, Germany). Two SPC 132 single photon counting boards (Becker & Hickel, Berlin) have recorded the detected photons stream. Thus, for each detected photon the arrival time after the laser pulse, the time since the last photon and detection channel number (so, polarization and color) were recorded.
Burst-wise single molecule analysis
Briefly, as the first step in the burst-wise analysis, fluorescence bursts were discriminated from the background signal of 1–2 kHz of the single-molecule measurements by applying an intensity threshold criterion. Next, the anisotropy and the fluorescence averaged lifetime, , were determined for each burst. Moreover, the background, the detection efficiency-ratio of the ‘green’ and ‘red’ detectors, and the spectral cross-talk were considered to determine the FRET efficiency,
FRET-lines
By relating fluorescence parameters, FRET lines serve as a visual guide to interpret histograms of MFD parameters determined for individual molecules. The fluorescence weighted lifetime of the donor, 〈
Fluorescence decay analysis
Fluorescence decay analysis was performed using ChiSurf, an open-source software tailored for the global analysis of multiple fluorescence experiments. (10)
Here,
We assume that the same distribution of FRET-rate constants quenches all fluorescent states of the donor (quasi-static homogeneous model Peulen et al., 2017). Thus, can be expressed by:
(11)
where is the FRET-induced donor decay. The MFD measurements demonstrate that the major fraction of the dyes is mobile (Appendix 2). Therefore, we approximate (12)
Here,
For rigorous uncertainty estimates (13)
In the analysis of the seTCSPC data, the FRET-sensitized emission of the acceptor, , was considered to reduce the overall photon noise and a typical width of 12 Å was consistent with the data. was described by the convolution of , and :
(14)
All (15)
Filtered species cross-correlation functions
Filtered fluorescence correlation spectroscopy of the acquired MFD data was performed as previously described (Felekyan et al., 2012). In a global analysis, all 48 fFCS curves (two (16)
Herein (17)
Here,
We assume that the same diffusion term can describe all correlation curves of a sample and that the molecules diffuse in a 3D Gaussian illumination/detection profile. Under these assumptions is
(18)
where
The kinetic terms of the (19)
Here,
MD simulations and principal component analysis
MD simulations
We performed molecular dynamics (MD) and accelerated MD (aMD) (Hamelberg et al., 2004) simulations to identify collective degrees of freedom, essential movements, and correlated domain motions of hGBP1 by Principal Component Analysis (PCA) (Hamelberg et al., 2004). Molecular dynamics simulations were performed using the Amber14 package (Case et al., 2015) and the ff14SB force field. The simulations were started from a known crystal structure of the full-length non-farnesylated protein (PDB code: 1DG3) protonated with the program PROPKA (Bas et al., 2008) at a pH of 7.4, neutralized by adding counter ions and solvated in an octahedral box of TIP3P water (Jorgensen et al., 1983) with a water shell of 12 Å around the solute. The obtained system was used to perform unbiased MD simulations and aMD simulations (Hamelberg et al., 2004). Five unrestrained all-atom MD simulations were performed. Three of the five simulations were conventional MD (2 µs each) and two aMD simulations (200 ns each). The ‘Particle Mesh Ewald’ method (Darden et al., 1993) was utilized to treat long-range electrostatic interactions; the SHAKE algorithm (Ryckaert et al., 1977) was applied to bonds involving hydrogen atoms. For all MD simulations, the mass of solute hydrogen atoms was increased to 3.024 Da and the mass of heavy atoms was decreased respectively according to the hydrogen mass repartitioning method (Hopkins et al., 2015). The time step in all MD simulations was 4 fs with a direct-space, non-bonded cutoff of 8 Å. For initial minimization, 17500 steps of steepest descent and conjugate gradient minimization were performed; harmonic restraints with force constants of 25 kcal·mol–1 ·Å–2, 5 kcal·mol–1·Å–2, and zero during 2500, 10,000, and 5000 steps, respectively, were applied to the solute atoms. Afterwards, 50 ps of NVT simulations (MD simulations with a constant number of particles, volume, and temperature) were conducted to heat up the system to 100 K, followed by 300 ps of NPT simulations (MD simulations with a constant number of particles, barostat and temperature) to adjust the density of the simulation box to a pressure of 1 atm and to heat the system to 300 K. A harmonic potential with a force constant of 10 kcal·mol–1 ·Å–2 was applied to the solute atoms at this initial stage. In the following 100 ps NVT simulations the restraints on the solute atoms were gradually reduced from 10 kcal·mol–1 ·Å–2 to zero. As final equilibration step, 200 ps of unrestrained NVT simulations were performed. Boost parameters for aMD were chosen by the method as previously suggested (Pierce et al., 2012).
Principal components analysis (PCA)
In the MD simulations we found fluctuations of RMSD around the average structure of at most 8 Å RMSD for GTP bound and GTP free hGBP1 (Figure 4—figure supplement 1A). A correlation analysis of these RMSD trajectories reveals that the dynamics is complex (non-exponential) and predominantly in the 10–100 ns regime (Figure 4—figure supplement 1B). Structures deviating the most from the X-ray structure kink at the connector of the LG and the middle domain (Figure 3G). A PCA reveals that the first five principal components describe overall more than 60% of the variance of the MD and aMD simulations (Figure 3F). For PCA the GTPase domain (the least mobile domain) was superposed. The mode vectors of the principal components mapped to a crystal structure of hGBP1 (PDB-ID: 1DG3) illustrate the amplitude and the directionality of the principal components (Figure 3F). The first component (1) describes a motion of the middle domain towards the LG domain. In the second component (2) the middle domain and α13 move in opposite directions. The third component (3) is like the first component with a two times smaller eigenvalue. Component (4) is like the second component, except that the middle domain and α12/13 move in the same direction. Component (5) captures a similar directionality of motion for the middle domain and α12/13 as the second component. In component (5) however, the movement of α12/13 describes a breathing motion of the catalytic LG domain. The major motions of the PCA can be described by a rotation of the middle domain relative to the GTPase domain (Figure 3F, cyan sphere).
Integrative modeling
A detailed description of our integrative modeling with all steps can be found in Appendix 3. In short, DEER, FRET (eTCSPC), and the SAXS data were used to generate integrative structure for the states M1 and M2. Based on the experimental data and the MD simulations, the protein was decomposed into a set of rigid bodies. The assembly of the rigid bodies was sampled using DEER and FRET restraints and refined by NMSim (Ahmed and Gohlke, 2006) and MD simulations. All pairs of structures for M1 and M2 were enumerated and scored against the DEER, FRET and SAXS data. The probability for a pair of structures for the DEER and FRET, (20)
The probability and take the data information content into account. For SAXS the number of Shannon channels was used. For DEER and FRET, the information content was estimated using a greedy backward elimination feature selection algorithm to assess the effective number of informative measurements (Dimura et al., 2016). The estimates for the information content were varied to assess the impact on the resulting structures. Finally, an F-test on is used to discriminate pairs.
The structure generation follows the workflow (Figure 4A). Starting from the crystal structure (Figure 4A, steps 1–3), we generate new structures (Figure 4A, steps 4–5). A set of rigid bodies (RBs) (Figure 4B, Appendix 3) was defined based on the motions observed in the MD simulations (Figure 3F) taking the following information into account: (1) An order-parameter based rigidity analysis (Figure 4—figure supplement 1D); (2) Knowledge on the individual domains within the dynamin family (Low and Löwe, 2010; Chen et al., 2017); (3) Position dependent FRET and DEER properties (Appendix 1—table 1); and (4) The SAXS experiments that suggest a kink in hGBP1’s middle domain (Appendix 3, Structure representation). To this RB assembly, we applied DEER and ensemble FRET restrains for guided RB docking (RBD) (Appendix 3) (Kalinin et al., 2012). In RBD, the DEER and FRET restraints were treated by AV and ACV simulations, respectively (Appendix 3, Simulation of experimental parameters). AVs for DEER restraints were calibrated (Figure 2—figure supplement 2) against established simulation approaches (Polyhach et al., 2011; Hagelueken et al., 2012).The RBD structures were corrected for their stereochemistry (Appendix 3, Generation of structures) and were clustered into 343 and 414 groups for the states M1 and M2, respectively. Group representatives were used as seeds for short (1–2 ns) MD-simulations. The MD trajectories were clustered into 3395 and 3357 groups for M1 and M2, respectively, before being ranked by the DEER, FRET, and SAXS individually (Appendix 3, Individual ranking of structures). For well-balanced structures and equalized experimental contributions Fisher’s method fused the experimental data in a meta-analysis (Figure 4A, step 6b) considering estimates for the degrees of freedom (dof) of the protein representation and the data (Moore, 1980; Mertens and Svergun, 2017) (Appendix 3, Model discrimination and quality assessment; Figure 4C, Combined screening). In the meta-analysis a p-value of 0.68 discriminates 95% of all structural models (Figure 4C, red area; Figure 4—figure supplement 2B). The quality of the selected structures is judged by comparison to the experiments and making use of the data uncertainty. The local quality of the structures was assessed by checking if their variabilities is above the statistically expectation. Reference structure ensembles are computed to normalize the experimental model precision to a reference precision (Appendix 3, Assessment of model precision and quality).
Data availability
The following material is available at Zenodo in two locations: Experimental data (https://doi.org/10.5281/zenodo.6534557): (i) fluorescence decays recorded by eTCSPC used to compute the distance restraints in Appendix 1—table 1 and Appendix 3—table 1, (ii) single-molecule multiparameter fluorescence data: all raw data, burst selection and calibration measurements, fFCS (filters and generated correlation curves) (iii) double electron-electron resonance (DEER) EPR data used for structural modeling, (iv) neutron spin-echo data and SAXS structure factor of non-farnesylated hGBP1. Scripts for structural modeling of conformational ensembles through integrative/hybrid methods using FRET, DEER and SAXS together with the initial and selected structural ensembles (https://doi.org/10.5281/zenodo.6565895). The experimental SAXS data and the ab initio analysis thereof are available in the SASBDB (ID SASDDD6). Structure models of hGBP1 based on experimental restraints were deposited to PDB-Dev (PDB-Dev ID: PDBDEV_00000088) using the FLR-dictionary extension (developed by PDB and the Seidel group) available on the IHM working group GitHub site (https://github.com/ihmwg/flrCIF; IHM Working Group, 2022). Further data sets generated during and/or analyzed during the current study are available from the corresponding author on request.
Detailed description of the experimental files available on Zenodo (doi 10.5281/zenodo.6534557):
Folder | Content |
---|---|
EPR_data.zip | double electron-electron resonance (DEER) EPR data |
FRET_data.zip | eTCSPC FRET data including fit results. Sample name includes labeled aa and used dyes |
NSE.zip | neutron spin echo data |
SMD_hGBP1-[sample].tgz | raw single molecule FRET data used for filter FCS and MFD analysis and calibration measurements. Sample name includes labeled aa |
Subfolders for an eTCSPC FRET measurement contain the following files:
File | Content |
---|---|
decay.dat | FRET or donor only decay |
prompt.txt | IRF |
whitelight.txt | Used to create linearization table for tac gates to do full correlation |
Subfolder for eTCSPC FRET measurement ‘Fit_results’ contains the following files:
File | Content |
---|---|
fit_data.txt | Fitted data |
fit_fit.txt | Fit curve |
fit_info.txt | Fit results |
fit_wr.txt | Residuals of the fit |
Subfolders for a single molecule FRET measurement are structured the following:
Folder Name | Content |
---|---|
‘Al488_Al647’ | Describes the used dyes, contains the measurement of one sample under various conditions including all files |
‘Sample’ | Raw data and burst analysis, including info file on burst selection under info-folder |
‘BID’ | Subfolder in burst analysis describing selected bursts used to create filters for fFCS |
‘fFCS’ | Subfolder in burst analysis, contains used lifetime filters, correlation curves and fits |
‘LP’ | Labeled protein, including all measurement files |
‘LP_nucleotide_UP’ | Labeled protein with a nucleotide and additional unlabeled protein, including all measurement files |
‘LP_UP’ | Labeled protein and additional unlabeled protein, including all measurement files |
‘buffer’ | Buffer measurement for background |
‘H2O’ | Water measurement for IRF |
‘Rh110’ | Free dye measurements for g-factor calibration |
Rh101 | Free dye measurements for g-factor calibration |
DNA | Calibration measurement for detection efficiency |
Code availability
Most general custom-made software is directly available from http://www.mpc.hhu.de/en/software. The software ChiSurf is available at https://github.com/Fluorescence-Tools/ChiSurf (Peulen et al., 2021). General algorithms and source code are published under https://github.com/Fluorescence-Tools. Additional computer code custom-made for this publication is available upon request from the corresponding authors.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2023, Peulen, Hengstenberg et al. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Guanylate binding proteins (GBPs) are soluble dynamin-like proteins that undergo a conformational transition for GTP-controlled oligomerization and disrupt membranes of intracellular parasites to exert their function as part of the innate immune system of mammalian cells. We apply neutron spin echo, X-ray scattering, fluorescence, and EPR spectroscopy as techniques for integrative dynamic structural biology to study the structural basis and mechanism of conformational transitions in the human GBP1 (hGBP1). We mapped hGBP1’s essential dynamics from nanoseconds to milliseconds by motional spectra of sub-domains. We find a GTP-independent flexibility of the C-terminal effector domain in the µs-regime and resolve structures of two distinct conformers essential for an opening of hGBP1 like a pocket knife and for oligomerization. Our results on hGBP1’s conformational heterogeneity and dynamics (intrinsic flexibility) deepen our molecular understanding relevant for its reversible oligomerization, GTP-triggered association of the GTPase-domains and assembly-dependent GTP-hydrolysis.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer