1 Introduction
Proteins made with global substitution of a single amino acid type by a selectively fluorinated analogue greatly facilitate their analysis by NMR spectroscopy (Sharaf and Gronenborn, 2015). Structural perturbations caused by the fluorine substitutions can be kept to a minimum if a single fluorine atom is installed in a methyl group as the resulting group has the freedom to respond to the increased spatial requirement of the C–F moiety by preferential population of those rotamers that are most readily accommodated by the chemical environment. Recently, we showed that the Escherichia coli peptidyl–prolyl isomerase B (PpiB), which contains five leucine residues, can be produced with high-level uniform substitution of leucine for (2,4)-5-fluoroleucine (FLeu1), (2,4)-5-fluoroleucine (FLeu2), or 5,5′-difluoroleucine (diFLeu; Fig. ) using cell-free protein synthesis (Tan et al., 2024). As demonstrated by X-ray crystal structures, the structural perturbations caused by these amino acid substitutions were minimal (Frkic et al., 2024a). Furthermore, the coupling constants were inversely correlated with the chemical shifts in a first experimental confirmation of the -gauche effect predicted by Oldfield and co-workers based on quantum calculations (Feeney et al., 1996). In the structure of PpiB, the leucine residues are isolated from each other. In contrast, the three leucine residues of GB1 are arranged such that methyl groups of neighbouring leucine residues can make van der Waals contacts (Fig. ). This situation may produce through-space scalar () couplings. Scalar couplings through non-bonded interactions are common for atoms containing free electron pairs (Hierso, 2014), and a coupling of 21 has been reported between two fluorotryptophan residues in a protein (Kimber et al., 1978).
Figure 1
Chemical structures of the fluorinated leucine analogues used in the present work. (2,4)-5-fluoroleucine (where fluorine is on the carbon), (2,4)-5-fluoroleucine (where fluorine is on the carbon), and 5,5′-difluoro-L-leucine are referred to in the following as FLeu1, FLeu2, and diFLeu, respectively.
[Figure omitted. See PDF]
1.1 Experimental procedures1.2 Fluorinated leucine analogues
Initially, the fluorinated leucine analogues FLeu1 and FLeu2 (Fig. ) were synthesised following published protocols (Moody et al., 1994; August et al., 1996; Charrier et al., 2004). diFLeu with and without substitutions was synthesised as described (Maleckis et al., 2022). Subsequently, FLeu1, FLeu2, and diFLeu were obtained as salts from Enamine (Ukraine).
1.3 Expression vectors
Expression vectors were based on pETMCSI (Neylon et al., 2000) and constructed with a C-terminal tag following a TEV cleavage site. The amino terminus was preceded by the 5′-nucleotide sequence of the T7 gene 10 to ensure high expression yields, which added the hexapeptide MASMTG. The full nucleotide and amino acid sequences are shown in Table S1 in the Supplement.
1.4 Protein expression
All protein samples were expressed by continuous exchange cell-free protein synthesis (CFPS) following an established protocol (Apponyi et al., 2008; Ozawa et al., 2012). The gene of the GB1 construct was PCR-amplified with eight-nucleotide overhangs to generate circularised DNA suitable for use in CFPS (Wu et al., 2007). Leucine was omitted when preparing the acid-soluble amino acid mixture. The fluoroleucine of interest was added from an aqueous stock solution to the outer buffer at a final concentration of 4 . The pH of the outer buffer was adjusted to 7.5. The CFPS reaction was conducted at 30 for 16 using 1 inner reaction mixture of S30 cell extract made from the E. coli BL21 strain and 10 outer buffer.
1.5 Protein purification
Proteins were purified using a 1 Ni–NTA gravity column (GE Healthcare, USA) equilibrated with buffer A (50 Tris–, pH 7.5, 300 ) using buffer B (same as buffer A but with 10 imidazole) for washing and buffer C (same as buffer A but with 300 imidazole) for elution. The purified proteins were dialysed overnight against a storage buffer (50 HEPES, pH 7.5, 100 ) and concentrated using an Amicon centrifugal ultrafiltration tube with a molecular weight cut-off of 3 .
1.6 Protein mass spectrometry
Intact protein mass analysis was performed using an Orbitrap Elite Hybrid Ion Trap–Orbitrap mass spectrometer equipped with an UltiMate 3000 UHPLC (Thermo Scientific, USA). The protein samples were injected via an Agilent ZORBAX SB-C3 Rapid Resolution HT Threaded Column using a 5 %–80 % gradient of acetonitrile with 0.1 % formic acid. The data were collected in positive ion mode. The protein masses were obtained by deconvolution using the Xtract function in the Qual Browser software tool of the program Xcalibur 3.0.63 (Thermo Fisher Scientific, USA).
1.7 Protein NMR conditions
All NMR spectra were measured at 25 on a 400 Bruker Avance NMR spectrometer equipped with a SmartProbe, allowing detection with decoupling. The protein solutions were in 90 % 10 % with 20 MES buffer, pH 6.5, and 100 or with 50 HEPES buffer, pH 7.5, and 100 . All spectra reported of GB1 made with diFLeu were recorded in MES buffer. All spectra reported of GB1 made with FLeu2 were recorded in HEPES buffer. All spectra reported of GB1 made with FLeu1 were recorded in HEPES buffer unless indicated otherwise. 0.1 trifluoroacetate (TFA) was added as an internal reference and calibrated to 75.25 .
1.8 Results
1.9 Protein yields and purity
Up to 2.7 of protein was obtained from 1 inner reaction mixture of the CFPS setup (Table S2 in the Supplement). The amino acid sequence of native GB1 contains three leucine residues, and the additional TEV cleavage site present in our constructs adds a fourth leucine residue. Intact protein mass spectrometry indicated that the predominant species contained fluorinated leucine analogues at all four sites. The species containing three or two fluorinated leucine analogues were also detected, but their intensity indicated that the chance of canonical leucine at any of the four sites was below 10 % (Fig. S1 in the Supplement). Mass spectra of GB1 produced with diFLeu in the presence of some canonical leucine delivered the natural protein as the main species followed by protein containing single leucine-for-diFLeu substitutions, illustrating the strong preference of the E. coli leucyl-tRNA synthetase for L-leucine over diFLeu (Fig. S2 in the Supplement). Complete exclusion of L-leucine from the CFPS reaction could not be achieved due to amino acid impurities in the S30 extract.
1.10 Protein stability
Thermal denaturation measured by circular dichroism at 216 showed that the melting temperatures the GB1 samples made with fluorinated leucine analogues ranged between about 66 and 72 , i.e. 9–15° lower than for the wild-type protein (Fig. S3 in the Supplement), indicating that the presence of groups decreases the stability of the protein.
1.11 1D NMR spectra
Figure shows the 1D NMR spectra of the GB1 variants produced with diFLeu (GB1-d), FLeu1 (GB1-1), or FLeu2 (GB1-2). In addition, Fig. b shows the spectrum of GB1 produced with diFLeu in the presence of canonical L-leucine (GB1-dd). The 1D NMR spectra resolve the signals of all fluorine atoms.
The chemical shifts were insensitive to the buffer and pH but very sensitive to the immediate chemical environment. A striking illustration are the very different chemical shifts observed in GB1-d when the sample was prepared with the addition of L-leucine to produce samples predominantly containing single diFLeu residues (GB1-dd; Fig. a and b). Comparison of the high-field and low-field ends of the spectra of GB1-dd and GB1-d shows that minor peaks observed for the GB1-d sample correspond to main peaks observed with GB1-dd and vice versa. The minor peaks in Fig. a can thus be attributed to a small amount of canonical leucine in the protein preparation. Conversely, the minor peaks in the spectrum of GB1-dd appear to correspond to peaks of the fully fluorinated GB1-d sample, although the only minor species present in significant amounts contains no more than two diFLeu residues. This indicates that the presence of a second diFLeu residue is sensed only if it is in the immediate neighbourhood. Position 7 features two neighbouring leucine sites (Fig. ), yet the group seems to sense predominantly a single neighbour, while the chemical shift of the group is less well conserved between the major species in GB1-dd and the minor species in GB1-d (Fig. a and b).
Figure 2
Solution structure of GB1 (PDB ID 3GB1; Kuszewski et al., 1999). The methyl groups of Leu5, Leu7, and Leu12 are shown in a space-filling representation with the - and -carbon atoms in orange and yellow, respectively. The side chain of Leu5 is inaccessible to the solvent, whereas the groups of Leu7 and Leu12 are partially and highly accessible, respectively. The colour code of the other amino acids is red for negatively charged, blue for positively charged, grey for hydrophilic, and yellow for hydrophobic amino acids.
[Figure omitted. See PDF]
Figure 3
1D NMR spectra of GB1 made with fluorinated leucine analogues using FLeu1 to produce GB1-1, FLeu2, for GB1-2 and diFLeu for GB1-d and GB1-dd. All spectra were recorded with decoupling during acquisition using a 0.5 recovery delay between scans. The resonance assignments are indicated by the sequence numbers of the four leucine sites. (a) GB1-d prepared with 4 diFLeu. Spectrum recorded of a 2 protein solution in 20 MES buffer, pH 6.5. Red dots mark the resonances assigned to groups. (b) GB1-dd prepared with a mixture of 0.5 leucine and 4 diFLeu. Spectrum recorded of a 4 protein solution in 20 MES buffer, pH 6.5. Stereospecific assignments are indicated as in (a). (c) Spectrum of a 2 solution of GB1-1 in 50 HEPES, pH 7.5. (d) Spectrum of a 2.2 solution of GB1-2 in 50 HEPES, pH 7.5.
[Figure omitted. See PDF]
In the case of the GB1-2 sample, minor peaks arose because the FLeu2 amino acid synthesised in house contained about 10 % FLeu1 as an impurity.
In the case of residue 59, which is in the flexible TEV protease recognition site of the C-terminal peptide segment of the protein construct, the chemical shifts are hardly impacted by the rest of the protein, as indicated by their conservation between the spectra of Fig. a and b. For the diFLeu residue in position 59 (Fig. a and b), we base the stereospecific assignment on the chemical shifts observed for this position in GB1-1 and GB1-2 (Fig. c and d). The stereospecific assignments of the other diFLeu residues were determined by 2D NMR experiments described below. They attributed the high-field signals of residues 5, 7, and 12 to the groups. Notably, the respective signals in GB1-1 are also at higher field than the corresponding signals in GB1-2 (Fig. c and d).
The relaxation times were of the order of 0.3 , and the full line widths at half height ranged between 7 and 18 . relaxation rates of GB1-d indicate that no signal possesses an intrinsic line width much greater than 12 (Table S3 in the Supplement). For a sample produced with deuterated diFLeu, where all five protons of the isopropyl group are replaced by deuterium (Maleckis et al., 2022), the measurements indicated a maximal intrinsic line width of 7 , suggesting that dipolar relaxation by the nearest protons contributes significantly to the relaxation. The broadest lines were observed for residue 5, the side chain of which is deeply buried in the core of the protein (Fig. ), hence expected to feature the least flexibility and the fastest transverse relaxation rates. Residue 7 is the next-most-buried residue, while the side chain of residue 12 is more highly accessible to the solvent (particularly the group; see Fig. ), and residue 59 is completely solvent-exposed.
Recording the NMR spectra without decoupling of the spins revealed broad multiplets with overlap between some of the resonances (Fig. ). The multiplet of each group is composed of a triplet of doublets due to two-bond couplings, , within each group (47 ) and the coupling with the methine proton of the isopropyl group. couplings obey a Karplus relationship (Williamson et al., 1968; Gopinathan and Narasimhan, 1971). If the coupling is small, the envelope of the multiplet appears like a triplet, but can also be as large as 44 (Tan et al., 2024), in which case the multiplet appears like a quartet. The resonances of residue 5 in GB1-1 and GB1-2 are examples of these two limiting cases (Fig. c and d).
Figure 4
1D NMR spectra without decoupling recorded of the samples from Fig. . (a) GB1-d made with diFLeu, (b) GB1-dd made with diFLeu diluted with canonical L-leucine, (c) GB1-1 made with FLeu1, and (d) GB1-2 made with FLeu2. The resonance assignments are indicated in blue. Red dots identify stereospecific assignments of the groups in diFLeu residues.
[Figure omitted. See PDF]
Interestingly, the multiplet of the group of residue 12 in GB1-dd displays narrower lines than the group (Fig. b) in agreement with a narrower signal in GB1-1 than in GB1-2 (Fig. ). This observation aligns with the greater solvent exposure of the group (Fig. ). The inverse correlation between NMR line width and solvent exposure suggests that faster rotation of the groups about the bond results in slower transverse relaxation.
1.12 NMR resonance assignmentsThe large couplings observed indicate that resonance assignments can be achieved by coherence transfer between and spins and linking the resonances of the isopropyl groups to the backbone protons by [,]-TOCSY and [,]-NOESY spectra. The chemical shifts of the groups are near 4 , and the methine resonances are between 1 and 2 . For GB1 made with diFLeu, a [,]-COSY spectrum connected the NMR signals belonging to the same residue (Fig. ).
Figure 5
[,] correlation spectrum of a concentrated solution of GB1-d (about 10 ). The -decoupled 1D NMR spectrum is shown at the top. The [,]-COSY spectrum was recorded with the pulse sequence 90°() – – 90°(),90°() – acquisition(). The cross peaks with the methine proton of the isopropyl groups, which identify the pairs of groups belonging to the same residue, are assigned in blue. Parameters: 51 , 217 , 1.4 total recording time.
[Figure omitted. See PDF]
To probe for the presence of scalar through-space couplings in GB1-d, we recorded a [,]-TOCSY spectrum. The spectrum yielded both intra-residual and through-space correlations (Fig. a). Interestingly, the intra-residual cross peak of residue 5 could not be observed, whereas the inter-residual connectivities with the nearest neighbour (residue 7) were intense. Residue 7 in turn showed cross peaks to residues 5 and 12, which were more intense than the intra-residual cross peaks. The absence of the intra-residual cross peak of residue 5 indicates that scalar couplings cannot be relied upon to connect the NMR signals of the groups of each diFLeu residue.
Figure 6
[,]-TOCSY and [,]-NOESY spectra of GB1-d. The 1D NMR spectrum is shown at the top along with the resonance assignments. Cross sections through the diagonal peaks at 220.4 are shown in red. (a) [,]-TOCSY spectrum (mixing time 60 ) recorded of an 0.8 protein solution. Parameters: DIPSI-2 mixing with 4200 RF field strength, 26 , 105 , 14 total recording time. (b) [,]-NOESY spectrum (mixing time 200 ) recorded of a 10 solution of GB1-d. Parameters: 13.5 , 108 , 12 total recording time, processed with 20 exponential line broadening in the dimension. Without cropping, the diagonal peak in the cross section would exceed the boundary of panel (b) twofold.
[Figure omitted. See PDF]
Notably, some of the most intense [,]-TOCSY cross peaks came about by couplings. To exclude the possibility of TOCSY cross peaks arising from nuclear Overhauser effects (NOEs), we also recorded a [,]-NOESY spectrum (Fig. b). The NOESY spectrum produced the intra-residual cross peak of residue 5 with greater intensity than the inter-residual NOEs. This illustrates the different dependence of NOEs and couplings on the internuclear distance, with couplings depending on close contacts between the fluorine atoms to create the necessary orbital overlap. Notably, although the NOESY spectrum had been recorded of a GB1-d sample with over 10-fold-higher protein concentration, the cross-peak intensities were markedly poorer in the NOESY than in the TOCSY spectrum.
The GB1 samples prepared with FLeu1 or FLeu2 (GB1-1 and GB1-2, respectively) offer fewer opportunities for couplings. The conformation shown in Fig. excludes direct contacts between groups, whereas van der Waals contacts between atoms of groups are arguably possible in view of the greater C–F bond length and larger van der Waals radius of fluorine compared with hydrogen. Even so, direct fluorine–fluorine contacts in GB1-2 depend on specific rotamer combinations of neighbouring groups and may be infrequent if the groups rotate.
Experimentally, residue 7 produced inter-residual cross peaks in the [,]-TOCSY spectra of GB1-1 and GB1-2 (Fig. ). In the case of GB1-1, the cross peaks were about 100 times smaller than the diagonal peaks. In the case of GB1-2, residue 7 produced cross peaks both with residue 5 and residue 12, and those cross peaks were only about 10 times smaller than the diagonal peaks. No single conformation of the group of residue 7 can simultaneously engage in fluorine–fluorine contacts with residues 5 and 12 (Fig. ), suggesting that the group populates multiple rotamers. Furthermore, the cross peak observed between the groups of residues 7 and 12 in GB1-1 suggests that these side chains enjoy greater conformational freedom than captured by the NMR structure 3GB1, which was determined with the aim of presenting the single best approximation to the average structure.
Figure 7
[,]-TOCSY spectra of 2 solutions of GB1-1 and GB1-2 recorded with 60 mixing time. The 1D NMR spectra are plotted on top with the resonance assignments in blue. (a) TOCSY spectrum of GB1-1 recorded in about 3 using 8.5 and 128 . The cross peaks between residues 7 and 12 are about 100 times weaker than the diagonal peaks. (b) TOCSY spectrum of GB1-2 recorded in about 12 using 8 and 171 . The cross peaks are about 10 times smaller than the diagonal peaks.
[Figure omitted. See PDF]
The fluorine–fluorine contacts observed in GB1-2 recapitulate the two strongest cross peaks observed with the high-field resonance of residue 7 in GB1-d (Fig. a). Assuming that the side-chain conformations are conserved between GB1-2 and GB1-d, this affords stereospecific assignments of GB1-d, assigning the high-field signals of residues 7 and 12 and the low-field signal of residue 5 to the spins of the respective groups. Given this assignment, the weaker interaction between the low-field signals of residues 7 and 12 indicates a contact between a and a group, which cannot occur in either GB1-1 or GB1-2. The generally greater cross-peak intensities observed in GB1-d may be a consequence of the greater steric crowding associated with the spatial demands of multiple fluorine atoms, bringing the spins into closer contact. In addition, the greater density of spins in GB1-d opens the chance for multiple magnetisation transfer steps during the TOCSY mixing period.
1.13 Estimates of from [,]-TOCSY spectraUsing the sample of GB1-d produced with deuterated diFLeu (Fig. S4a in the Supplement) to minimise the relaxation of , we recorded [,]-TOCSY spectra with increasing mixing time (27.6, 41.4, and 55.3 ), measured the integrals of cross peaks () and diagonal peaks () and calculated the ratio . In the approximation of a two-spin system and assuming that the cross peaks and diagonal peaks relax at the same rate, the ratio is expected to evolve during isotropic mixing with ) (Braunschweiler and Ernst, 1983). The largest couplings found in this way were about 2–3 (Fig. S5 in the Supplement).
1.14 Heteronuclear NMR for residue assignment
Heteronuclear [,]-NOESY (HOESY) spectra recorded with 150 mixing times showed NOEs with nearby protons (Fig. ). These NOEs delivered residue-specific resonance assignments as many of the corresponding nuclei were also detected in conventional homonuclear [,]-NOESY spectra. For example, residue 5 in GB1-1 and GB1-d displays NOEs to a resonance at about 0.8 . This resonance matches a -proton of Leu5, which in wild-type GB1 is the most high-field resonance due to aromatic ring currents from Phe28. In all three samples, the NMR signal of residue 5 produced stronger HOESY cross peaks than the other fluorinated leucine residues, while residue 59 delivered relatively weak cross peaks if any. This result indicates that a group produces stronger HOESY cross peaks when it is buried in the core of the protein than when it is solvent exposed and can rotate in an unhindered manner. The NMR assignments of residue 7 were confirmed similarly by comparison of the cross peaks observed in the HOESY and [,]-NOESY spectra.
Figure 8
[,]-HOESY spectra of GB1 produced with FLeu1, FLeu2, or diFLeu. The spectra were recorded with a mixing time of 150 . The corresponding 1D NMR spectra are shown at the top along with the resonance assignments. (a) HOESY spectrum of a 2.2 solution of GB1-1 in MES buffer, recorded using 38 , 136 , 9.6 total recording time. (b) HOESY spectrum of a 2 solution of GB1-2, recorded using 30 , 136 , 34 total recording time. (c) HOESY spectrum of a 10 solution of GB1-d, recorded using 31 , 108 , 2.3 total recording time.
[Figure omitted. See PDF]
In the case of the diFLeu residue in position 12, stronger cross peaks were detected for the low-field signal assigned to the than the group (Fig. c). In addition, the group of this residue displays a negative cross peak with the water resonance (at 4.75 ), as do both NMR signals of residue 59, indicating intermolecular NOEs with hydration water molecules featuring sub-nanosecond residence times (Otting et al., 1991). This confirms the solvent exposure of these fluorine atoms and agrees with the stereospecific assignments of residue 12 made by comparing the couplings with the protein structure. We observed no other negative NOE cross peaks with the groups.
Starting from the assignment of residue 5, the cross peaks observed in the [,]-TOCSY spectra provided an additional, straightforward assignment pathway for the spins in GB1-d and GB1-2 (Fig. b and 7b). In GB1-dd, as in GB1-1, the group of residue 12 produced only weak HOESY cross peaks. The HOESY spectrum thus did not identify the NMR signals belonging to the same diFLeu residue in position 12. This link, however, was easily established by correlations with the proton of the isopropyl group observed in a short-delay , correlation experiment.
1.15 Measurement of couplings and -gauche effectcouplings are governed by a Karplus relationship describing their dihedral angle dependence, and thus provide information about the rotameric states of the groups. Quantitative measurements of couplings in the 1D NMR spectra recorded without decoupling were hampered by spectral overlap and the presence of sample heterogeneities (Fig. ). Narrower multiplets were obtained for samples prepared with diFLeu versions that had been synthesised with instead of groups (Maleckis et al., 2022), where couplings were manifested in a sample made with diFLeu containing a group, whereas these splittings were absent from a sample prepared with deuterated diFLeu containing a group. A simple comparison of these spectra shows a correlation between the chemical shifts and splittings due to couplings (Fig. S4).
For more quantitative measurements of the couplings, we recorded short-delay , correlation experiments (Tan et al., 2024), which encode the coupling constants in the relative peak intensities of versus cross peaks. The results confirm the correlation between the coupling constants and the chemical shifts (Fig. ). This correlation is a hallmark of the -gauche effect, which associates a high-field chemical shift with the rotameric state of the group that positions the spin trans relative to the proton of the isopropyl group (Feeney et al., 1996; Tan et al., 2024; Frkic et al., 2024a, b). Conversely, the NMR resonance is shifted low-field if the spin is positioned trans relative to a carbon atom. The -gauche effect is most clearly illustrated by the low-field and high-field signals of residue 5.
Figure 9
Short-delay , correlation experiments for the measurement of coupling constants. The experiments were conducted with a constant-time evolution period of 7 to evolve the couplings and a refocusing INEPT period of 2.5 (Tan et al., 2024). The resonance assignments are indicated in blue. Red numbers indicate the coupling constants (in Hz) derived from the relative intensities of the – versus – cross peaks. (a) Spectrum recorded of a 2 solution of GB1-1. Parameters: 7 , 128 , 1.3 total recording time. (b) Spectrum recorded of a 2 solution of GB1-2 using 6.9 , 171 , 2.8 total recording time. (c) Spectrum recorded of a 0.8 solution of GB1-d using 7 , 180 , 5.3 total recording time. (d) Spectrum recorded of a 0.6 solution of GB1-dd produced with diFLeu diluted with Leu using 7 , 181 , 22.3 total recording time.
[Figure omitted. See PDF]
The immediate chemical environment of the spins also affects their chemical shifts. For example, the coupling of the high-field resonance of residue 7 in GB1-dd is smaller than for residue 5 (26 versus 40 ; Table S4 in the Supplement), yet the resonance appears more high-field in the spectrum (Fig. d). Quite generally, the chemical shifts of residue 7 are very sensitive to the presence or absence of fluorinated residues in positions 5 and 12 (Fig. c and d), highlighting the impact of the chemical environment.
Interestingly, the spins of residue 7 showed significantly different couplings between the GB1-d and GB1-dd preparations, suggesting somewhat different populations of the different rotameric states of the groups. The associated changes in chemical shifts are high-field and low-field as predicted by the -gauche effect.
Very large and very small couplings indicate that groups are trapped in pure trans or gauche rotamers, respectively, showing that the rotation of a group about the bond axis can be halted by the steric restraints in the tightly packed core of the protein. In the case of the E. coli peptidyl–prolyl cis–trans isomerase B (PpiB) produced with FLeu and diFLeu, we determined couplings ranging between 9 and 44 (Tan et al., 2024). The couplings observed in GB1 are less extreme, suggesting that each group populates more than a single rotamer. Using residue 59 located in the flexible TEV cleavage motif as a reference, a coupling of about 22 is indicative of a group that populates all three possible staggered rotamers. The different couplings observed for residue 5 (Table S4), which is the most deeply buried leucine side chain in the wild-type protein, thus indicate clear conformational preferences for its groups. The largest couplings were observed for diFLeu rather than FLeu1 or FLeu2 residues, as expected for fluorine–fluorine interactions biasing the conformational space of the groups (Marstokk and Møllendal, 1997; Wu et al., 1998; Lu et al., 2019).
On a technical note, the short-delay , correlation experiments delivered the chemical shifts with much greater sensitivity than the [,]-COSY experiment recorded without heteronuclear decoupling (Fig. ), assisting with the resonance assignments by comparison with [,]-NOESY spectra. In terms of sensitivity, the short-delay , correlation experiments were also far superior to the HOESY spectra. The chemical shifts of the spins were well conserved between the samples made with FLeu1, FLeu2, and diFLeu, ascertaining the resonance assignment of GB1-dd by comparison with GB1-d.
1.16 -NMR spectroscopyThe chemical shifts of the groups in the singly fluorinated leucine analogues FLeu1 and FLeu2 were shifted upfield by between 5.6 and 8.2 relative to the shifts of the methyl groups in the wild-type protein (Fig. S6 in the Supplement). Highly conserved and chemical shifts of the GB1 variants indicate that the 3D fold of the protein remains unchanged by the fluorinated leucine analogues. Therefore, any differences in chemical shifts reflect local rather than global effects. The heteronuclear single-quantum coherence (-HSQC) spectra showed the cross peaks of the groups in the dimension near 90 for GB1-1 and GB1-2 and about 86 for GB1-d (Fig. S7). In the dimension, most groups displayed two different chemical shifts for the diastereotopic spins, which, except for residue 5 in GB1-1, were unresolved in the short-delay , correlation experiments (Fig. ). The intensities of the -HSQC cross peaks of the groups of residue 5 were rather weak (similar to those of groups of other buried amino acid residues), which correlates with the relatively broad NMR signals observed for this residue. The other groups showed more intense -HSQC cross peaks on par with solvent-exposed groups. The methyl cross peaks of Leu5 are relatively weak also in wild-type GB1 (Fig. S6; Goehlert et al., 2004).
2 Discussion
The conformational impact of the fluorination of leucine methyl groups has previously been investigated in solution for only a single protein, PpiB, which contains five isolated leucine residues (Tan et al., 2024; Frkic et al., 2024a). The current findings recapitulate many of the findings made for PpiB.
- i.
In the case of the most buried residue, residue 5, the rotation of the groups is sufficiently hindered to bias the populations of the different rotamers in favour of a trans configuration of the group and a gauche configuration of the group as defined by the coupling constants. The size of the couplings indicates that these conformational biases are more pronounced in GB1-d than in GB1-1 and GB1-2, which may be attributed to unfavourable electrostatic interactions between parallel and antiparallel C–F bonds in a 1,3-difluoropropane moiety intrinsically limiting the conformational freedom (Marstokk and Møllendal, 1997; Wu et al., 1998; Lu et al., 2019).
- ii.
The large chemical shift dispersion of the NMR signals over many parts per million is mainly due to the -gauche effect, which attributes high-field and low-field shifts to trans and gauche rotamers (Feeney et al., 1996). Intermediate chemical shifts correlate with intermediate coupling constants and are thus indicative of averaging between different rotamers. A -gauche effect in leucine side chains has previously been reported also for the chemical shifts of leucine groups, which correlate with couplings with the -carbon (MacKenzie et al., 1996). To the best of our knowledge, the present work is only the third experimental example of the -gauche effect in groups (Fig. S8; Tan et al., 2024; Frkic et al., 2024b).
- iii.
The more solvent accessible groups displayed less extreme couplings and less extreme chemical shifts, suggesting more extensive averaging between different rotameric states. Larger and smaller couplings as well as greater chemical shift dispersions have been observed previously in PpiB (Tan et al., 2024), suggesting that the groups populate more than a single rotamer even in the buried residue 5.
- iv.
The line widths of the NMR signals vary greatly between different residues and, most strikingly for residue 12, between different groups. Narrow signals correlate with high solvent exposure. For wild-type GB1, order parameter values determined by relaxation measurements have been reported for the methyl groups of Leu12 ( 0.15), Leu7 (0.15), and Leu5 (0.55), showing that the methyl group symmetry axes are subject to motions, which are more prominent in situations of high solvent exposure (Goehlert et al., 2004). The relatively high order parameter of Leu5 correlates with NMR signals that are broader than any others.
- v.
For any given position in the protein, the relative chemical shifts observed between FLeu1 and FLeu2 are strongly predictive of the stereospecific assignments of a diFLeu residue at the same site. The same feature also prevails in PpiB (Tan et al., 2024).
The present work shows, for the first time, that through-space couplings can readily be detected between singly fluorinated groups in a protein. In previous work, we detected couplings between genetically encoded -phenylalanine and -tyrosine residues in the core of PpiB (Orton et al., 2021). Notably, however, the NMR spectra of PpiB constructs with multiple groups showed additional resonances, suggesting structural perturbations arising from the additional space requirements of groups. The NMR spectra of GB1 made with fluorinated leucine analogues also display weak additional resonances, but there is no evidence that they are due to structural heterogeneity. Instead, the additional signals are consistent with chemical heterogeneity arising from incomplete substitution of canonical leucine by fluorinated leucines or incomplete optical purity of the synthesised fluoroleucine. Cell-free protein synthesis enables the requisite high level of global substitution of canonical amino acids by fluorinated analogues.
The observation of couplings in GB1 is non-trivial as they depend on direct contact between the fluorine atoms. Crystal structures of PpiB showed that groups often populate multiple staggered rotamers that differ by rotation about the bond with the carbon atom they are bound to (Frkic et al., 2024a, b). Based on the 3D structure of wild-type GB1 (Fig. ), only specific rotamer combinations generate fluorine–fluorine contacts. A crystal structure of ubiquitin synthesised chemically with two FLeu1 residues indicated that the lowest energy conformation avoids fluorine–fluorine contacts (Alexeev et al., 2003). In the case of 1,3-difluoropropane, it is known that the polarity of C–F bonds discourages rotamers that produce fluorine–fluorine contacts (Marstokk and Møllendal, 1997; Wu et al., 1998; Lu et al., 2019). Therefore, the privileged attraction between fluorine atoms in perfluorinated polymers such as Teflon does not govern the interaction between the single fluorine atoms of groups. Nonetheless, the transient contacts arising from random rotations of the groups in GB1-1, GB1-2, and GB1-d suffice to generate observable couplings. As noted previously (Orton et al., 2021; Tan et al., 2024), the much steeper distance dependence of couplings compared with NOEs (Ernst and Ibrom, 1995; Mallory et al., 2000) strongly favours the detection of transient fluorine–fluorine contacts by [,]-TOCSY rather than [,]-NOESY experiments (Fig. ).
For the side chain of Leu12 in GB1, a very different angle has been reported by the crystal structures (1PGA, 1PGB; Gallagher et al., 1994; 2QMT; Frericks Schmidt et al., 2007) versus the NMR solution structure (3GB1; Kuszewski et al., 1999). As a result, the crystal structures expose the methyl group to the solvent, while the solution structure exposes the methyl group. The observation of NOEs with water together with different NMR line widths indicative of more facile rotation of the than group fully agree with the conformation of Leu12 depicted in Fig. , indicating that fluorinated leucine residues do not alter the solution structure. Simple rotations of the groups allow for accommodating the fluorine atoms in the energetically most favourable rotamers.
Establishing sequence-specific resonance assignments of the NMR spectra by 2D NMR techniques rather than site-directed mutagenesis is straightforward for small proteins like GB1. For larger proteins, site-specific selective installation of the fluorinated amino acids by genetic encoding (Orton et al., 2021; Qianzhu et al., 2020, 2022, 2024) will be helpful. Work towards this goal is in progress.
Data availability
The NMR data are available at 10.5281/zenodo.15266133 (Otting, 2025).
The supplement related to this article is available online at
Author contributions
YJT, EHA, and IDH prepared the protein samples and performed 1D NMR measurements. AM synthesised fluorinated leucine analogues with and without deuteration. GO coordinated the project, performed the 2D NMR measurements, and prepared the final paper and figures.
Competing interests
At least one of the (co-)authors is a member of the editorial board of Magnetic Resonance. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.
Disclaimer
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.
Acknowledgements
We thank Eliza Tarcoveanu for initial syntheses of FLeu1 and FLeu2.
Financial support
This research has been supported by the Australian Research Council (grant nos. CE200100012 and DP230100079).
Review statement
This paper was edited by Michael Summers and reviewed by Ad Bax and one anonymous referee.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2025. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Using cell-free protein synthesis, the protein G B1 domain (GB1) was prepared with uniform high-level substitution of leucine by (2
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details

1 ARC Centre of Excellence for Innovations in Peptide and Protein Science, Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
2 Latvian Institute of Organic Synthesis, Aizkraukles 21, 1006 Riga, Latvia