Prokaryotic transcription factors (TFs) responsive to exogenous physicochemical signals and their target promoters have been an invaluable source of biological parts for designing heterologous gene expression (Baneyx, 1999; Terpe, 2006; Yin et al., 2007). Most heterologous gene expression designs rely upon placing the DNA of the TF and its promoter artificially close to the target sequence for transcription of the gene(s) of interest (GoI; Marschall, 2017). The literature contains a large number of variations on such a basic theme, most often in the form of dedicated plasmid vectors that may be also endowed with additional features to optimize expression of such GoI, i.e. by modifying copy number, toxicity, mRNA stability, translation efficiency and other features (Zerbs et al., 2009; Kaur et al., 2018; Kent and Dixon, 2020). The majority of the studies describing heterologous expression systems are based on Escherichia coli as the host and the TFs/promoters are often retrieved from regulatory networks of the same organism. Popular devices of this sort include many variants of LacI/Plac (Heyneker et al., 1976; Lanzer and Bujard, 1988; Glascock ad Weickert, 1998), TetR/PtetA (Skerra, 1994; Korpela et al., 1998; Ramos et al., 2005), AraC/ParaBAD (Guzman et al., 1995; Haldimann et al., 1998), RhaS/PrhaBAD (Egan and Schleif, 1993; Haldimann et al., 1998; Kelly et al., 2016) and cI857/PL (Elvin et al., 1990; Love et al., 1996), which can be combined with the use of the selective viral T7 RNA polymerase/PT7 transcription machinery (Studier and Moffatt, 1986; Sørensen and Mortensen, 2005). Signals for triggering expression with these modules may vary from the natural inducers to synthetic structural alternatives as well as mere physical cues, e.g. temperature, cell density and medium composition (Andrews et al., 1996; Scholz et al., 2003; Henßler et al., 2005; Tang et al., 2008; Choudhary and Schmidt-Dannert, 2010; Tang and Cirino, 2011; Kelly et al., 2016; Taylor et al., 2016; Ellefson et al., 2018; Meyer et al., 2019). In principle, these systems, along with specialized strains, have thus far covered most – if not all – of the necessities for engineering E. coli. However, while use of numerous different TF-based transcriptional control systems have enabled the engineered control of gene expression for many organisms, surprisingly – with few exceptions (Balzer et al., 2013) – little work has attempted to standardize the description of transcriptional devices or rigorously and quantitatively describe their function within disparate genetic contexts.
A growing number of expression devices have been inspected in recent years owing to the push for standardization brought about by synthetic biology (Canton et al., 2008; Beal et al., 2020). Nonetheless, the best studied modules aimed at engineering complex expression circuits are largely limited to E. coli. In contrast, many other useful expression systems – especially those intended for non-model Gram-negative microorganisms – are poorly characterized and subject to all types of capricious designs and vector architectures. As a consequence, despite the large number of potentially useful regulatory parts available for controlling heterologous gene expression (Calero et al., 2016), reliable metrics assessing the performance of these regulatory parts within different genetic contexts is lacking. Recent developments are improving this state of affairs. Since 2013, an expanding collection of genetic tools (including expression systems), following the assembly rules of the so-called Standard European Vector Architecture (SEVA; Silva-Rocha et al., 2013), enable reusability, portability and comparative assessment of promoter and genetic device performance in a variety of Gram-negative hosts (Calero et al., 2016; Tas et al., 2021). The SEVA standard has been further expanded (Arce-Rodríguez et al., 2021) to propose a specific architecture for heterologous expression modules consisting of a fixed DNA sequence that acts as the scaffold for incorporation of the TF gene and the target promoter. In this design, all elements are organized in an otherwise invariable array of segments within a SEVA vector frame. The value of this format for expression cargoes was shown in the design and performance of a module based on the AlkS/PalkB regulatory node. This device included parts from the OCT plasmid of Pseudomonas oleovorans (van Beilen et al., 2001; Makart et al., 2007; Rojo, 2009), and gene expression can be triggered by n-octane as well as by the gratuitous inducer dicyclopropyl ketone (DCPK).
In this work, we have extended the standardization of expression modules to three additional platforms based on regulatory nodes mined from Gram-negative bacteria, i.e. the E. coli, IPTG-inducible LacIq/Ptrc (Brosius et al., 1985), XylS/Pm from Pseudomonas putida, which responds to m-toluate (3-methylbenzoate, 3-mBz; Ramos et al., 1987; Gawin et al., 2017), and ChnR/PchnB from Acinetobacter sp., a cyclohexanone-responsive system (Steigedal and Valla, 2008). One more expression module originated from the regulatory node CprK/PDB3 of the Gram-positive organism Desulfitobacterium hafniense, which can be induced by chlorinated molecules, e.g. 3-chloro-4-hydroxyphenylacetic acid (CHPA; Kemp et al., 2013). In order to comply with the SEVA format, the biological parts employed in this study were edited to erase undesired restriction sites that could interfere with cloning and their DNA sequences debugged (Silva-Rocha et al., 2013; Arce-Rodríguez et al., 2021) in order to improve their performance without affecting their functionality. As explained below, the key parameters and compositional constraints that rule the performance of each system could then be rigorously determined and the data evaluated for fitting specific bioengineering goals. Since each of the pieces chosen for this study came from a different biological context and are controlled by diverse mechanisms, their formatting in a comparable genetic arrangement permitted a faithful appraisal of their expression output in a given host as well as the influence of the standardization. The work below showcases both the value and feasibility of standardizing five types of modules and paves the way to a much ampler repertoire of choices for designing regulatory networks with intended phenotypic outputs.
Results and discussion Rationale for formatting heterologous gene expression devicesThe five expression systems under examination have each a different type of mechanism for controlling transcription of their target promoters. LacI/Ptrc is a representative of the plethora of devices based on the LacI repressor, the DNA binding ability of which to its cognate operator depends not only on inducer addition (e.g. IPTG) but also on the intracellular levels of the regulator itself (Calos, 1978). A somewhat equivalent situation but with an activator is embodied in XylS/Pm, the functioning of which depends as much on the inducer 3-mBz as on the levels of the transcriptional factor: overproduction of XylS activates Pm without any effector (Inouye et al., 1987; Mermod et al., 1987; Ramos et al., 1987; Spooner et al., 1987). In contrast, ChnR/PchnB (Steigedal and Valla, 2008; Benedetti et al., 2016a,b), in which the TF belongs to the same family of regulatory proteins as XylS (Iwaki et al., 1999; Tobes and Ramos, 2002), is transcriptionally silent unless cells face the cognate inducer (cyclohexanone). The transcriptional factor of the CprK/PDB3 node belongs to the CRP/FNR group of regulators and, following binding to the CHPA effector, binds a target sequence (termed dehalobox) within the sequence of the promoter region (Kemp et al., 2013; Benedetti et al., 2016a,b). Finally, the regulator n-octane/dicyclopropylketone (DCPK)-responsive AlkS/PalkB (Canosa et al., 2000; van Beilen et al., 2001; Rojo, 2009) is a member of the unusual MalT family of regulators, which require ATP binding for activity. In addition, alkS expression is subject in Pseudomonas to catabolite repression mediated by Crc, but the variant used in this work has been erased of that feature (Arce-Rodríguez et al., 2021). Although the five regulatory nodes under study do not exhaustively cover all possible architectures for inducible transcription, they represent many of those which have been used to this end in various expression vectors available thus far (Brautaset et al., 2009; Balzer et al., 2013; Marschall, 2017).
The first stage of this study involved the editing of individual components of each of the regulatory nodes and their assembly to meet SEVA standards (Silva-Rocha et al., 2013; Arce-Rodríguez et al., 2021; Fig. 1A), so that all share the following common features, namely, (i) a constitutive promoter driving the expression of (ii) the gene encoding the regulatory protein that is transcribed divergently with respect to its target promoter, (iii) a spacer sequence (buffer) that insulates expression of the relevant elements from adjacent DNA features and (iv) the cognate promoter that is activated upon addition of the inducer molecule. All expression systems were flanked by SEVA-compatible PacI-AvrII sites, allowing for easy swapping between SEVA vector backbones. Once inserted into these restriction sites, a downstream multiple cloning site (MCS) facilitates the addition of any gene(s) of interest and it is followed by a universal M13 (F24) priming site added to the backbone for sequencing/checking purposes. The functional cargo ends up with a unique SpeI site such that, upon insertion into any SEVA vector as a PacI-SpeI DNA segment, the expression cargoes described above are bracketed by the strong terminators T0 and T1. This structure ensures transcriptional insulation of the DNA segments encoding the TF and its cognate promoter (Fig. 1A). The basic architecture described above facilitates modular DNA assembly and eliminates potential effects on expression unrelated to the functional elements of the regulator/promoter systems themselves – provided that they are inserted at the same point of a given replicon and thus share an identical backbone.
Fig. 1. Main features of the standardized expression systems.A. Schematic representation of the main features of a standardized pSEVA-compatible expression cargo, indicating the relevant elements involved in the regulatory device, i.e. transcriptional regulator, cognate promoter and a buffer sequence that separates both features. A handy MCS for cloning procedures is located downstream of the regulatory device. In addition, there are R24 and F24 oligonucleotide priming sites that can be used for checking/sequencing purposes. The unique restriction sites PacI, AvrII and SpeI allow for module exchange within SEVA platform backbones, in such a way that the whole module is flanked by appropriate transcriptional terminators T0 and T1. Note that the standard does not include an RBS for expression of the GoI inserted in the MCS.B. Implementation of the standard in modules AlkS/PalkB, ChnR/PchnB and CprK/PDB3 and incorporation in vector pSEVA231 originated plasmids pSEVA239, pSEVA2311 and pSEVA2312, respectively. As indicated in the text, the standard could be applied partially to the XylS/Pm expression cargo, resulting in plasmid pSEVA238. This construct bears a spacer that is longer and different of the others and has the native PS2 promoter driving expression of the xylS.C. Composition of LacIq/Ptrc module variants. The IPTG-dependent expression module [LacI/Ptrc]C borne by pSEVA234C is the only one that follows all the terms of the standard. In the partially standardized device pSEVA234, lacI is expressed from PlacIq and is transcribed in the same orientation as the target promoter Ptrc. Two additional expression devices were constructed based on this non-standardized expression cargo by flipping the lacI segment, including the promoter PlacIq (pS634r) or substituting it by the Pneo one (pS634rk).
The general formatting of regulated transcription devices according to SEVA rules explained above had to be adapted case by case to the specific features of each expression module. As mentioned above, four out of five of the expression systems tested in this work are positively regulated. Among these, three (AlkS, ChnR and CprK) were exclusively dependent on inducer addition. This greatly facilitated streamlining their components, following a standard layout that encompasses (i) a P3/Pneo promoter and a translation initiation region (TIR) to drive the constitutive production of the regulatory proteins AlkS, ChnR and CprK and (ii) a neutral, 150-bp long DNA segment that separates the TF gene from its target promoter, identical in length and sequence in all three cases (Fig. 1B). This DNA spacer contains no known regulatory elements and serves to alleviate the negative supercoiling from the transcription of the flanking promoters (Wu et al., 1988). As indicated above, the alkS variant used in this study had its Crc recognition removed to mitigate catabolite repression in Pseudomonas hosts (Hernández-Arranz et al., 2016; Arce-Rodríguez et al., 2021). The organization of the constructs resulting from implementing these criteria to AlkS/PalkB, CprK/PDB3 and ChnR/PchnB is shown in Fig. 1B. The other positively regulated device (XylS/Pm) required a singular handling as its activity relies both on inducer and intrinsic levels of the TF (Inouye et al., 1987; Mermod et al., 1987; Spooner et al., 1987; Ramos et al., 1990; Michan et al., 1992; Kessler et al., 1994; Zwick et al., 2013). In order to leave the system dependent on inducer-only we kept the relatively weak native Ps2 promoter driving the expression of xylS (Gallegos et al., 1996; González-Pérez et al., 2004) instead of replacing it by the stronger P3/Pneo as in the other cases. Furthermore, instead of the standard spacer used for formatting AlkS/PalkB, CprK/PDB3 and ChnR/PchnB we entered a 0.8-kb synthetic DNA segment derived from vector pJB653 (Blatny et al., 1997) that was modified to eliminate restriction sites conflicting with the SEVA standard. Since this buffer sequence was merely meant to insulate the TF/promoter pair, we anticipated little or no influence of its length or sequence on the functionality of the device (Fig. 1B). Finally, the LacIq/Ptrc expression cargo was the only one examined based on a repression mechanism. Although a positive control may in principle deliver a tighter transcriptional regulation (Yin et al., 2007; Engstrom and Pfleger, 2017), IPTG-inducible devices remain to this day the most widely used small-scale expression systems. In the best-case scenario, when the operator in the cognate promoter is occupied by the repressor, transcription cannot proceed. The main issue is thus reducing basal expression without affecting inducibility of the system. A large number of lac-derived expression systems are based on the lacIq allele, a promoter-up mutation of lacI that increases production of the repressor by 10-fold (Calos, 1978). Having these considerations in mind, we engineered two versions of this expression system. First, a fully SEVA-standardized LacI/Ptrc expression cassette (named hereafter [LacI/Ptrc]C) that fulfilled all the specifications described for ChnR/PchnB, AlkS/PalkB and CprK/PDB3 (Fig. 1C). And second, a LacIq-based variant derived from the widely used plasmid pTrc-99A (Amann et al., 1988), which was edited to meet SEVA compositional rules while maintaining the features of the original construct. These involved the lacIq gene oriented in the same direction as its target Ptrc promoter, but separated by a neutral 231-bp DNA fragment (Fig. 1C).
In sum, the [LacI/Ptrc]C, AlkS/PalkB, ChnR/Pchn and CprK/PDB3 modules were considerably reshaped in respect to their naturally occurring configuration, while (owing to the reasons explained above) XylS/Pm pair was only partially reformed. In the four fully standardized expression devices, the gene encoding the TF is preceded by a TIR, and the whole is transcribed from the P3/Pneo promoter. Also, the TF gene is divergently located and isolated from its target promoter by means of a common DNA insulator sequence. In addition, the partially modified but still SEVA-compatible LacIq/Ptrc variant with co-directional TF gene and target promoter was also included as indicated above. With this panel of constructs at hand, we set out to investigate the functioning parameters delivered by each system as explained next.
Analysis of expression dynamics by flow cytometryTo prevent the occurrence of effects unrelated to the features of the regulation/promoter systems themselves, all replicons used in this study shared an identical backbone, i.e. plasmid pSEVA231. This vector harbours the following functional modules: a medium copy-number pBBR1 origin of vegetative replication, a kanamycin resistance gene and a multiple cloning site (MCS) default cargo (Silva-Rocha et al., 2013). Following SEVA composition rules, the expression cargoes were inserted in exactly the same location, i.e. between the PacI-AvrII sites, so that they were placed right upstream the MCS. This originated plasmids pSEVA234 (containing LacIq/Ptrc), pSEVA234C (encompassing the fully standardized [LacI/Ptrc]C derivative), pSEVA238 (harbouring the XylS/Pm cassette), pSEVA239 (carrying the regulator/promoter regions of AlkS/PalkB), pSEVA2311 (bearing the ChnR/PchnB regulatory pair) and pSEVA2312 (accommodating the CprK/PDB3 expression system). In order to monitor, quantify and compare their activity, a reporter gene encoding the monomeric and superfolder GFP (msfGFP), bearing a TIR motif (Miller and Lindow, 1997), was cloned downstream of each of the regulator/promoter systems under study at the same position of the MCS (see Experimental procedures). The resulting constructs (termed pS234·M, pS234C·M, pS238·M, pS239·M, pS2311·M and pS2312·M) carried the LacIq/Ptrc, [LacI/Ptrc]C, XylS/Pm, AlkS/PalkB, ChnR/PchnB and CprK/PDB3 expression systems, respectively. In all instances, the strain of reference E. coli CC118 (Manoil and Beckwith, 1985) was adopted as the host and the resulting transformants were grown under identical conditions. Also, all experiments were run in LB medium in shaken-flask cultivations as a standard culture condition widely used in common laboratory practices.
In a first series of experiments, we inspected the gross performance of these expression systems in terms of bacterial growth, inducibility and reporter output by means of flow cytometry as a way to expose expression noise and cell-to-cell variability. Fig. 2 shows the fluorescence patterns of cells harbouring the expression system under study along time after the addition of saturating amounts (1.0 mM) of the corresponding inducer, as indicated (increasing inducer > 1.0 mM does not have an effect on the expression level, data not shown). E. coli CC118 carrying vector pSEVA23M with a promotor-less msfGFP gene inserted in the same backbone as the constructs under analysis was used as negative expression control. Perusal of the frequency distribution histograms for green fluorescent cells revealed significant differences among the regulator/promoter systems examined. In the absence of inducer, cells remained mostly non-fluorescent in the strains with plasmids bearing the XylS/Pm, CprK/PDB3 and ChnR/PcnhB modules. In contrast, a detectable fraction of bacteria with the AlkS/PalkB device produced fluorescence under non-induced conditions. This was exacerbated in the case of both IPTG-inducible expression systems, in which a quite significant part of cells showed a detectable GFP signal which was indicative of a significant basal level. As expected, Fig. 2 shows that all expression systems responded to the addition of the corresponding inducer, as reflected by the movement of fluorescence signals towards higher intensity values over time. In all cases, fluorescence values fell in a single peak – suggesting a mono-modal expression pattern. But the results also showed non-identical shapes (indicating varying degrees of expression heterogeneity) and the fluorescence accumulated with different kinetics depending on time course and the system under analysis. In all instances, 4 h of induction appeared to bring about saturation of GFP output of cells with any of the expression systems.
Fig. 2. Analysis of fluorescence distribution with flow cytometry. Cultures of E. coli CC118 harbouring the corresponding expression plasmids, as indicated in each plot, were grown under the conditions explained in Experimental Procedures up to the mid-exponential phase (OD600 ~ 0.4). At this point, gene expression was induced by addition of the corresponding effectors (t = 0) as indicated in each case. E. coli CC118 carrying the promoter-less pSEVA237M plasmid (C-, grey plots) was used as negative fluorescence control. Samples were collected every hour after induction (t = 1, 2, 3 and 4 h), cells were fixed and the msfGFP signal was analyzed with a flow cytometer. Plots show the results of representative samples.
Further inspection of the results obtained by means of the flow cytometry experiments described in the previous section allowed us to draw additional features of the expression systems under consideration, both in the OFF and ON states. First, in order to assess the tightness of regulation, we compared basal, uninduced levels of fluorescent emissions in each case. This is an important quality of expression systems, as even low basal activity can be detrimental for the host’s physiology if the product of the GoI is toxic and/or causes metabolic or regulatory burden. Ideally, inducible heterologous expression devices should have the lowest possible background. Fig. 3 shows a detailed analysis of the fluorescence values measured during mid-exponential growth without any inducer addition (t = 0). The data exposed significant differences among the x-mean values of fluorescence basal levels. The LacI-dependent expression systems were by far the leakiest device. As displayed in Fig. 3, the LacIq/Ptrc device delivered a basal expression approximately 70-fold higher than ChnR/PchnB which behaved as the tightest device under non-inducing conditions (fluorescence x-mean of 1.39 versus 0.02 arbitrary units, respectively). Full standardization of the IPTG-inducible device imposed lacI to be expressed through a heterologous promoter and being divergently transcribed with respect to the GoI. Both transcriptional units were also separated by a standardized DNA buffer sequence. This arrangement reduced the background fluorescence of this regulator/promoter pair to one-third (x-mean fluorescence decreased to 0.41 arbitrary units in the [LacI/Ptrc]C derivative). Yet it still roughly doubled that of the AlkS/PalkB expression system, which still showed a considerable basal activity of around 0.24 arbitrary units. In contrast, XylS/Pm, CprK/PDB3 and ChnR/PchnB proved to be the most stringently regulated expression systems under study, with basal activities of 0.075, 0.041 and 0.02 arbitrary units, respectively (Fig. 3).
Fig. 3. Tightness of regulator/promoter modules. Basal expression of E. coli CC118 cells harbouring the expression plasmids indicated, in the absence of any added inducer (t = 0). Bars represent the mean value of the geometric mean (x-mean) of the msfGFP fluorescence ± standard deviation (SD) obtained from flow cytometer experiments and plotted for comparison. Values correspond to three independent experiments with two technical replicates each. Basal expression levels were ordered from highest to lowest. By far, the leakiest system was the negatively regulated LacIq/Ptrc module borne by plasmid pS234·M.
Other important parameters included induction kinetics and net promoter strength. Fluorescence accumulated steadily along time in cells bearing the regulatory/promoter devices upon addition of the corresponding inducer. But significant differences were observed among them with respect to their transcriptional capacity (i.e. maximal GFP output) in the activated ON state (Fig. 4A). The XylS/Pm cassette produced the highest GFP signal, followed by the AlkS/PalkB system. In contrast, the lowest reporter levels were obtained with CprK/PDB3 and either of the IPTG-inducible expression devices. Taking the XylS/Pm expression system as reference, both CprK/PDB3 and LacIq/Ptrc produced fourfold less fluorescence. This was followed by [LacI/Ptrc]C, which reached only one-seventh of the maximum XylS/Pm range. ChnR/PchnB occupied an intermediate position, i.e. about half of the capacity exhibited by the best performers XylS/Pm and AlkS/PalkB. In terms of absolute reporter output, the ranking of fully induced transcriptional strength under the specific conditions tested was XylS/Pm > AlkS/PalkB > ChnR/PchnB > CprK/PDB3 = LacIq/Ptrc > [LacI/Ptrc]C.
Fig. 4. Dynamic range and inducibility of expression cargoes.A. The means of the geometric mean (x-mean) of msfGFP fluorescence ± standard deviation of the activated expression of cultures of E. coli CC118 cells carrying the corresponding expression plasmids (as indicated), obtained from three independent cytometry experiments, were plotted for comparison of their expression levels along time (t = 0, before induction; t = 1, 2, 3 and 4 h after adding the corresponding inducer, respectively).B. Fold induction of the different expression systems calculated from results of the cytometry experiments. Mean ± SD data were ordered from the highest to the lowest dynamic range. Results were statistically tested by means of a one-way variance analysis (ANOVA) for multiple comparisons. Significant differences between the expression systems are represented in the graph with * (P-value ≤ 0.05), ** (P-value ≤ 0.01), *** (P-value ≤ 0.001) or **** P-value ≤ 0.0001). The rest of comparisons were not statistically significant.
The ratio between the fully induced output and the basal level enabled calculation of inducibility, i.e. the dynamic range of the system. As shown in Fig. 4B, the ChnR-regulated device displayed the highest ratio with 464-fold increase of GFP signal following full induction (Fig. 4B, green bar). High figures were also observed in XylS/Pm (Fig. 4B, orange bar) with a 291-fold increase. This was followed by CprK-dependent and AlkS-regulated expression systems (134-fold and 64-fold changes, respectively). Once more, the LacI-regulated expression systems were by far the least performing of the devices upon induction. In this case, GFP fluorescence barely increased by sevenfold with the [LacI/Ptrc]C system and as little as fourfold in LacIq/Ptrc. Under the conditions of the assays and taking into account the dynamic range as a descriptor of the expression capabilities, the regulatory devices analysed from best to poorest would be ChnR/PchnB > XylS/Pm > CprK/PDB3 > AlkS/PalkB > [LacI/Ptrc]C > LacIq/Ptrc.
Issues with standardization of the LacI/Ptrc systemThe clearly non-ideal behaviour of the [LacI/Ptrc]C and LacIq/Ptrc in the tests above prompted us to examine in more detail additional circumstances that could influence their activity and therefore make their standardization more problematic. In both cases, Ptrc is a chimaera of the E. coli trp operon and lacUV5 promoters, including a single proximal lacO1 operator from the lac operon and the optimally spaced E. coli consensus –35 and –10 motifs. However, in the case of pSEVA234C (Table S1, GenBank accession number
Flow cytometry profiles embody valuable information on cell-to-cell homogeneity: A sharp peak indicates low dispersion and high uniformity, while a wider peak indicates variation of expression levels among individual cells. A robust indicator of the level of heterogeneity is the so-called coefficient of variation (CV), a parameter that relates the standard deviation with the mean value which is usually expressed as a percentage. In practical terms, the lower the CV, the less noise there is. As CV is unit-less and dimension-less, this proxy can be used to compare experimental results over time as well as those obtained in different platforms.
Fig. 5 shows the CVs (expressed as percentage) of E. coli CC118 harbouring the corresponding expression systems before and at different time points after induction. At time point 0, right before adding each inducer, most of the expression systems had a low spread, i.e. a small CV that was around 45 to 50%. The exception were the LacI-dependent regulatory systems that displayed a mean CV of 63% for the LacIq/Ptrc and 76% for the [LacI/Ptrc]C. This noisy behaviour is likely due to the suboptimal repression of the Ptrc promoter exerted by the LacI regulator, which translates into a broader expression profile. This is consistent with the high basal expression levels displayed by both systems, as indicated previously (Figs 2 and 3). Upon activation by their cognate inducers, cell-to-cell fluorescence variations increased in all expression systems. Early in the analysis, i.e. after 1 h induction, CV values exceeded 60% in all samples, but decreased steadily over time, recovering in most cases the initial values after 4 h of effector addition. Yet, XylS/Pm and [LacI/Ptrc]C showed a conspicuously uneven signal distribution in the ON state throughout the whole period, especially at early times after induction. In particular, their CV value reached ~ 100% dispersion, i.e. the standard deviation was similar to the mean value after the addition of the inducer. Although the spreading of GFP signals decreased over time in both XylS/Pm and [LacI/Ptrc]C devices, it remained higher than any of the other expression systems staying at a good CV of 80% at late induction times. The general behaviour of the expression systems in terms of cell-to-cell heterogeneity was coherent with earlier observations with equivalent transcriptional devices (Balzer et al., 2013). Since noise stems in some cases from low intracellular levels of the TF involved (Silva-Rocha and de Lorenzo, 2010), the unexpected variability of the XylS/Pm regulatory system could be explained, at least in part, by the low XylS protein abundance when expressed from its natural, low-activity promoter Ps2 (Marqués and Ramos, 1993), which results in strong stochastic effects (Goñi-Moreno et al., 2017). Repressor levels and geometrical/topological effects (Kolesov et al., 2007; Chong et al., 2014) can also account for the considerable increase in noisy GFP levels of cells bearing the [LacI/Ptrc]C expression system (Fig. 5).
Fig. 5. Population heterogeneity. Cell-to-cell heterogeneity of E. coli CC118 populations harbouring the corresponding expression systems along time, expressed as the mean of the percentage of the coefficient of variation (CV*100) ± standard deviation at each time point of the flow cytometry experiments (t = 0 before induction, t = 1, 2, 3 and 4 h after addition of the corresponding inducers). The noisiest systems were the standardized version of the [LacI/Ptrc]C and the XylS/Pm regulatory devices, while the ones with the lowest cell-to-cell heterogeneity were CprK/PDB3 and AlkS/PalkB.
Since genetic constructs often require altogether separate transfer functions of the various regulatory devices, we next analysed potential crosstalk among effectors of the regulatory devices. To this end, cultures of E. coli CC118 harbouring each of the expression cargoes, were independently supplemented with the corresponding inducers and the fluorescent output after 3 h was inspected as before with flow cytometry (Fig. 6). A control strain carrying the promoterless msfGFP plasmid pSEVA237M (Table S1), was also included in the assay. As expected, each expression system responded to its bona fide effector. In contrast, none of the inducers affected either the control or strains carrying plasmids with non-associated transcriptional devices, which showed fluorescence patterns indistinguishable from untreated samples. The growth curves were independent of whether the cultures were supplemented with the cognate inducer or a different one: none of the inducers at the concentrations employed in this work (1.0 mM) inhibited growth of E. coli or changed the overall growth profiles – neither specific growth rates nor final optical densities (not shown). These experiments accredited that expression systems under study are not influenced by inducers other than their cognate effectors and therefore they are mutually orthogonal.
Fig. 6. Cross-talk among expression systems. E. coli CC118 cells transformed with the corresponding expression plasmids, as indicated above each graph, were either non-induced (NI) or added with their bona fide effector as well as with the rest of the heterologous inducers involved in the expression systems under analysis, i.e. IPTG (purple), 3-mBz (orange), DCPK (blue), cyclohexanone (CHx; green) and CHPA (red). The effect of the corresponding inducers on the GFP fluorescence driven by each regulatory device was then analysed by flow cytometry after two hours of treatment, as indicated. The same host strain harbouring the promoterless msfGFP gene (pSEVA237M) was also tested as negative control. Note that the expression systems responded only to their own inducers and not to the effectors activating other regulators.
To further characterize the expression systems under analysis, we investigated if any of the effectors of the regulatory proteins caused physiological stress in the host. As different types of insults ultimately converge in generation of reactive oxygen species (ROS; Cabiscol et al., 2000; Zhao and Drlica, 2014) we inspected their formation in cells following exposure to the corresponding inducers. This scenario is not devoid of precedents, as toxicity of some typical inducers (e.g. IPTG), has been documented (Dvorak et al., 2015). This holds true also for aromatic molecules and other organic compounds, which often cause redox stress (Nikel et al., 2016). On this basis, potential toxic effects and intracellular accumulation of ROS upon addition of the inducers to cells with each of the expression systems was assessed by analysing growth profiles and by direct quantification of ROS by flow cytometry (Fig. 7). As shown in Fig. 7A, the addition of inducers at the concentration employed in this work (1.0 mM) did not inhibit or changed the overall growth profiles of E. coli. In particular, neither the specific growth rate nor the final optical density was affected by the addition of any inducer, compared to the growth pattern of a non-treated control culture. In contrast, diamide (a drainer of NADPH; Kosower and Kosower, 1995) and hydrogen peroxide (H2O2) acted as bacteriostatic agents in cell exposed to either chemicals, leading to growth arrest by approximately one hour after addition. Redox stress monitoring was then set up using 5-(6)-carboxy-2′,7′-dichlorodihydrofluorescein diacetate (carboxy-H2DCFDA) as the ROS-sensitive probe. The results shown in Fig. 7B exposed an expected ROS accumulation in the control samples treated with diamide or H2O2, but no significant oxidative stress in the others.
Fig. 7. Inducer-mediated oxidative damage (ROS).A. Growth curves of E. coli CC118 cells carrying plasmid pSEVA237M treated with different inducers (at 1.0 mM concentration) or with ROS-producing agents, i.e. H2O2 at 1.5 and 5 mM (light and dark pink, respectively) and diamide at 250 μM (light green). Addition of the different chemicals was done at mid-exponential growth (t = 2.5 h, indicated with a red arrow). Non-induced cells (NI) were included as a control.B. Representative examples of cytometry experiments with samples of E. coli CC118 cells carrying plasmid pSEVA237M treated as described above (see also Experimental procedures). Two hours after the treatment, cell aliquots were analysed using the probe carboxy-H2DCFDA. This non-fluorescent molecule is readily converted to a green-fluorescent form upon oxidation with ROS within the cell. The colour code of each graph is same as in (A).
In sum, the results above indicated that the inducers of the expression modules had neither ROS effects nor an impact in the growth of the bacterial host. Therefore, they can be considered as safe agents in this context, at least at the concentrations used in the experiments.
ConclusionInducible expression of heterologous genes is not only at the basis of recombinant DNA-based Biotechnology but also an essential component of the genetic circuits which are at the core of contemporary Synthetic Biology (Engstrom and Pfleger, 2017). The literature contains dozens of regulatory nodes of existing bacterial gene networks which have been refactored in the shape of a conditional expression device. As is the case with other DNA segments encoding properties of interest, standardization of such constructs may bring less flexibility but can also add a considerable value in terms of portability and reusability (Canton et al., 2008; Røkke et al., 2014; Beal et al., 2020). In this work we have chosen five types of inducible expression modules intended to be used as transcriptional SEVA cargoes and we have examined the feasibility of applying a rigorous composition standard to them. The results clearly indicate that not every TF/promoter pair can be reformatted to this end – or not so easily. Because of being dependent on a minimal number of factors, those most amenable to standardization are those (i) with promoters which have little or no activity under uninduced conditions, (ii) are subject to positive control, (iii) are regulated by TFs entirely inactive in the absence of the inducer and (iv) are independent of intracellular TF abundance. The only two cases in our study that duly me et all these criteria are the ChnR/PchnB and CprK/PDB3 modules, and therefore they were the ones where a rigorous format could be applied. To a lesser extent, but still amenable to standardization is the AlkS/PalkB module, the flaw in this case being its significant basal level in the absence of induction. The issue can be traced to PalkB, not the regulator, and therefore could be plausibly fixed with other promoter variants. The XylS/Pm device is a popular expression system in Gram-negative bacteria which can be found in the literature in diverse configurations. Complete standardization is however complicated by the ability of XylS to activate Pm in the absence of inducers if overproduced (Ramos et al., 1987). The partial formatting embodied in pSEVA238 is nevertheless enough for an excellent performance as an expression tool. Finally, the data above with LacI and Ptrc showcases the difficulty of standardizing expression modules based on effector-responsive repressors. The results shown in Fig. 2 and Fig. S1 expose a large number of variables at issue that influence operation of such systems and thus make them less amenable to a rigorous configuration paired with others thereof. In sum, we argue that the outcome of the experiments above on the one hand consolidate some expression devices as full-fledged SEVA cargoes and in the other they can guide future attempts of standardizing transcriptional regulation tools for Synthetic Biology and Biotechnology in general.
Experimental procedures Strains, plasmids, culture media and growth conditionsThe relevant properties of the strains and plasmids used in this work are listed in Table S1. E. coli DH10B, DH5α and CC118 strains were used for cloning procedures. E. coli CC118, harbouring the corresponding plasmid(s) under analysis, was used for assessing msfGFP activity driven by the different expression systems. Bacteria were grown routinely at 37°C in LB medium (10 g l−1 of tryptone, 5 g l−1 of yeast extract and 5 g l−1 of NaCl). When required, Km (75 μg ml−1) was added to the media. The necessary inducers, isopropyl-β-D-1-thiogalactopyranoside (IPTG), 3-methylbenzoate (3-mBz), cyclohexanone (CHx), dicyclopropyl ketone (DCPK) or 3-chloro-4-hydroxyphenylacetic acid (CHPA), were added where indicated to a final concentration of 1.0 mM. For growth curves, E. coli CC118 cells transformed with the plasmids indicated in each case were grown overnight in LB medium at 37°C under standard conditions as described above and then diluted 1:20 in multiwell microtiter plates containing 190 μl of LB medium supplemented with kanamycin. Cells were then grown with rotatory shaking in a Victor2 plate reader (PerkinElmer, Shelton, CT, USA) at 30°C to avoid excessive condensation on the plate lids during 2.5 h. After reaching mid-exponential growth (OD600 ~ 0.4), cells were treated with the effector of choice or the oxidative stress-inducing agent at the indicated concentrations (see below) and the incubation at 30°C resumed for 24 h. Bacterial growth was recorded as the OD600 measured every 15 min. Data shown corresponds to two independent experiments with eight technical replicates each. Unless otherwise indicated, DNA synthesis was performed by GeneArt (Thermo Fisher Scientific; Waltham, MA, USA). Primers used in this work were provided by Sigma-Aldrich (St. Louis, MO, USA). Restriction enzymes and Phusion DNA polymerase, used for PCR amplifications, were purchased from New England BioLabs (Beverly, MA, USA). DNA AmpliTools Green Master Mix (Biotools B&M Labs SA, Madrid, Spain) was routinely used for colony PCR verifications. The fidelity of DNA fragments was later confirmed by restriction mapping and DNA sequencing of relevant features (Secugen Sequencing and Molecular Diagnostics, Madrid, Spain).
DNA constructsGeneral methods for DNA manipulation were performed as described elsewhere (Sambrook et al., 1989). Oligonucleotides used in PCRs are listed in Table S2. The ITPG-inducible expression system was obtained from the plasmid pTrc99a. The expression cassette containing the lacIq gene and the Ptrc promoter was amplified by PCR using primers TRC-F and TRC-R. These primers introduced PacI and AvrII restriction sequences in the flaking region of the fragment. A 1.4-kb PCR fragment was gel-purified and cloned in vector pSEVA221 previously digested with PacI/AvrII, leading to plasmid pSEVA224. The 3-mBz-inducible XylS/Pm expression system was constructed based on a cassette present in vector pJB653 and derivatives thereof (Blatny et al., 1997). In that system, the gene encoding the XylS activator is placed in a divergent orientation relative to its target Pm promoter. As the XylS/Pm DNA fragment presented many restriction sites used in SEVA multi-cloning site, the cassette sequence was edited in silico and the resulting variant was synthetized de novo by GeneArt. The 2.0-kb DNA fragment with the edited XylS/Pm sequence was cloned as a PacI/AvrII fragment into the pSEVA226 reporter vector, generating plasmid pSEVA226-XylS-Pm, which was then used to produce plasmid pSEVA238 (Calles et al., 2019). The AlkS/PalkB expression cassette based on the cognate system of P. putida GPo1 was fully synthetized by GeneArt in order to fulfil SEVA standards and was cloned into vector pSEVA421 to produce pSEVA429 (Arce-Rodríguez et al., 2021). Then, a PacI-AvrII fragment containing the complete regulatory device was re-cloned in vector pSEVA231 digested with the same enzymes, leading to plasmid pSEVA239. The design and construction of a standardized vector carrying the ChnR/PchnB expression system is described elsewhere (Benedetti et al., 2016b). Finally, the CprK/PDB3 expression system was assembled (Benedetti, 2014) by amplifying cprK1 from plasmid pMLK1 (Kemp et al., 2013) by PCR with primers 5′-CPRK1 and 3′-CPRK1, harbouring AvrII and PacI sites, respectively. The resulting DNA fragment was then digested with both enzymes and cloned into vector pSEVA231 digested with the same endonucleases, producing the intermediate plasmid pSEVA231-CPRK1. A separate DNA fragment contained the cognate promoter PDB3, which was followed by a buffer sequence that separated the promoter from the divergent promoter P3/Pneo, which was introduced along with the TIR sequence 5′-GAT TAA CTT TAT AAG GAG GAA AAA-3′ (Miller and Lindow, 1997), to drive the constitutive expression of cprK. This fragment, containing the CHPA-inducible PDB3 promoter plus the constitutive P3/Pneo promoter, was cloned into the NheI/AvrII sites of plasmid pSEVA231-CPRK1, producing plasmid pSEVA2312 that harbours the complete CprK/PDB3 expression cassette. The corresponding reporter plasmid pS2312·M was obtained as indicated below. The promotorless msfGFP reporter was obtained from plasmid pGA-LacI, containing an edited PstI-less version of msfGFP, in a PCR reaction using oligonucleotides msfGFP-HindIII-F, which contained a TIR motif and a HindIII site, and msfGFP-SpeI, bearing a SpeI restriction site. The amplified fragment was then digested with HindIII and SpeI, gel purified and ligated into the corresponding sites of pSEVA231, producing plasmid pSEVA237M (Benedetti et al., 2016b; Calles et al., 2019). All plasmids used to monitor expression activity arising from the different systems, i.e. pS234·M, pS234C·M, pS238·M, pS239·M, pS2311·M and pS2312·M, were constructed by restriction of pSEVA237M with HindIII-SpeI and ligation into the corresponding expression system previously cleaved within the same endonucleases.
Plasmid pSEVA634 was built by moving the cargo module from pSEVA234 onto pSEVA631 via restriction digestion with PacI and AvrII and ligation. Plasmids pS634r and pS634rκ were constructed with the uracil-excision (USER) cloning method. To this end, a 1,227-bp fragment containing PlacI and lacIq was PCR-amplified from vector pSEVA234 with primers lacIq-revert_F/lacIq-revert_R and incorporated into the linearized, 3,249-bp pSEVA634 backbone (separately amplified with primers pSEVA634_F/pSEVA634_R) to yield plasmid pS634r. In the case of plasmid pS634rκ, a 304-bp DNA fragment containing the standard SEVA translation initiation sequence (TIR), Pneo, and a buffer sequence was commercially synthesized (Integrated DNA Technologies, Leuven, Belgium). This fragment was amplified with primers std-reg_F/std-reg_R and fused to the linearized 4,211-bp pS634r backbone. The fluorescence-reporter plasmid variants pS634·M, pS634r·M, and pS634rκ·M were created by amplifying the msfGFP gene from pSEVA237M with primers msfGFP_F/msfGFP_R, followed by restriction-digestion of the PCR amplicon as well as pSEVA634, pS634r, or pS634rκ with EcoRI and HindIII and ligation.
Fluorescent flow cytometryQuantification of msfGFP expression was made with fluorescence flow cytometry on strains containing the relevant constructs. In each case, bacteria were inoculated into filtered LB medium and pre-grown to stationary phase. Overnight cultures were then diluted to an OD600 of 0.05 in fresh filtered LB medium and incubated at the appropriate temperatures to an OD600 ~ 0.4, when the corresponding inducer was added. This time point was considered as t = 0. Following exposure to the inducer, 1 ml samples were harvested at various time points, as indicated in the text, and spun down in a tabletop centrifuge at 13 000 g during 1 min; then, the cells were washed with 500 μl of filtered phosphate buffered saline, centrifuged again as indicated above and resuspended in 300 μl of 0.4% (w/v) paraformaldehyde and finally incubated at room temperature for 10 min. After cell fixation, cells were washed twice in 500 μl of filtered phosphate buffered saline, resuspended in 600 μl of the same buffer and stored on ice until analysis. The final OD600 was adjusted to a value < 0.4 in all samples before flow cytometry analysis. Single-cell fluorescence of the samples was then analysed by flow cytometry using a GALLIOS cytometer (Perkin Elmer; Shelton, CT, USA) or with a MACSQuant™ VYB cytometer (Miltenyi Biotec, Bergisch Gladbach, Germany). GFP was excited at 488 nm, and the fluorescence signal was recovered with a 525/40 nm band pass filter. At least 25 000 events were counted in each aliquot. The GFP signal was quantified under the experimental conditions tested by firstly gating the cells in a side scatter against forward scatter plot, and then the GFP-associated fluorescence was recorded in the FL1 channel (515–545 nm). Data processing was performed using the FlowJo™ software (
Quantification of green (msfGFP) fluorescence in cultures of E. coli CC118 harbouring either plasmid pSEVA634M, pS634r·M, or pS634rκ·M was performed in 96-well plates in a Synergy Max plate reader (BioTek Instruments; Winooski, VT, USA). Overnight precultures in LB medium were diluted 1:100. The production of msfGFP was induced via the addition of IPTG at 1 mM after 2.5 h of cultivation (corresponding to the early exponential phase in microtiter plate cultures). Fluorescence was measured at an excitation wavelength of 483/9 nm and an emission wavelength of 514/9 nm, with the gain set at 80. The expression performance of the IPTG-inducible standardized device was likewise analysed in cultures of E. coli CC118 carrying plasmid pS234·M or pS234C·M, treated under different conditions as described. In this case, cells were grown in 96-well microtiter plates at 30°C in a Victor-2 multireader spectrophotometer (Perkin Elmer; Shelton, CT, USA). At time intervals of 15 min, the green fluorescence and the OD600 of each culture were simultaneously recorded. Non-inoculated LB and cells harbouring the pSEVA234 empty plasmid were used as blank to adjust the baseline for measurements. The expression system activity was calculated by normalizing the green fluorescence to the OD.
Flow cytometry detection of ROS (peroxides and hydroperoxides) by dichlorofluorescein derivativesTo detect ROS formation by means of flow cytometry analysis, cultures of E. coli CC118 transformed with vector pSEVA237M were incubated overnight as indicated. The following day, samples were diluted in 25 ml of filtered LB medium to an initial OD600 ~ 0.05. At the mid-exponential phase of growth (OD600 ≈ 0.4), the culture was divided into 7 ml aliquots that were treated either with the stressors H2O2 or diamide (as positive controls of oxidative stress) or with 1 mM of each of the inducers under inspection. One more aliquot was kept untreated as negative control. After an additional 2 h incubation, 1 ml of the cultures was spun down at 13 000 g for 2 min at room temperature, washed once with 1 ml of filtered phosphate buffered saline by centrifugation and resuspension and diluted in filtered phosphate buffered saline such that the OD600 was under 0.4 in a final volume of 1 ml. Then, a freshly prepared solution of 5-(6)-carboxy-2′,7′-dichlorodihydrofluorescein diacetate (carboxy-H2DCFDA; Sigma-Aldrich, St. Louis, MO, USA) in DMSO was added to the samples at 20 μM. Cell suspensions were incubated for 30 min at 37°C protected from light. Finally, the fluorescence of each sample was measured in the green channel using the same excitation/emission parameters of msfGFP.
AcknowledgementsWe are indebted to Rafael Silva-Rocha (Ribeirão Preto Medical School, USP, Brazil) and Juhyun Kim (Imperial College London, UK) for constructing the LacIq/Ptrc and XylS/Pm expression cargoes, respectively. This work was funded by the SETH (RTI2018-095584-B-C42 of MCIN/AEI /10.13039/501100011033/FEDER) and SYCOLIM (ERA-COBIOTECH 2018 - PCI2019-111859-2 of MCIN/AEI /10.13039/501100011033/EU) Projects of the Spanish Ministry of Science and Innovation. MADONNA (H2020-FET-OPEN-RIA-2017-1-766975), SYNBIO4FLAV (H2020-NMBP-TR-IND/H2020-NMBP-BIO-2018-814650) and MIX-UP (MIX-UP H2020-BIO-CN-2019-870294) Contracts of the European Union, InGEMICS-CM (S2017/BMD-3691) and BIOSINT-CM (Y2020/TCS- 6555) Projects of the Comunidad de Madrid – European Structural and Investment Funds – (FSE, FECER) to VdL. The financial support from The Novo Nordisk Foundation through grants NNF20CC0035580, NNF10CC1016517, LiFe (NNF18OC0034818) and TARGET (NNF21OC0067996), the Danish Council for Independent Research (SWEET, DFF-Research Project 8021-00039B), and the European Union’s Horizon 2020 Research and Innovation Programme under grant agreement No. 814418 (SinFonia) to P.I.N. is gratefully acknowledged.
Conflict of interestAuthors declare no conflict of interest.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
The potential of LacI/Ptrc, XylS/Pm, AlkS/PalkB, CprK/PDB3 and ChnR/PchnB regulatory nodes, recruited from both Gram‐negative and Gram‐positive bacteria, as the source of parts for formatting expression cargoes following the Standard European Vector Architecture (SEVA) has been examined. The five expression devices, which cover most known regulatory configurations in bacteria were assembled within exactly the same plasmid backbone and bearing the different functional segments arrayed in an invariable DNA scaffold. Their performance was then analysed in an Escherichia coli strain of reference through the readout of a fluorescence reporter gene that contained strictly identical translation signal elements. This approach allowed us to describe and compare the cognate expression systems with quantitative detail. The constructs under scrutiny diverged considerably in their capacity, expression noise, inducibility and ON/OFF ratios. Inspection of such a variance exposed the different constraints that rule the optimal arrangement of functional DNA segments in each case. The data highlighted also the ease of standardizing inducer‐responsive devices subject to transcriptional activation as compared to counterparts based on repressors. The study resulted in a defined collection of formatted expression cargoes lacking any cross talk while offering a panoply of choices to potential users and help interoperability of the specific constructs.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details




1 The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs Lyngby, Denmark
2 Systems and Synthetic Biology Program, Centro Nacional de Biotecnología (CNB‐CSIC), Madrid, Spain