The discovery of penicillin has been a crucial turning point in human history, completely revolutionizing the treatment of bacterial infections. Since then, many more classes of compounds have been identified or synthesized, but penicillin and its (semi‐) synthetic derivatives are still among the most common antibiotics used worldwide.
The biosynthetic pathway of penicillin (and other β‐lactam antibiotics such as cephalosporins and cephamycins) has been extensively characterized in the past decades, resulting in detailed knowledge on the key enzymes involved and the mechanistic aspects of its biosynthesis (Baldwin & Abraham, 1988; Coque et al., 1991; Liras, 1999; Martín, 1998; Martín et al., 1999; Ozcengiz & Demain, 2013; Queener, 1990; Roach et al., 1997; Schofield et al., 1997; Tahlan et al., 2016; Wu et al., 2012). Two distinct enzymatic steps are involved in the production of the β‐lactam ring. First, the l‐δ‐(α‐aminoadipoyl)‐l‐cysteinyl‐d‐valine synthetase (ACVS) provides the linear tripeptide (L,L,D)‐ACV, a precursor of all β‐lactam antibiotics. The second step requires the enzyme isopenicillin N synthase (IPNS, formerly also ACV cyclase), which catalyzes the formation of the β‐lactam ring. The first enzyme of the pathway, ACVS, belongs to the family of nonribosomal peptide synthetases (NRPS), multimodular enzymes that synthesize small peptides (NRP) by a mechanism that does not require messenger RNA, unlike ribosomal protein synthesis (Marahiel et al., 2009; Reimer et al., 2018).
NRP synthesis generally starts in every module at the adenylation (A) domain, which recognizes a specific substrate and catalyzes its activation as (amino) acyl‐AMP. In this process, ATP is consumed and inorganic pyrophosphate (PPi) is released. The activated substrate is then transferred via a thioesterification reaction to the thiolation (T)—or peptide carrier protein (PCP)—domain through the phosphopantetheine (ppant) arm, with AMP being released. The ppant is a CoA (Coenzyme A)‐derived cofactor, covalently attached to a highly conserved residue of serine of the T domain. The loaded substrates are subsequently transported to the active site of the condensation (C) domain (or a cyclization domain, Cy). Here, peptide formation occurs via nucleophilic attack of the α‐amino group of the downstream substrate to the activated carboxy group of the upstream substrate. As a result, the latter is released from the ppant moiety, and the newly synthesized intermediate is now ready to be transported to the next condensation domain (Condurso & Bruner, 2012; Koglin & Walsh, 2009; Marahiel et al., 2009; Reimer et al., 2018; Schwarzer et al., 2003; Strieker et al., 2010; Walsh, 2016).
The domains A, PCP, and C/Cy are strictly required for NRP synthesis, and therefore, a minimal module contains all three of them (initiation modules can lack a C domain as they represent the starter units of NRPS). Also, optional or modifying domains can be present in one or more modules, such as methyltransferase (De Mattos‐Shipley et al., 2018), epimerization (Samel et al., 2014), formylation (Schoenafinger et al., 2006), and oxidase (Schneider et al., 2003) domains. These domains contribute to further increase the chemical diversity of NRPs and often can provide advantageous characteristics such as increased chemical stability or resistance against peptidase. Lastly, when the synthesis is completed, the peptide can be released via the activity of a thioesterase (Te) domain, either by macrocyclization or hydrolysis, resulting in a cyclic or a linear NRP, respectively (Horsman et al., 2016; Kohli & Walsh, 2003; Schneider & Marahiel, 1998; Tanovic et al., 2008).
Because of the relevance of the β‐lactam biosynthetic pathway and the extensive research covering it, the ACVS became soon a model to investigate nonribosomal peptide biosynthesis. The protein has been studied in filamentous fungi such as Penicillium, Aspergillus, and Cephalosporium as well as bacterial Nocardia and Streptomyces species, both in vivo and in vitro (Baldwin et al., 1990, 1991; Coque et al., 1996; Iacovelli et al., 2020; Jensen et al., 1990; van Liempt et al., 1989; Schwecke et al., 1992; Tahlan et al., 2017; Theilgaard et al., 1997). Despite the lack of 3D structures, it is well known that the ACVS is a trimodular NRPS with a reported canonical domain arrangement: AT1‐CAT2‐CATETe3. Each module is responsible for the activation and incorporation of its specific substrate into the final product. The three substrates l‐α‐aminoadipic acid (l‐α‐Aaa), l‐cysteine, and l‐valine (epimerized to d‐valine by the E domain) are assembled in a co‐linear fashion; thus, the position of the incorporated substrate corresponds to the position of the respective module within the NRPS sequence. Interestingly, the adenylation domain of the first module of ACVS is the only NRPS domain that is known to activate and utilize l‐α‐aminoadipic acid as substrate (Flissi et al., 2020). Besides, l‐α‐Aaa is adenylated on the δ‐carboxyl group, resulting in a non‐canonical peptide bond formation between l‐α‐Aaa and l‐Cys (Iacovelli et al., 2020), suggesting a substrate recognition mechanism different from that of canonical amino acid‐activating adenylation domains (Conti et al., 1996, 1997; Stachelhaus et al., 1999).
In this work, we set out to further investigate the architecture of the first module of the ACVS from the fungus Penicillium rubens and the soil bacterium Nocardia lactamdurans (also known as Amycolatopsis lactamdurans). Through bioinformatic analyses, we observed the presence of a previously unidentified domain related to the condensation domain family, in the N‐terminal region of the enzyme. The domain appears structurally conserved and shows the typical fold of C domains, despite being approximately half the size. We therefore proceeded to generate variants of both ACVS enzymes that lack this domain, to investigate the potential impact on ACV biosynthesis, and thus on the whole β‐lactam biosynthetic pathway.
All cloning procedures in E. coli were performed using the strain DH10β for high transformation efficiencies. Cultures were grown using LB medium at 37°C and 200 rpm, and the antibiotic selection was conducted utilizing 25 μg/ml Zeocin (phleomycin D1). The Nocardia lactamdurans pcbAB gene was cloned using an intermediate gateway vector and subsequently subcloned into a custom‐made pBAD‐plasmid (pBR322 ori; araC; pBAD, ZeoR) using SbfI x NdeI sites, and including the introduction of a 6xHis‐tag on the C‐terminal end. This construct was kindly provided by DSM Sinochem Pharmaceuticals (now Centrient Pharmaceuticals), the Netherlands.
Penicillium rubens DS68530 (∆penicillin biosynthetic cluster, ∆hdfA, derived from DS17690) (Harris et al., 2009; Salo et al., 2015) and DS54468 (1× penicillin biosynthetic cluster, ∆hdfB, derived from DS47274) (Nijland et al., 2010) strains were kindly provided by DSM Sinochem Pharmaceuticals. For storage purposes, control and transformant strains were grown on a complex medium for 5–7 days, after which the mycelium was collected and stored on rice grains (Kovalchuk et al., 2012). For liquid culturing, fungal strains were grown in YGG and PPM (penicillin production medium), prepared as described previously (Weber et al., 2012).
Synthetic DNA was purchased from Invitrogen (GeneArt Strings). PCR primers and sequencing primers, as well as chemicals used for the preparation of stock solutions and media, were purchased from Sigma‐Aldrich (now MilliporeSigma). PCR reactions were carried out using KAPA HiFi HotStart ReadyMix (Roche), according to the manufacturer's instructions. In silico PCR and cloning procedures, as well as subsequent analyses, were performed using the SnapGene® software (from GSL Biotech; available at snapgene.com).
The sequences of Nl ACVS and Pr ACVS were acquired from the database Uniprot (Bateman, 2019), with the respective accession numbers being P27743 and P26046. For domain prediction analyses, the web software BLASTP (version 2.10.1+) was used. The search parameters were set as follows: database=“non redundant protein sequences (nr)”; search algorithm=“blastp (protein‐protein BLAST)”; low complexity filter = no; Composition Based Adjustment = yes; E‐value threshold = 0.01; maximum number of hits = 500. The results were visualized using the graphical interface of BLAST (conserved domains on). Backbone dynamics prediction—to identify folded and unfolded regions—was performed using the web‐based software Dynamine (Cilia et al., 2013, 2014).
Structural models of Nl ACVS M1 and Pr ACVS M1 were obtained using the webserver i‐TASSER (Iterative Threading ASSEmbly Refinement), a fold recognition‐based protein modeling tool (Roy et al., 2010; Yang et al., 2014; Yang & Zhang, 2015). The highest‐ranking model was selected for each of the targets based on their C‐score. The C‐score (range −5,2) is a confidence score that estimates the quality of the predictions, higher is better. The models were then visualized using the software UCSF Chimera (Pettersen et al., 2004) for imaging. Structural alignments were generated using the MatchMaker suite of UCSF Chimera, with the following parameters: chain pairing=best aligning pair of chains; alignment algorithm=Needleman‐Wunsch; Matrix=BLOSUM‐62; gap opening penalty=12; gap extension penalty=1.
Ultimately, multiple sequence alignment analyses and identification of the core sequence motifs C1‐C7 were achieved using the software Clustal Omega (Sievers et al., 2011).
The plasmid pLM‐AMA15.0 (AddGene ID #138944) was constructed using Modular Cloning (Weber et al., 2011) and Gibson Assembly (Gibson et al., 2009), as described previously (Mózsik et al., 2020). The sequence encoding SpCas9‐NLS (nuclear localization sequence) was amplified from pYTK036 (yeast MoClo toolkit, AddGene ID # 65143). The gene was placed under control of the p40S promoter (AN0465, 40S ribosomal protein S8 promoter), amplified from pVE2_10_p40S_QfDBD_VP16AD_cp_penDE_DsRed_0xQUAS (AddGene ID # 154228) (Mózsik et al., 2019). For the sgRNA transcription unit, the hammerhead (HH) and hepatitis delta virus (HDV) ribozyme sequences (self‐cleavage), gpdA (AN8041) promoter, and trpC terminator were amplified from pFC334 (AddGene ID # 87846) (Nødvig et al., 2015). The terbinafine resistance gene, ergA, was amplified from pCP1_45 (Pohl et al., 2020). The fungal AMA1‐based vector (bleR, phleomycin resistance) was repurposed from vector pDSM‐JAK‐109 (Bovenberg et al., 2016). The resulting vector delivers the expression of both the Cas9 protein and the sgRNA (Figure 3). The DNA insert encoding the sgRNA unit targeting C* PrACVS was generated via PCR using two primers with overlapping regions (Table A1). A Golden Gate assembly reaction was performed to replace the lacZ gene of pLM‐AMA15.0 with the aforementioned insert and cloned into DH10β E. coli competent cells. Positive transformants were identified using blue‐white screening and were confirmed by sequencing (Macrogen Europe B.V.). Marker‐free donor DNA for the deletion of C* was generated by PCR using primers with 106 bp homology toward pACVS (5′ of ATG codon) and ACVS ORF downstream of C*ACVS (Figures 3b, 4c, Table A1).
Protoplasts of P. rubens DS54468 were obtained as described previously (Pohl et al., 2016). For the transformation, 25 μg of marker‐free donor DNA was mixed with 1 μg pLM‐AMA15.x CRISPR/Cas9 plasmid targeting C*ACVS (Figure 3b). The fungal selection was carried out on solid medium plates supplemented with 1.1 μg/ml terbinafine hydrochloride (MilliporeSigma). The genotype of positive transformants was confirmed by colony PCR using Phire Green Hot Start II PCR Master Mix (Thermo Fisher Scientific) and sequencing (Macrogen Europe B.V.). For storage and further analysis, the strains were purified by performing two rounds of sporulation on the terbinafine selection medium.
Agar plate‐based antibacterial assays were set up as follows: an overlay of soft LA‐agar (1%) inoculated with Micrococcus luteus (sensitive to β‐lactams) to an OD600 of 0.125 was poured on top of a pre‐solidified agar (1%) bottom layer. Before that, Oxford Towers (8 × 10 mm) were spaced out evenly on top of the bottom agar layer to generate four wells in the top layer. The Oxford Towers were then removed aseptically, and the wells were inoculated with 10 µl of supernatant samples. The plates were then incubated at 30°C for 24 h. For imaging, a LAS‐4000 system (Fujifilm Life Science) was utilized.
For penicillin production, spores (immobilized on 25 rice grains) of both control and transformant strains were inoculated in 25 ml YGG medium and grown for 24 h in shake flasks (25°C, 200 rpm). The cultures were then diluted eight times into 25 ml PPM containing 2.5 g/L phenoxyacetic acid (POA) and grown in the same conditions for 5 days. Supernatant samples for HPLC/MS analysis were taken and filtered using 0.45‐µm syringe filters. The samples were subsequently stored at −20 or −80°C for short‐term and long‐term storage, respectively, before HPLC/MS analysis. The experiment was performed in triplicate.
The deletion variant of Nl ACVS was obtained via a two‐step cloning strategy. Firstly, we amplified the deletion insert from a Golden Gate (GG) intermediate vector bearing the pBAD promoter region and a fragment of the pcbAB gene. The forward primer was designed to bind just outside of the 3′ end of the C* domain (AA position 237). On the 5′ tail, homologous sequence to the RBS (ribosome‐binding sequence) of the promoter and an appropriate restriction enzyme site were added. The reverse primer only contained a homologous sequence and an appropriate restriction site (Table A1).
The deletion insert was then subcloned into the same template intermediate vector, resulting in a Golden Gate‐ready vector now lacking the C* domain (AA positions 2–236). The product was checked by restriction analysis and sequencing (Macrogen Europe B.V.). Lastly, the pBAD‐Nl ΔC*‐ACVS plasmid was assembled using the Golden Gate assembly reaction (Engler et al., 2009) as described previously (Iacovelli et al., 2020) and cloned into DH10β chemically competent E. coli cells for storage.
For overexpression and purification, the plasmids were cloned into the expression strain E. coli HM0079 (Gruenewald et al., 2004). The strains carrying both the wild‐type and the deletion variant were inoculated overnight in 3 ml selective LB. The cultures were then diluted 1:100 into fresh 2xPY medium (10 g/L NaCl, 10 g/L yeast extract, 16 g/L Bacto™ tryptone), grown to an OD600 of 0.6, transferred to 18°C and 200 rpm for 1 h and subsequently induced using 0.2% l‐arabinose. After 18 h, the cells were harvested by centrifugation at 4000 g for 15 min. After resuspension in lysis buffer (HEPES 50 mM pH 7.0, 300 mM NaCl, 2 mM DTT, complete EDTA free protease inhibitor 1 cp/10 ml; Roche no. 04693159001), cells were disrupted using sonication (6 s/15 s on/off; 50x; 10 µm amplitude; Soniprep 150 MSE) and the cell‐free lysate was obtained by centrifugation at 4°C, 17,000 g, 15 min. Purified enzymes were extracted using Ni2+ affinity purification (1 ml slurry—500 µl resin—per 1 L of culture) using disposable gravity‐flow columns. Wash steps were performed using 10 column bed volumes each of wash buffer (HEPES 50 mM pH 7.0, 300 mM NaCl, 20 mM imidazole) followed by a three‐step elution using four‐bed volumes of each elution buffer (HEPES 50 mM pH 7.0, NaCl 300 mM, imidazole 50–150 or 250 mM). Fractions were then analyzed on SDS‐PAGE 5% mini‐gels and imaged. 2D densitometry analyses (AIDA Image Analysis software, Elysia‐raytest) were followed to determine the concentration of the deletion variant relative to the wild‐type ACVS. Lastly, the purified wild‐type and deletion variants were, respectively, diluted and concentrated using Amicon U‐100 spin filters (Merck). The final concentration for wild‐type ACVS was determined using A280 (NanoDrop 1000; Thermo Fisher Scientific).
Diluted/concentrated Nl ACVS and Nl ΔC* ACVS were subjected to in vitro peptide synthesis assays, to determine product formation. Reaction mixture was set up as follows: HEPES 50 mM pH 7.0, 300 mM NaCl, 5 mM ATP pH 7.0, 5 mM l‐α‐aminoadipic acid, 2 mM l‐cysteine, 2 mM l‐valine, 5 mM MgCl2, 2 mM DTT, and ~0.25 µM ACVS/ΔC* ACVS. The total volume was set at 200 µl. Control reactions were set up analogously, but omitting ATP or l‐α‐aminoadipic acid. Reactions were run at 30°C and sampling took place at 0 (T0) and 300 (T5 h) minutes for endpoint value determination. NaOH was added to each sample to a final concentration of 0.1 M to quench the reactions. Samples were subsequently stored at −80°C. Before HPLC/MS analysis, the samples were thawed in ice, reduced by adding DTT to a final concentration of 10 mM, and filtered.
Samples (50 µl) were obtained from fungal liquid cultures and the in vitro reactions were subjected to HPLC/MS analysis, with an injection volume of 5 µl each. The analysis was performed using an HPLC/MS Orbitrap (Thermo Scientific) in combination with a Reverse Phase‐C18 column (ACQUITY UPLC BEH C18, 130 Å, 1.7 µm, 2.1 mm × 150 mm). Scan range was set at 100–2000 m/z in positive ion mode (ion optics voltages: 4.1 kV spray, 24.4 V capillary, and 73.7 V tube lens), with the capillary temperature set at 325°C. For the in vitro samples, a 24‐min gradient program with Milli‐Q water (A), acetonitrile (B) and 2% formic acid (C) was utilized: 0 min, A 90%, B 5%, C 5%; 4 min, A 90%, B 5%, C 5%; 13 min, A 0%, B 95%, C 5%; 16 min A 0%, B 95%, C 5%; 16.01 min, A 90%, B 5%, C 5%; 24 min A 90%, B 5%, C 5% at a flow rate of 0.150 ml/min. For the fungal (in vivo) samples, a 60‐min gradient program with Milli‐Q water (A), acetonitrile (B) and 2% formic acid (C) was utilized: 0 min, A 90%, B 5%, C 5%; 5 min, A 90%, B 5%, C 5%; 30 min, A 35%, B 60%, C 5%; 35 min, A 0%, B 95%, C 5%; 45 min, A 0%, B 95%, C 5%; 45.01 min, A 90%, B 5%, C 5%; 60 min, A 90%, B 5%, and C 5% at a flow rate of 0.300 ml/min. The Bis‐ACV standard was obtained from Bachem, reduced to (L,L,D)‐ACV, and used for peak identification. The Penicillin V standard was purchased from Sigma‐Aldrich (now MilliporeSigma) and used for quantification in a standard curve at concentrations of 0.0005, 0.0025, 0.0125, 0.0625, 0.3125, 0.625, and 1.25 g/L. The CV dipeptide and the pigment chrysogine (control for fungal experiments, Figure A1) were identified according to their predicted monoisotopic masses, in the absence of synthetic standards.
Through domain prediction and backbone dynamics prediction (Figure A2), we identified a structurally organized region at the N‐terminus of the first module of a bacterial and a fungal ACVS, respectively, from Nocardia lactamdurans (Nl) and P. rubens (Pr) (formerly annotated as Penicillium chrysogenum). Initial BlastP analyses show the significant alignment of the amino acids sequence 2–236 (Nl ACVS) and 222–308 (Pr ACVS) to the condensation domain family (pfam00668), with a respective E‐value of 2.35E‐03 and 3.68E‐03. The N‐terminal regions vary in size between 230 amino acids for Nl ACVS and 300 amino acids for Pr ACVS, while the size of a full condensation domain is generally around 450 amino acids. Because of their alignment with the condensation domain protein family, we refer to the two regions as C* domain from now on (Figure 1a). A similar region has been previously observed in a fungal NRPS (Berry et al., 2019) and in fungal α‐aminoadipic acid (Lys2‐type) reductases such as Lys2 from Penicillium chrysogenum and NPS3 from Ceriporiopsis subvermispora (Hijarrubia et al., 2003; Kalb et al., 2014, 2015). Interestingly, the reductases and the first module of ACVS activate the same substrate, and it has been shown that the N‐terminal domain of NPS3 is critical for adenylation activity; hence, it was termed the adenylation activating (ADA) domain (Kalb et al., 2015).
1 FIGURE. (a) Domain organization and reaction scheme of ACVS. The ACV tripeptide is synthesized from l‐α‐aminoadipic acid, l‐cysteine, and l‐valine. In the next step, ACV is converted to isopenicillin N (IPN) by the IPN synthase, which catalyzes the formation of the β‐lactam ring. Following different biocatalytic routes, IPN is further converted into several classes of β‐lactam antibiotics. The newly identified N‐terminal C* domain is highlighted in orange. (b) Modeling of Nl ACVS M1 and Pr ACVS M1. In both cases, all the canonical domains can be observed, with their expected structural organization. Also, at the N‐terminus of the modules, we observe a highly structured domain (C*) that aligns very well with model condensation domain VibH (PDB: 1L5A). Specifically, C* domains align with the C‐lobe of VibH and completely lack elements from the N‐lobe, including the active site motif HHxxxDG. Pr ACVS M1 C* shows an additional disorganized region at the N‐terminal portion (panel B, bottom right), which correlates well with the domain and backbone prediction analyses
To gain further insight on the C* domains from ACVS, we submitted the sequences of the first module of the two enzymes to DynaMine, an online tool for the prediction of backbone dynamics. Starting from the primary sequence as input, DynaMine predicts backbone flexibility at the residue‐level and outputs backbone N‐H S2 order parameter values. These values can range between 0 and 1, indicating, respectively, a highly dynamic residue (therefore part of an unstructured/unfolded region) or a residue with a stable conformation (meaning the residue is likely part of a highly structured/folded region). The result is a graph where the S2 values are plotted against the respective residues, making it possible to identify (predicted) rigid or flexible regions (Figure A2). In accordance with what we previously observed with BlastP, two structured regions were predicted in the N‐terminal portion of the proteins, with Pr ACVS showing an extra stretch predicted to be more flexible and disorganized. Furthermore, we extended the domain prediction analysis to other ACVSs from different organisms, which all showed similar N‐terminal regions (Figure A3). For this work, we focused on Nl ACVS and Pr ACVS.
We therefore proceeded to generate structural models of the initiation modules of the two ACV synthetases, utilizing the web software i‐TASSER, a fold recognition‐based protein modeling tool. For each of the targets, i‐TASSER generated several models based on a database of fold templates. Only the model with the highest C‐score (0.83 for Nl ACVS M1 and 0.80 for Pr ACVS M1) was selected for each target (see corresponding methods section for further details). Given the strong structural conservation of NRPS enzymes, in both cases, the adenylation domains and the PCPs can easily be identified and appear organized in a canonical manner (Figure 1b). In the N‐terminal regions, a third domain can be observed, with well‐conserved structural motifs. The structures were then aligned with the crystal structure of VibH (Keating et al., 2002) (PDB ID: 1L5A), a model condensation domain, using the structure alignment tool of UCSF Chimera (Pettersen et al., 2004). The C* domain of ACVS superimposes almost perfectly with half of VibH, specifically with the C‐terminal lobe (Figure 1b). The C* domain of Pr ACVS has an extra sequence at the N‐terminus—about 60 amino acids in length—which appears to be rather disorganized and does not align with VibH (Figure 1b). C domains are pseudo‐dimers constituted of two lobes, N‐lobe and C‐lobe, organized in a V‐shape structure (Bloudoff et al., 2013; Bloudoff & Schmeing, 2017; Keating et al., 2002). It seems that the C* domains from ACVS enzymes conserve the structure of an isolated C‐lobe, but completely lack the N‐lobe. The canonical active site motif of condensation domains, HHxxxDG (De Crecy‐Lagard et al., 1995; Marshall et al., 2002), as well as the acceptor PCP binding site, are found in the N‐lobe (Bloudoff et al., 2013; Bloudoff & Schmeing, 2017; Keating et al., 2002) and therefore are missing in C* domains. These results are in accordance with the observations of Kalb et al. (2015) on a structural model of the ADA domain of NPS3, further confirming the similarities between these domains. Subsequent multi‐alignment sequence analysis using Clustal Omega revealed that many of the other conserved motifs of C domains (Marahiel et al., 1997; Marxen et al., 2015; Meyer et al., 2016; Rausch et al., 2007) are also absent in C* and ADA (C1‐C5), while some are partially retained (C6‐C7) (Figure 2). These findings strongly suggest that the C* domains do not exert the catalytic function typical for condensation domains during the enzymatic synthesis of ACV. This region is, however, highly conserved structurally and omnipresent in ACVS enzymes and thus likely fulfills another role.
2 FIGURE. Multiple sequence alignment performed using Clustal Omega. The sequences of Nl ACVS M1 C* Pr ACVS M1 C* were aligned to those of several known condensation domains (obtained from the Uniprot database). The ADA domain from the α‐aminoadipate reductase NPS3 was also included in the alignment. The main residues of core motifs C1–C7 of the condensation domain family are highlighted in red. The active site motif, C3, and motifs C1–C5 are completely lacking in both C* domains and ADA, while C6 is retained only in the case of NPS3 and C7 in all cases
Because of the striking presence of this structural domain in the family of ACVS proteins, we investigated the potential function of the C* domain of Pr ACVS. For this purpose, we created a P. rubens strain lacking the C* domain. To do so, we used a novel CRISPR/Cas9 approach based on the delivery of a custom‐made plasmid carrying both the genes encoding Cas9 and the target‐specific sgRNA, as well as an anti‐fungal (terbinafine) resistance marker (Figure 3). The transformants were first selected for the presence of the plasmid and further analyzed for the correct introduction of the designed C* domain genome deletion by targeted PCR and DNA sequencing. Two strains carrying the desired deletion, C1, and C2, were selected for liquid culturing experiments and HPLC/MS analysis. Also, two control strains were included: the parental penicillin producer strain containing one copy of the penicillin biosynthetic cluster (DS54468), and a non‐producer strain (DS68530) lacking the entire penicillin biosynthetic cluster.
3 FIGURE. CRISPR/Cas9 plasmid‐mediated deletion of the C* domain in Penicillium rubens ACVS. (a) Schematic representation of pLM‐AMA15.x plasmid encoding the components of CRISPR/Cas9 genome editing system, SpCas9‐NLS, and the sgRNA targeting the genomic region of interest. (b) Representation of CRISPR/Cas9 plasmid and donor DNA delivery into fungal protoplasts. (c) Illustration of in vivo expressed Cas9‐RNP performing marker‐free genome engineering and removal of C* domain from pcbAB of P. rubens DS54468
For liquid culturing, all the strains were grown in shake flasks in PPM containing POA, a precursor of penicillin V. After 5 days, the supernatant was collected, filtered, and stored. Before HPLC/MS analysis, the samples were subjected to an antibacterial plate assay to screen for the presence of β‐lactam compounds (Figure 4a). The assay indicated the absence of antimicrobial activity in the samples from strains C1, C2, and the negative control, whereas the positive control showed the formation of a large halo. Analysis via HPLC/MS confirmed these results (Figure 4b). The production of penicillin V is abolished in the strains C1 and C2, suggesting that the deletion of C* rendered ACVS nonfunctional, preventing the formation of the antibiotic compound.
4 FIGURE. Deletion of the C* domain in Pr ACVS. (a) Antibacterial plate assays showing that the transformants (C1 and C2) are not able to produce β‐lactam antibiotics anymore. Positive control (producer strain DS54468) and negative control (non‐producer strain DS68530) are included to validate the assay. (b) HPLC/MS analysis of penicillin V production, confirming the loss of penicillin V production by the transformants (C1 and C2). Producer (DS54468) and non‐producer (DS68530) strains, as well as sterile PPM medium, are included
The study in P. rubens does not reveal whether the deletion of the C* domain causes a functional defect or rather disrupts the expression of the ACVS. To investigate this possibility, we then focused on Nl ACVS, which is very well overexpressed in E. coli and has already been extensively characterized as a purified protein in previous studies (Coque et al., 1996; Iacovelli et al., 2020). To generate the Nl ACVS ΔC* variant, we employed a Golden Gate‐based assembly strategy previously described (Iacovelli et al., 2020). The construct was subsequently cloned and overexpressed in E. coli HM0079 (Gruenewald et al., 2004), a strain that natively expresses a phosphopantetheinyl transferase, responsible for the conversion of apo‐PCP to holo‐PCP domains, necessary for the function of NRPS. The Nl ACVS and its ΔC* variant, containing a 6xHis C‐terminal tag, were purified via Ni2+‐affinity chromatography and analyzed via SDS‐PAGE (Figure 5a). Despite the lower expression levels, sufficient amounts of Nl ACVS ΔC* could be purified for activity assays. Further, the purified protein remained fully stable for 5 h necessary to conduct the enzyme assays, indicating that the overall organization and fold of the enzyme are not compromised.
5 FIGURE. Deletion of the C* domain in Nl ACVS. (a) SDS‐PAGE analysis showing expression and purification of the wild‐type and deletion variant of the Nl ACVS. ACVS ΔC* is expressed at lower levels but the purified protein fractions were diluted or concentrated to achieve comparable concentrations for catalytic activity measurements (top right gel). (b) In vitro assay of ACV (black bar) and CV (gray bar) production as detected by HPLC/MS analysis for ACVS (lanes 2 and 4) and its ΔC* variant (lanes 1 and 3). Reactions were performed with the three amino acids l‐α‐aminoadipic acid, l‐cysteine, and l‐valine, except in lane 4 where l‐α‐Aaa was omitted
The wild‐type ACVS and ACVS ΔC* were, respectively, diluted or concentrated to comparable concentrations based on 2D densitometry analysis on Coomassie Brilliant Blue stained SDS‐PAGE (Figure 5a) and subjected to in vitro peptide formation assays. In these assays, all the amino acid substrates (l‐α‐aminoadipic acid, l‐cysteine, l‐valine) are provided along with ATP and Mg2+, necessary for peptide biosynthesis. Two control reactions were included: one where ATP was omitted from the ΔC*‐ACVS reaction mixture and one where l‐α‐aminoadipic acid was omitted from the wild‐type ACVS reaction mixture. Reactions were analyzed via HPLC‐MS for the presence of ACV and the dipeptide CV. In line with the results obtained in P. rubens, Nl ACVS ΔC* is completely unable to synthesize the tripeptide ACV (Figure 5b). Importantly, the enzyme synthesized significant amounts of the CV dipeptide. CV is normally a by‐product or intermediate of ACV synthesis. When the substrate of module 1, l‐α‐aminoadipic acid, is omitted from the reaction with the wild‐type ACVS, therefore bypassing the function of module 1, we observe the production of CV at the same levels as with the ΔC* variant. Expectedly, no ACV was produced in this case. Further, in the control reaction where ACVS ΔC* was not supplemented with ATP neither peptides were detected. The data indicate that the deletion of the C* domain renders module 1 of Nl ACVS unable to incorporate L‐α‐aminoadipic acid. However, the enzyme remains active in the production of CV, an activity that can be attributed to fully functional modules 2 and 3. These results are analogous to those obtained with ADA‐less variants of NPS3, which also showed a drastic loss of adenylation activity. This suggests that C* and ADA domains carry out the same function during the activation of L‐α‐aminoadipic acid.
Here, we report for the first time on the identification of a previously undiscovered conserved domain in the first module of bacterial and fungal ACVS enzymes, which we named the C* domain. A similar domain, called ADA domain, was previously described in α‐aminoadipic acid reductases from fungi, where its presence is essential for the adenylating function (Kalb et al., 2015). Following bioinformatic analysis and molecular modeling, we investigated the importance of the C* domain on the functionality of the ACVS enzymes, by engineering deletion variants and analyzing their enzymatic activity in vitro and in vivo, respectively. The results suggest that the domain arrangement of ACVS should be now regarded as C*AT1‐CAT2‐CATETe3. Further, our experimental data indicate that the presence of the C* domain in the first module of P. rubens and Nocardia lactamdurans ACVS is critical for its functionality as a unit. Without the C* domain, the activation and subsequent incorporation of l‐α‐aminoadipic acid do not occur and the biosynthesis of ACV is stalled. However, the exact function of this newly identified domain remains at the moment elusive. As hypothesized for the ADA domain of NPS3, it is possible that the C* domains have a structural role, perhaps maintaining the correct fold of module 1, which is essential for its activity. Given the uniqueness of the first module of ACVS enzymes and fungal α‐aminoadipic acid reductases in terms of substrate specificity—they are the only adenylating enzymes known to activate l‐α‐aminoadipic acid, and they do so in a non‐canonical manner (Coque et al., 1996; Iacovelli et al., 2020; Jensen et al., 1990; Kudo et al., 2019; Tahlan et al., 2017; Theilgaard et al., 1997)—it is also possible that the C* domain of ACVS and ADA domain of NPS3 are somehow involved in the recruitment and/or positioning of the substrate. Additionally, the presence and importance for catalysis of C* in the first module of ACVS and ADA in NPS3 (and other Lys2‐type reductases) suggest that these enzymes might share a common origin, perhaps an elongation module that later evolved into an initiation module and stand‐alone reductases, respectively.
Further efforts are required to unravel the evolutionary history and exact role of C* domains during the biosynthesis of ACV. New insights about this mechanistic aspect of the enzyme could prompt engineering efforts targeted at modifying the substrate specificity of the first module, to generate modified penicillin derivatives and replace the procedures for semi‐synthetic penicillin with the complete fermentative production of such derivatives.
None required.
We thank DSM Sinochem Pharmaceuticals (now Centrient Pharmaceuticals) for the Nl ACVS construct. We would also like to thank Dr. Marten Exterkate and Niels de Kok for their help with HPLC/MS analysis. The research leading to these results was financially supported by the European Commission’s Marie Curie Initial Training Network Quantfung (FP7‐People‐2013‐ITN, grant no. 607332).
None declared.
Riccardo Iacovelli lead the formal analysis, investigation (lead), methodology, visualization, conceptualization, writing—original draft, and writing—review and editing. László Mózsik lead the Investigation and methodology and supported the visualization, writing—original draft, and writing—review and editing. Roel A. L. Bovenberg supported the conceptualization, supervision, writing—original draft, and writing—review and editing. Arnold J. M. Driessen lead the conceptualization, supervision, and project administration and supported the writing—original draft and writing—review and editing.
All data generated or analyzed during this study are included in this published article.
A1 FIGURE. Production of chrysogine was detected and measured via HPLC/MS in the cultures' supernatant after 5 days of growth, to serve as a control
A2 FIGURE. Domain and backbone dynamics prediction of Nl ACVS M1 (A) and Pr ACVS M1 (B). The upper panels represent the graphic summary of the BlastP domain prediction, where all the canonical domains are identified, as well as the N‐terminal regions that align (partly) to the condensation domain family. The lower panels depict the backbone dynamics prediction obtained with DynaMine. In addition to the three canonical domains of module 1, predicted folded regions can be identified also in the N‐terminal stretch of the two sequences, termed C* domains
A3 FIGURE. BlastP domain predictions of ACV synthetases from different organisms show the omnipresence of the conserved N‐terminal region. The ACVS from Acremonium chrysogenum does not show alignment to the condensation domain family but retains an equally sized region
Description | Template | Primer Sequences (5' → 3') |
sgRNA insert C* PrACVS | Overlapping oligonucleotides |
FW: ATGGTCTCACCGACACTACCTGATGAGTCCGTGAGGACGAAACGAG
RV: ATGGTCTCTAAACATAGGCTTCTCGGCCACTACGACGAGCTTACTCGTTTCGTCCTCACGGACTCA |
Marker‐free donor DNA ∆C* PrACVS | Overlapping oligonucleotides |
FW: CCCAGTATAAGGAATTCCCCTCGAGCTTGTCTGTGATTGCGTTTTTTCTAACACTTGTTGTTGCATCCGATCCGTCCCTACCAATTATTGGTCATTGACAGACATGGAAGAGTGGAACAA
RV: CGCAGACAACGGCTATTTTGTCTTCATGCCGTTCAACCACCTCTTCAATGAGATGGTGCAGTCGCTTTGATGAAGGGTACTCGCCATCCGTGTTGTTCCACTCTTCCATGTCTGTCAATG |
Deletion fragment ∆C*NlACVS | pBAD‐Nl ACVS intermediate vector |
FW: ATCGTGTACAAATCGATTACCTAGGAGGAATTAACCATGCACCAGTGGAACGGCACCGAC
RV: ATCGAAGCTTGCAATGGCGCATGTC |
Enzyme | Amino acid positions | Sequence |
Nl ACVS | 2‐236 | TSARHLKSAADWCARIDAIAGQRCDLEMLLKDEWRHRVAVRDSDTAVRATQEKELTISGQDYTALKQALGAMPLEAFALATLHSVLHAYGHGHQTVVAFLRDGKVLPVVVDHLEQAGLTCAEAAEQLEDAVAREDMYLPPEELLQRGLFDALLVLADGHLGFTELPPAPLVTIVRDDPAAGCLHWRIAYAGEFFEDKIIAGVLDVAREVLGQFIGRPEQLVADIDLVSAEQELQL |
Pr ACVS | 2‐308 | TQLKPPNGTTPIGFSATTSLNASGSSSVKNGTIKPSNGIFKPSTRDTMDPCSGNAADGSIRVRFRGGIERWKECVNQVPERCDLSGLTTDSTRYQLASTGFGDASAAYQERLMTVPVDVHAALQELCLERRVSVGSVINFSVHQMLKGFGNGTHTITASLHREQNLQNSSPSWVVSPTIVTHENRDGWSVAQAVESIEAGRGSEKESVTAIDSGSSLVKMGLFDLLVSFVDADDARIPCFDFPLAVIVRECDANLSLTLRFSDCLFNEETICNFTDALNILLAEAVIGRVTPVADIELLSAEQKQQL |
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2021. This work is published under http://creativecommons.org/licenses/by-nc/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
The
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details


1 Molecular Microbiology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, The Netherlands
2 Synthetic Biology and Cell Engineering, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, The Netherlands; DSM Biotechnology Centre, Delft, The Netherlands