INTRODUCTION
In eukaryotes, the highly coordinated sequence of events occurring in a unidirectional manner to ensure the generation of two new daughter cells is known as the cell cycle. There is an elaborate temporal control of gene expression along the cell cycle, directed by checkpoints and molecular regulators that dictate its progression. For most of the eukaryotic models, this periodical gene expression is strongly dependent on transcriptional control, where networks of transcription factors act to regulate the expression of large sets of mRNAs (4–7). In addition, targeted protein degradation at specific times of the cell cycle is a key step to ensure progression to subsequent phases (8); thus, proteomic analyses are of major relevance to understand the cell cycle (9). Nevertheless, the translational regulation of mRNAs, which represent a main determinant of protein abundance, has not been studied in a genome-wide fashion during the cell cycle until recently. Ribosome profiling consists of the deep sequencing of ribosome-protected mRNA fragments (footprints) and has proved to allow a highly accurate measurement of the translation process on a genome-wide basis (10, 11). The development of this approach led to the first reports of translatome remodeling on the human (12, 13) and budding yeast (14) cell cycle.
As a single-celled organism,
The cell cycle of
Recently, genome-wide approaches have been carried out using synchronic
Here, we provide the first ribosome profiling study of the trypanosomatid cell cycle, and in parallel, we determined the quantitative proteome. The global patterns of gene expression regulation were compared at three levels during the proliferative transition (G1/S) of synchronized epimastigote populations. We determined the differentially expressed genes (DEGs), identifying their specific levels of control. The ontological gene terms enriched in each phase highlight known cell cycle pathways and novel periodically expressed proteins. Cell cycle regulators, including cyclins, CDKs, and RBPs differentially expressed during the G1/S transition, are analyzed. Our results show the outstanding role of translational regulation in the two cell cycle phases studied. Coexpressed groups of functionally related genes that may comprise regulons were identified. The comparison of the regulatory levels reveals a complex and regulon-specific interplay between the translatome and the proteome. Our study improves the understanding of
RESULTS
Translational and proteomic remodeling during
Hydroxyurea (HU)-induced synchronization was used to obtain cell cycle-enriched populations of
FIG 1
Ribosome profiling data set. (A) A fluorescence-activated cell sorting analysis of DNA content staining with propidium iodide was carried out for HU-synchronized
As a quality control, read mappings resulting from a ribosome footprint assay should display a 5′-end 3-bp mapping periodicity and should map preferably over protein coding sequences (10, 23). For the first analysis, we discriminated the reads by length, determined the p-site offset for each read length, and calculated the 5′ periodicity taking genes displaying a continuous overall coverage as previously reported (10). Periodicity was observed in the translatome data as the RFPs mapped more frequently to the first codon position, and the second codon position was the least represented (11). As expected, the transcriptome data did not display this 3-nucleotide (nt) periodicity (Fig. 1E). Ribosome footprints predominantly map to protein coding regions as 93% of the reads fall within the initiation and stop codons as expected. This was not the case for polyadenylated reads coming from the transcriptome study, since only 45% of these reads mapped to coding regions while the remaining mapped to the untranslated regions (UTRs) and intergenic regions (Fig. 1F). Both observations support that RFP reads correspond to transcripts actively translated by polyribosomes.
Protein extracts of the G1 (0 h post-HU release) and S-phase (6 h post-HU release) enriched cell cycle populations were obtained with a protocol identical to that used for the preparation of ribosome footprints (Fig. 2A). Tryptic peptide-digested samples were processed in parallel using high-pH reversed-phase fractionation to reduce complexity prior to analysis by liquid chromatography-tandem mass spectrometry (LC-MS/MS) (Table S1). Principal-component analysis showed a correct grouping of the replicates, demonstrating separation of the data sets by the first component that contained 66% of the variance (Fig. 2B). Data analysis identified 4,524 protein groups containing 4,918 proteins, with normalized label-free quantification (LFQ) intensity values presented in Table S2. The lower sensitivity and range of the proteomic detection, compared to the Ribo-seq, together with the use of only two replicates, which increases the likelihood of type II errors, led us to use a 1.5-fold change for differential protein expression assessment. We considered only protein groups supported by an FDR lower than 0.05. Following these criteria, 408 genes (8.3% of total) displayed altered protein levels, with 168 and 240 upregulated genes in G1 and S, respectively (Fig. 2C and D). We also incorporated the proteins detected in both replicates of one cell cycle stage (above the 5th percentile in LFQ values) but absent from the other, resulting in an additional 36 and 208 proteins for G1 and S, respectively. Therefore, in total we identified 652 genes regulated at the protein abundance level (P-DEGs) in the G1/S transition, of which 204 and 448 are upregulated at G1 and S phase, respectively.
FIG 2
Proteomic data set. (A) A fluorescence-activated cell sorting analysis of DNA content staining with propidium iodide was carried out for HU-synchronized
Comparison of the transcriptomic, translatomic, and proteomic changes at the
We have generated transcriptomic (previous study [44]) and translatomic and proteomic (current study) data sets of G1/S cell cycle phases of
In order to compare the sensitivities of the three approaches used, we set a detection threshold of 15 raw reads per gene in the sequencing studies (23) and 1 unique peptide for the proteomic study (46) (Fig. 3A). We had previously identified 9,040 genes of the 10,342
FIG 3
Data set comparisons. (A) The threshold for detection was 15 nCounts for both the transcriptome (T, violet) and translatome (R, red), and at least 1 unique peptide for the proteome (P, blue). (B.) The histogram presents the mRNA levels for the set of detected genes in each data set. (C) Dot plots for pairwise gene expression comparisons of the two data sets. R values for Pearson correlations for the comparisons are presented above the arrows connecting the data sets analyzed.
A broad picture of the expression changes observed in the G1/S transition at the three different levels of gene expression analyzed is presented in Fig. 4. For this initial analysis we included a larger proportion of the data sets, comprising genes with 15 normalized reads for the sequencing studies and at least 1 valid LFQ value in one replicate for each cycle phase for the protein quantification analysis. Globally, the changes observed at the translatome are larger in number and dynamic range than those observed at the transcriptome (Fig. 4A), confirming that
FIG 4
Correlations of S/G1 changes. Log2 fold change comparisons between the 3 data sets. Transcriptome data set is from the work of Chavez et al., 2017 (44). For this analysis, we included genes with 15 normalized reads for the sequencing studies (transcriptome [T] and translatome [R]) and at least 1 valid LFQ value for one replicate of each of the two cell cycle phases studied for the proteome (P). The number of genes analyzed in each diagram resulting from the threshold mentioned above is presented at the bottom. Fold changes were taken from average gene expression for the replicates for both R and P. Genes displaying changes are colored as described below the plots; the thresholds considered are those used to define DEGs in further analysis (1.5-fold for T and P and 2-fold for R). (A) Translatome versus transcriptome. (B) Proteome versus transcriptome. (C) Translatome versus proteome.
As expected, the comparison between the transcriptome and the proteome also shows that proteomic variations are more numerous and larger than those of transcriptomics (Fig. 4B). Yet, due to proteomic sensitivity limitations, fewer genes are included in this analysis (3,043) relative to the transcriptome-translatome comparison. Over half of these genes (53%) do not change their levels, neither in the proteome nor in the transcriptome. The vast majority of the modulated genes show changes only in the proteome (41%), while few of them change only in the transcriptome (2%), reinforcing the relevance of posttranscriptional mechanisms to achieve differential protein levels from similarly abundant mRNAs. From the subset of genes that displayed changes in both studies (62), again most of these genes (40 genes representing 1.3% of the total) showed changes in the same direction.
The fraction of regulated genes identified in the translatome-proteome comparison is the highest among the three data sets (55% of the genes, translatome fold change > 2 and proteome fold change > 1.5) (Fig. 4C). While 41.8% exhibit differential levels in the proteome and 26.1% in the translatome study, only 12.6% of them present differential levels in both studies. If the latter observation holds true for the genes not detected at protein level, it will indicate that the regulation of protein stability affects more genes than the translational control of mRNAs during the G1/S cell cycle transition. Since the coordination of protein modifications and stability represents a molecular hallmark of the cell cycle transitions (50), being essential for the G1/S transition (85), a strong regulation of protein levels is expected. Unexpectedly, changes in opposite directions (222 out of 372, 7%) are more frequent than those in the same direction (150, 5%). The uncoupling of gene expression changes in omics experiments has been discussed recently and proved to be more common than previously recognized (51). As presented above, differences in the magnitude of modulation in each data set could contribute to the profiles observed (Fig. S2). An alternative explanation is possible if protein half-life is longer than the polysomal mRNA activity, and so variations in production in the translatome have less effect on the proteome. This hypothesis will be addressed below in this work.
Genes and biological process differentially regulated at the translatome and proteome during
Seeking to investigate the biological function of the DEGs along the
TABLE 1
Enriched Gene Ontology terms of G1/S DEGs
G1-enriched GO terms | S-enriched GO terms | ||||||
---|---|---|---|---|---|---|---|
Name | No. of genes | FE | Name | No. of genes | FE | ||
| |||||||
Monosaccharide binding (MF) | 3 | 56.6 | 1.8E−03 | DNA binding (MF) | 13 | 6.0 | 3.0E−05 |
Ligase activity, forming carbon-nitrogen bonds (MF) | 4 | 17.4 | 9.8E−03 |
| 8 | 11.6 | 1.4E−04 |
Nucleoside phosphate catabolic process (BP) | 4 | 18.9 | 1.7E−02 | Chromatin (CC) | 5 | 11.8 | 6.4E−03 |
| 7 | 5.9 | 2.1E−02 | Nucleic acid binding (MF) | 21 | 2.5 | 6.9E−03 |
| 11 | 4.0 | 2.2E−02 | Nucleus (CC) | 32 | 1.9 | 8.1E−03 |
| 5 | 10.1 | 4.0E−02 | Protein folding (BP) | 8 | 6.0 | 2.9E−02 |
Regulation of macromolecule metabolic process (BP) | 11 | 3.5 | 7.8E−02 |
| 4 | 12.2 | 3.2E−02 |
Chromosomal part (CC) | 6 | 6.0 | 5.0E−02 | ||||
Kinetoplast (CC) | 9 | 3.8 | 5.7E−02 | ||||
| |||||||
| 87 | 5.3 | 5.9E−44 | Cytoskeleton (CC) | 113 | 2.5 | 1.8E−19 |
| 102 | 3.4 | 2.0E−30 | ATP binding (MF) | 107 | 2.0 | 1.8E−11 |
| 97 | 3.5 | 6.9E−30 | Nucleoside triphosphatase activity (MF) | 72 | 2.4 | 1.5E−10 |
Biosynthetic process (BP) | 167 | 2.2 | 1.1E−24 | Ciliary basal body (CC) | 48 | 3.0 | 1.7E−10 |
Protein-containing complex | 192 | 1.9 | 1.1E−19 | Microtubule organizing center (CC) | 48 | 2.9 | 8.3E−10 |
Vacuole (CC) | 42 | 3.1 | 5.9E−10 | Microtubule motor activity (MF) | 29 | 4.2 | 1.4E−09 |
Oxidoreductase activity (MF) | 65 | 2.3 | 2.8E−09 | Chromosome segregation (BP) | 15 | 6.5 | 2.1E−07 |
Mitochondrion (CC) | 143 | 1.5 | 4.1E−07 | Protein binding (MF) | 120 | 1.7 | 3.7E−07 |
| 72 | 2.0 | 9.6E−07 | Small-molecule binding (MF) | 119 | 1.7 | 3.9E−07 |
| 38 | 2.5 | 3.9E−05 | DNA helicase activity (MF) | 12 | 7.2 | 1.2E−06 |
| 21 | 3.3 | 2.8E−04 | DNA repair (BP) | 24 | 3.4 | 2.9E−05 |
Glucose metabolic process (BP) | 8 | 6.4 | 2.9E−03 | Cell projection (CC) | 130 | 1.5 | 8.8E−05 |
Cytochrome complex (CC) | 6 | 7.5 | 3.7E−03 | Telomere organization (BP) | 7 | 9.5 | 2.4E−04 |
Cofactor binding (MF) | 30 | 2.2 | 1.1E−02 | Kinetochore (CC) | 10 | 5.7 | 3.7E−04 |
Proteasome complex (CC) | 12 | 3.2 | 4.4E−02 | Axoneme (CC) | 40 | 2.0 | 3.2E−03 |
Nucleotide catabolic process (BP) | 10 | 4.0 | 5.5E−02 | DNA recombination (BP) | 9 | 5.4 | 6.2E−03 |
NADP binding (MF) | 7 | 5.1 | 5.9E−02 |
| 20 | 2.5 | 2.0E−02 |
| 69 | 1.5 | 7.3E−02 |
| 15 | 2.8 | 8.5E−02 |
| |||||||
| 22 | 3.22 | 3.6E−04 |
| 12 | 7.3 | 3.7E−06 |
| 14 | 3.65 | 5.3E−03 | DNA packaging (BP) | 9 | 7.8 | 3.9E−04 |
| 18 | 2.81 | 1.2E−02 | Small-molecule metabolic process (BP) | 41 | 2.2 | 9.2E−04 |
Calmodulin binding (MF) | 4 | 15 | 1.9E−02 | Mitochondrion (CC) | 78 | 1.6 | 1.1E−03 |
Translation elongation factor activity (MF) | 7 | 6.11 | 2.5E−02 | Chromatin assembly (BP) | 7 | 9.6 | 1.2E−03 |
Entry into host cell (BP) | 3 | 28.13 | 2.8E−02 | Ciliary transition zone (CC) | 14 | 3.9 | 2.3E−03 |
Nucleic acid binding (MF) | 34 | 1.86 | 4.7E−02 | Oxidoreductase activity (MF) | 33 | 2.3 | 2.3E−03 |
| 23 | 2.18 | 6.0E−02 | Nucleotide metabolic process (BP) | 19 | 3.2 | 3.5E−03 |
Ciliary plasm (CC) | 67 | 1.7 | 4.4E−03 | ||||
Coenzyme binding (MF) | 16 | 3.1 | 1.4E−02 | ||||
Voltage-gated channel activity (MF) | 5 | 10.3 | 1.5E−02 | ||||
Cofactor binding (MF) | 15 | 2.9 | 3.7E−02 | ||||
| 15 | 2.9 | 4.7E−02 |
a
The top significantly enriched (adjusted
The upregulation of ribosomal proteins and genes related to translation regulation in G1 arises among the top biological features, being mainly observed in the translatome but also in the proteome. It is worth noting that no G1 upregulation of ribosomal protein genes was observed at the transcriptome. A similar translational regulation of ribosomal proteins was observed through ribosome profiling of epimastigote transition to the quiescent metacyclic trypomastigote (23), where the ribosomal proteins are more efficiently translated in the noninfective replicative epimastigote.
Cellular functions related to carbohydrate metabolism and energy production were found overrepresented in G1 as for the translatome and transcriptome. This is expected since cell growth mainly occurs at this stage (52, 53). However, these processes were not as clearly overrepresented in the G1 proteomic data. This finding suggests that while the synthesis of these enzymes slows down in S phase, their half-life is long enough to maintain the steady-state levels at least at the sampling time. Another remarkable molecular function upregulated in G1 is RNA binding, which is modulated at both the translatome and the proteome data sets.
As expected, molecular pathways related to DNA metabolism are overrepresented in S phase. Although these terms are enriched at the three levels, slightly different ones emerged from the different sets of genes. DNA replication was specifically observed in the transcriptome and the translatome, while DNA repair and recombination pathways are observed only for the translatome genes. Meanwhile, the terms observed in the proteome are biased toward DNA packaging and chromatin assembly, perhaps because of the high expression of these genes and the smaller size of the proteomic data set.
Interestingly, terms related to the mitochondrion and the respiratory chain upregulated in G1 at the translatome level are also upregulated in S phase at the proteome, suggesting that these mRNAs are loaded onto the polyribosomal compartment in G1, thus provoking increased protein levels at S phase.
Finally, cellular functions related to the mitotic spindle formation and organelle organization are overrepresented in the translatome data set at S phase but not yet in the proteome. Coincidentally, these biological functions peaked at G2/M in our previous transcriptomic study (44); thus, these proteins may increase their abundance later at G2/M phase.
Analysis of the expression profiles of putative cell cycle regulators.
The 11
FIG 5
Expression profiles of selected cell cycle regulators. Heatmaps for transcriptome-translatome-proteome (T-R-P) log2 fold change of selected differentially expressed genes. (A) Genes coding for cyclins. (B) Genes coding for cdc2-related kinases (CRKs). (C) Genes coding for RNA binding proteins (RBPs). The white asterisk denotes the data set that displayed regulation. Gray shading implies fold change could not be calculated from the proteomic data. Black shading indicates that no protein was detected in any of the replicates.
The eight annotated trypanosomatid CDK analogs (cdc2-related kinases [CRKs]) are identified in the translatome, and all but one (CRK3) at the proteome. Most of them remain unchanged in G1/S, as expected since their functionality is posttranslationally regulated by phosphorylation (46). Interestingly, CRK11 (TcCLB.511751.50) showed concordant downregulation in S phase at both translation and protein levels, while CRK1 (TcCLB.504181.40) is only significantly upregulated at the protein level in S phase (Fig. 5B). While there are no reports for CRK11 in the literature, CRK1 is a deeply studied regulator of the cell cycle in trypanosomatids. CYC2, CYC4, CYC5, three putative G1 cyclins, are confirmed partners of CRK1 in
In the context of gene expression regulation in trypanosomatids, RNA binding proteins (RBPs) are candidate surrogates for transcription factors; thus, we searched for the differentially expressed RBPs at the G1/S transition. Twenty-eight out of the 87 genes bearing an “RNA-binding” annotation in TriTrypDB (CL Brener Esmeraldo-like haplotype) are differentially expressed in at least one data set. For this set of RBPs, protein levels are frequently independent from ribosome occupancy, suggesting that they are under diverse and complex control mechanisms. RBP40 is the only one that has been characterized in
Cycling sequence binding proteins (CSBPs), associated in two complexes (CSPBI and CSBPII), were described in the closely related trypanosomatid
The CSBPII complex consists of two RBPs with the PSP-1 C-terminal domain (CSBPII_45 and CSBPII_33), whose activities are modulated by phosphorylation. CSBPII_45 binds to a short sequence motif necessary for periodic expression of mRNAs along the cell cycle (64). None of the CSBPII
Identification of posttranscriptionally coexpressed gene sets in
To identify putative posttranscriptional regulons operating in the epimastigote G1/S transition, we clustered the 2,757 genes with reliable fold change values in both the translatome and the proteome (see Materials and Methods for more details). The Pearson correlation clustering based on the logarithmic S/G1 gene fold change resulted in 8 gene groups with similar regulation profiles (Fig. 6A). The top 3 nonredundant Gene Ontology (GO) terms identified for each cluster are listed in Fig. 6B. It is worth noting that the GO analysis presented here is expected to differ from the one in Table 1, since it considers genes with reliable S/G1 fold change values in the ribosome profiling and proteomic studies simultaneously. Clusters 2 and 6 are mainly regulated in the translatome without major consequences on protein levels. Cluster 2 is composed of genes upregulated in the translatome at G1, which are enriched in terms related to the cytoskeleton and the microtubule motor activity as well as the cilia/flagella. These biological processes point to both the cytoskeletal reorganization that takes place at G2/M and the development of the second flagellum that begins at early G2 and takes place over this stage before mitosis begins. As these genes raise their ribosome occupancy in S phase but not yet the protein levels, their mRNAs might be loaded on the polyribosome compartment but not yielding increased steady-state protein levels yet. Similar biological processes are enriched in cluster 3, which exhibits an upregulation in both the translatome and the proteome, indicating a regulatory heterogeneity for the proteins involved in these pathways. Then, we looked at the group of genes upregulated in G1 phase at the translatome while not changing the protein levels significantly (cluster 6), which are enriched in metabolic processes and energy consumption terms. As mentioned earlier, a high metabolic activity is a hallmark of G1 of the cell cycle. Again, a delayed effect of mRNA translation in the proteome due to protein stabilization may explain the lack of corresponding variation of the proteins encoded by these transcripts in G1.
FIG 6
Putative sets of coregulated genes. (A) (Top) Pearson correlation clustering for a set of 2,757 genes with valid fold change values for both the translatome (R) and the proteome (P) data sets. (Middle) The number of genes comprising each cluster. (Bottom) Schematic representation of the location of the genes of each cluster on the fold change diagram presented in Fig. 4C. (B) Top 3 nonredundant gene ontology terms overrepresented in each cluster of genes. Bar heights represent fold enrichment values, and color represents statistical significance (Bonferroni-corrected
Seeking to investigate if the different regulation of each cluster is associated with mRNA and protein metabolism, we calculated the average half-life of both molecules in asynchronous cultures using
Furthermore, proteins involved in motility represented by GO axoneme and cell motility represented by clusters 1, 2, and 3 are significantly more translated in S phase. This is confirmed at the protein level, more evidently in cluster 3 and for some genes in cluster 2. The apparent uncoupling of translation and protein levels in cluster 1 may be explained by their low mRNA and protein half-life (Fig. S3). This observation suggests that there is a dedicated regulatory mechanism responsible, which opens the possibility of increasing the steady-state protein levels later at S phase. The absence of the ribosomal proteins term enriched (14 genes from Table 1) in the clusters may be due to heterogeneity of the translation and protein abundance dynamic, as these genes were dispersed in 4 different clusters and thus did not lead to ontology term enrichment.
DISCUSSION
The absence of transcriptional regulation in
Using the most efficient HU synchronization protocol published in the
Since the epimastigote form is a noninfective stage, our findings would not be directly applicable to the patient treatments but might be useful for parasite-vector control strategies. The decision to study the cell cycle of this parasite form was based on the need for a large number of cell cycle-enriched parasite populations for high-throughput methodologies (68). At the moment, this cannot be achieved working with the intracellular amastigote stage. Nonetheless, the epimastigote form has been widely used as the parasite model to study different biological aspects including sensitivity to chemotherapy drugs. At the same time, molecular mechanisms and machinery of cell replication are likely to be remarkably similar among developmental stages, while the most striking differences would be expected to be at replication control/checkpoint.
Using ribosome profiling and proteomics, we obtained over 31 million sequencing reads and over 500,000 triggered MS/MS spectra, representing 7,248 transcripts and 4,524 protein groups (corresponding to 4,918 genes), respectively. Similar yields were previously reported in trypanosomatid ribosome profiling applying the same cutoff for detection (7,873 transcripts in
We found a high correlation among the three data sets and a higher correlation between ribosome occupancy and protein abundance (translatome-proteome) in comparison to transcript levels (both transcriptome-translatome and transcriptome-proteome). This finding agrees with ribosome profiling studies in diverse processes of various model organisms, reviewed by Eastman and collaborators (11). A broader regulation was observed at the G1/S transition for the translatome and the proteome levels compared to the transcriptome, seen in the higher number of DEGs (T-DEGs, 305; R-DEGs, 1,784; P-DEGs, 653) as well as in the wider dynamic range of variation observed (see Fig. S2 in the supplemental material), reinforcing the relevance of gene expression control steps occurring after the establishment of mRNA steady-state levels. The slightly larger modulation of the translatome than of the proteome may be influenced by methodological biases; thus, the true relevance of this difference is uncertain. Indeed, transgenomic comparisons are expected to be affected by intrinsic biases of the different techniques. In addition, although the three data sets have been generated using identical and reproducible inter-data-set synchronization protocols and identical subsequent intra-data-set procedures in parallel, we contrasted independent experiments; therefore, batch effects cannot be completely ruled out.
Translational control of specific mRNAs over the cell cycle has been described recently in model organisms employing the ribosome profiling strategy, as reviewed by Aramayo and Polymenis (68). Comparing the numbers of translationally regulated mRNAs in each model,
A recent proteomic study on
Both the R-DEGs and the P-DEGs identified in the G1/S phase transition are enriched in well-known cell cycle pathways and associated processes. Our analysis reveals that translational regulation magnifies the differences already present at the level of transcript abundance for several biological processes, such as glycosome biology and functions related to energy metabolism for G1-upregulated genes. Due to the broader translational control compared to the transcriptome, the R-DEGs not only include but further expand the list of genes observed in the T-DEGs (Fig. S4). In addition, translational regulation provokes a higher magnitude of change than RNA metabolism (Fig. S4). A similar phenomenon is observed for S-phase-upregulated genes, particularly for DNA replication pathways; however, in this case, a different set of genes is involved in the transcriptome and translatome (Fig. S5). Nevertheless, translational regulation is mostly acting on genes that are not regulated at the mRNA level, many of which were not previously studied during the trypanosomatid cell cycle. As an example, translational regulation was observed in G1-upregulated genes such as ribosomal proteins and other genes involved in translation, proteasome, and mitochondrial and oxidative processes; likewise, S-phase translationally upregulated genes include microtubule biology-, cytoskeleton-, and motility-related genes and the kinetochore complex. Interestingly, our study found certain groups of translationally regulated genes, such as ribosomal proteins, that have also been identified as translationally regulated in the parasite life cycle (23). This suggests that certain gene sets might be mainly regulated at specific gene expression levels regardless of the biological process studied. The comparison of the translatome and the proteome shows that the majority of the genes regulated are related to different cell processes in each data set, although some ontology terms are regulated in both, such as translation-related terms and RNA binding and chromosome genes (Fig. S5). It is interesting that genes related to the G2/M-phase processes increase translation levels on S phase without change in protein abundance. Since it is generally accepted that global translation decreases toward the G2/M phase of the cell cycle (12, 75), it is tempting to propose that G2/M-related proteins need to be produced in S phase while the translation machinery is still highly active but might be accumulated or stabilized in later G2/M phase. Additional time points would be required to test this hypothesis. Overall, the results suggest that the differential gene expression of related gene terms is achieved in multiple ways to finely define the time coordination of biological processes along the cell cycle.
Due to the importance of the identification of
In this study, we also aimed to identify putative posttranscriptional regulons operating in the epimastigote G1/S transition; thus, we focused on the coexpressed genes. Based on the expression at the translatome and the proteome, eight gene clusters with similar expression profiles were identified, changing at a single (only translatome and only proteome) or both levels. The comparison of the concordance of the changes in the translatome and proteome shows both clusters that include genes regulated in the same direction and genes regulated in opposite directions. The regulatory complexity revealed by this analysis suggests that diverse coordinated mechanisms may be needed to define the precise level of specific groups of proteins at different time points during the cell cycle.
A proteome and phosphoproteome study of the
In conclusion, we had generated a comprehensive data set uncovering three levels of gene expression, a comparison that has not been assessed in trypanosomatids before. Indeed, very few similar studies are currently published in the literature; thus, further investigation is still required to understand the complexity of the regulation. Our study reveals a larger translational regulation during the G1/S transition of the
MATERIALS AND METHODS
Parasites.
Hydroxyurea-induced synchronization and flow cytometry analysis.
Parasites were synchronized with hydroxyurea (HU) as originally described by Galanti et al. (47) and previously set up for our TcI strain (44). Late G1 and mid-S-phase-enriched parasite population samples were collected at 0 and 6 h post-HU release, respectively. A mock sample of parasites was treated under identical conditions except for the use of HU. An aliquot of 2 × 106 parasites/ml was washed twice in cold phosphate-buffered saline (PBS) prior to fixation in 500 μl 70% ethanol in PBS at 4°C for at least 1 h. DNA-specific propidium iodide (PI) staining was conducted by incubation of the fixed parasites for 30 min at 37°C in PBS containing 20 μg/ml PI and 200 μg/ml RNase A. Three technical replicates per biological sample were analyzed for DNA content in a flow cytometer (Accuri C6; BD Biosciences), and the proportions of G1, S, and G2/M cells in the samples were determined as previously described (77).
Ribosome profiling and deep sequencing.
Three independent synchronization experiments were prepared in parallel to harvest G1- or S-enriched parasite cultures. G1-phase and S-phase samples were harvested in independent experiments. Ribosome-protected footprints (RFPs) were generated through nuclease treatment of cell extract in the presence of cycloheximide (CHX) as previously described (10) and recently optimized for
Sequence read processing, mapping, and differential gene expression.
Read trimming was performed using fastx_clipper (FASTX_Toolkit, v0.0.14) with the parameters
Label-free proteomics sample preparation and analysis.
Two independent synchronization experiments were performed in parallel. Intrareplicate G1 and S phases were derived from the same synchronized culture. Cell cycle-enriched parasite populations were lysed at 5 × 108 cells/ml in SDS lysis buffer (8% SDS, 200 mM Tris, pH 8.5, 200 mM dithiothreitol [DTT], and cOmplete mini EDTA-free protease inhibitor cocktail by Sigma) at 95°C for 5 min. Peptide samples for analysis by mass spectrometry were prepared as described by Urbaniak and collaborators (82), based on modifications of the filter-aided sample preparation (FASP) procedure (83). Protein samples were defrosted to give a total of 2.5 × 109 lysed cells (0.5 ml), solubilized with 4% SDS, and then reductively alkylated in a 30,000-molecular-weight-cutoff vertical spin filtration unit (Vivascience) using the FASP procedure adapted for the larger volumes used here. The sample was digested with a 1:100 ratio (wt/wt) of trypsin gold (Promega) in the filtration unit for 18 h at 37°C, tryptic peptides were eluted by centrifugation, and the filter was washed sequentially with 1 ml of 50 mM NH4HCO3 and 1 ml of 0.5 M NaCl. The combined eluent was desalted using a 500-mg C18 cartridge (SepPak; Waters) and lyophilized. In order to reduce sample complexity, peptide preparations were fractionated in a high-pH reversed-phase peptide fractionation kit (Thermo Scientific Pierce), increasing the proportion of acetonitrile (ACN) in order to obtain different eluates (flowthrough [FT] = 0% ACN, 2% ACN, 3% ACN, 4% ACN, 6% ACN, 10% ACN, 50% ACN). Later, the eluates were combined into 4 fractions (F1 = FT + 4% ACN; F2 = 2% ACN + 4% ACN; F3 = 3% ACN + 50% ACN, and F4 = 6% ACN) based on peptide quantitation and the hydrophobicity nature of each eluate. Liquid chromatography-tandem mass spectrometry (LC-MS/MS) was performed by the FingerPrints Proteomic Facility at the University of Dundee. Liquid chromatography was performed on a fully automated Ultimate U3000 nano-LC System (Dionex) fitted with a 1- by 5-mm PepMap C18 trap column and a 75-μm by 15-cm reverse-phase PepMap C18 nanocolumn (LC Packings; Dionex). Samples were loaded in 0.1% formic acid (buffer A) and separated using a binary gradient consisting of buffer A (0.1% formic acid) and buffer B (90% methyl cyanide [MeCN], 0.08% formic acid). Peptides were eluted with a linear gradient from 5% to 40% buffer B over 65 min. The high-performance liquid chromatography (HPLC) system was coupled to an LTQ Orbitrap Velos Pro mass spectrometer (Thermo Scientific) equipped with a Proxeon nanospray ion source. The mass spectrometer was operated in data-dependent mode to perform a survey scan over a range of 335 to 1,800
Proteomics data processing.
Data were processed using MaxQuant15 version 1.3.0.5 which incorporates the Andromeda search engine (84). Proteins were identified by searching a protein sequence database containing
Clustering analysis.
In order to select a group of genes with reliable fold change values in both the ribosome profiling and the proteomic analysis, we applied a set of filters to the data sets. First, we selected the genes with over 40 nRFPs in the ribosome profiling experiment. Only the proteins classified as “single copy” in Table S2 were considered in this analysis, to avoid redundancy in the expression profiles for proteins of multicopy families. In this case we did not set a probabilistic filter or an arbitrary cutoff in the fold change values, aiming to keep a larger set of genes with data in both studies to build the coexpression profiles. The clustering and heatmaps were obtained from the Broad Institute Morpheus web server using row and column clustering by Pearson correlation (79). An arbitrary cutoff was taken from the observation of the distance matrix that resulted in 8 groups of putative coregulated genes.
Data availability.
Raw sequences obtained were deposited at SRA in the BioProject PRJNA704643 (https://www.ncbi.nlm.nih.gov/sra/PRJNA704643). The MS/MS raw files were deposited into the Peptide Atlas repository and can be accessed at http://www.peptideatlas.org/PASS/PASS01658.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2021 Chávez et al. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
ABSTRACT
IMPORTANCE
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer