Introduction
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) promotes several dysfunctions in human immune responses while triggering a broad spectrum of clinical presentations that range from asymptomatic infection to a mild, moderate, or sometimes lethal severe symptomatology (Ge et al., 2020; The, 2020; Wu and McGoogan, 2020). Convalescent patients report prolonged COVID-19 symptoms beyond the time course for typical cold and flu events, which highlights the possibility of long-term tissue damage generated by this virus (Ladds et al., 2020; Nalbandian et al., 2021; Ryan et al., 2022; Subramanian et al., 2022). Similarly as other respiratory viruses, it is known that SARS-CoV-2 triggers an immune response involving the recruitment, activation, and differentiation of innate and adaptive immune cells (Newton et al., 2016; Shen et al., 2023; Wauters et al., 2021). For mildly ill patients, these coordinated immunological efforts resolve infection but for unknown reasons the virus evades these responses in severely ill patients to produce life-threatening COVID-19 (Park, 2020; Rashid et al., 2022; Sun et al., 2022; Thorne et al., 2022). The genetic background and physiological health of individual patients certainly plays a major role in the clinical presentation of COVID-19 but the exact mechanisms of how the virus evades innate and adaptive responses is not known, or why there is such great variability in severity of clinical presentation among patients. This information is critical for developing new diagnostics that detect patients who will eventually progress to severe COVID-19 before respiratory failure ensues, and furthermore, provide host and virus targets to engineer effective treatments (Li et al., 2021b; Samadizadeh et al., 2021).
Biomarkers linked to COVID-19 severity hold promise for detecting patients that will eventually develop severe COVID-19 (Janssen et al., 2021; The, 2020). In this context, blood-derived cues were associated with severe COVID-19, including an imbalance in immune cell populations that included neutrophil abundance, lymphopenia, myeloid dysfunction, and T cell activation/exhaustion (Ahern et al., 2022; Chen and John Wherry, 2020; Mann et al., 2020; Wauters et al., 2021). The differential expression of select chemokines and their receptors (Khalil et al., 2021) with associated cytokine storm drove monocyte and megakaryocyte dysfunction in severely ill patients (Ren et al., 2021). Comprehensive knowledge of host immune responses against SARS-CoV-2 is still limited but these divergent cell profiles implicated cell-to-cell signaling events occurring between the innate and adaptive cell compartments as critical for the progression of severe COVID-19 (Daamen et al., 2021; Rabaan et al., 2023; Wang et al., 2022).
One approach to identifying changes in immune responses affiliated with severe COVID-19 is to monitor autocrine, paracrine, and endocrine signaling in individual patients over time. Temporal events associated with each type of signaling is obviously difficult to disentangle from measuring the activities of circulating peripheral cells alone because there are distinct events happening in localized microenvironments, for example, the spleen and lymph nodes. A complementary tactic to access information about these events is to monitor gene expression for the synthesis of chemokines and cell-associated receptors as a proxy of biochemical events happening in distinct immune effector cells. Based on current knowledge, we hypothesized that critical events occurring at the earliest stages of infection necessary for effective viral clearance are either perturbed or disrupted so as to promote cytokine storm and other pathologies associated with severe outcomes. We predicted these pathological immune events may be observable by measuring changes gene expression reflecting activities in distinct effector cells during the first weeks of infection (Ahern et al., 2022; Bernardes et al., 2020; Notarbartolo et al., 2021; Xiong et al., 2020; Zheng et al., 2020). However, these types of experiments require careful design because the type and quantity of all immune responses are dynamic during infections and comparing poorly matched peripheral blood mononuclear cells (PBMCs) may confound identification of bonafide immune dysregulation evident between patients (Bernardes et al., 2020; Notarbartolo et al., 2021; Zheng et al., 2020).
Here, we designed a longitudinal comparison between mild and severe patients, choosing the appropriate samples according to the clinical progression and the unbiased gene expression profile to study how changes in gene expression in distinct immune effector cells changed during the earliest time points since peak of symptoms and during progression of clinical disease. We repeatedly measured whole-transcriptome profiles of PBMCs from the same cohort of mildly and severely ill patients to identify molecular pathways that were enriched during the clinical trajectory of COVID-19 over time. Briefly, to gain more insights into our findings and get more comprehensive knowledge of the phenomenon, we used a pairwise comparison of gene expression, gene set enrichment, and weight-correlated gene network analyses. By doing so, we identified pathways of genes involved with the natural killer (NK) cell cytotoxicity enriched in mild patients when compared to severe. Besides focusing on a particular molecular pathway, we investigated the interactions to better comprehend the underlying phenomena of a successful immune response, contributing to an integrated point of view throughout the transcriptomic analyses of functional pathways to mitigate potential biases attributed to focusing the study on a single pathway. In this regard, we revealed that the NK signaling pathway was intricately related to other transcriptional circuits, such as those governing Th1/Th2 cell differentiation and cytokine–cytokine receptor signaling pathways. These interactions highlight the importance of these pathways as bridges between the innate and adaptive immune responses throughout the disease, implying that the innate NK signaling pathway (cell cytotoxic activity) is beneficial, and possibly a critical activity required to effectively eradicate coronavirus. We also observed that an adaptive immune response including early cell-mediated immunity was significant in lowering disease severity. The link between the primary innate NK cell activity and the transcriptional priming of adaptive Th1 and Th2 cell responses appears to be more robust in mild patients than in severe. This work provides clear guidance to develop better medical practices and prevention tactics against SARS-CoV-2 and other related infectious respiratory virus (Haitao et al., 2020; Ponti et al., 2020; Zhang and Guo, 2020).
Results
Clinical features and temporal gene expression patterns in SARS-CoV-2 infected patients
A total of 22 peripheral blood samples were obtained from eight COVID-19 patients. These samplings following a longitudinal schedule complemented with two samples from healthy donors. All patients were recruited after an average period of 5 days after symptoms onset (Figure 1A). Some samples were taken from patients at the Hospital of Osorno and the Hospital of Puerto Montt, which are cities located in the region Los Lagos. The remaining samples were collected from patients at the Hospital Base and Clínica Alemana in Valdivia, a city located in the Region Los Ríos. All infections occurred between November 2020 and May 2021 (Table 1).
Figure 1.
Clinical profile and gene expression patterns for mild and severe COVID-19 patients during 28 days.
(A) Longitudinal sampling schedule for severe and mild COVID-19 donors of peripheral blood (22 samples in total from 8 donors). Three sampling times (black dots) are displayed with respect to the recruitment day (D0). In addition, panel A shows the diagnosis day with positive PCR (green star), symptoms onset day (orange star), and hospitalization day (red cross). (B) The COVID-19 progression according to the WHO ordinal scale describes the temporal disease severity for each donor. Severe patients (S1–S4) are displayed in lines with colors scaling from green to red, while mild patients (M1–M4) are shown in blue line colors. The
Figure 1—figure supplement 1.
Heatmap of temporally and differentially expressed genes over the course of COVID-19 progression.
At the top, each column corresponds to the sampling points (0, 7, and 28 days since recruitment) of mild and severe patients. Genes are displayed as horizontal rows and are clustered by the similarity of expression profiles, represented by the dendrogram to the left of the heatmap. To the right of the heatmap, yellowish color indicates higher expression, while bluish color means lower expression represented by the
Figure 1—figure supplement 2.
Volcano plot depicting pairwise gene expression comparisons for detected differentially expressed genes (DEGs) between mild and severe COVID-19 patients at D0, D7, and D28 (A, B, and C, respectively).
Red or blue indicates genes that were significantly over- or sub-expressed at a particular sampling time, based on filtering by FDR ≤0.05 and the absolute value of −log10 (FC) (≥2.0). The remaining genes that do not show differential expression are indicated in gray. FDR = false discovery rate; FC = fold change.
Table 1.
Clinical characteristics of the cohort.
Clinical characteristics | Mild | Severe | All |
---|---|---|---|
Sex, | (2/2) | (2/2) | (4/4) |
Total, | 4 | 4 | 8 |
Median age, years ± SD | 39.0 ± 3.9 | 46.7 ± 8 | 42.9 ± 7 |
Days from onset of symptoms to recruitment, median ± SD | 1.2 ± 1.3 | 10.0 ± 1.8 | 5 ± 4.7 |
Days from COVID-19 diagnosis to recruitment, median ± SD | 3.0 ± 0.7 | 8.5 ± 2.1 | 6 ± 3.4 |
Symptoms, | |||
Fever | 1 (25%) | 3 (75%) | 4 (50%) |
Chills | 1 (25%) | 2 (50%) | 3 (37.5%) |
Fever feeling | 1 (25%) | 2 (50%) | 3 (37.5%) |
Odynophagia | 2 (50%) | 1 (25%) | 3 (37.5%) |
Cough | 1 (25%) | 4 (100%) | 5 (62.5%) |
Expectoration | - | - | - |
Dyspnea | - | 4 (100%) | 4 (50%) |
Thoraxic pain | 1 (25%) | 1 (25%) | 2 (25%) |
Diarrhea | 1 (25%) | 2 (50%) | 3 (37.5%) |
Anosmia | 1 (25%) | - | 1 (12.5%) |
Ageusia | 1 (25%) | - | 1 (12.5%) |
Myalgia | 1 (25%) | 2 (50%) | 3 (37.5%) |
Headache | 2 (50%) | 1 (25%) | 3 (37.5%) |
Treatment | |||
Hospitalization, | - | 4 (100%) | 4 (50%) |
Intubation, | - | 4 (100%) | 4 (50%) |
Mechanical ventilation, | - | 4 (100%) | 4 (50%) |
Samples, | 12 | 10 | 22 |
A crucial issue with longitudinal studies is defining an appropriate sampling schedule that provides a reasonable comparison between patients during the time course of naturally occurred infections. To align the comparability and consistency of data measured between patients, we designed a protocol consisting of three donations per patient to monitor events occurring during both acute infection and the recovery phase. We collected peripheral blood samples on days 0, 7, and 28 (D0, D7, and D28) during the peaks of symptoms (Figure 1B).
Clinical features of COVID-19 patients with mild and severe symptoms were determined by medical personnel at the hospitals mentioned above and used to describe their disease trajectories over time using the WHO ordinal scale (Ahern et al., 2022, Figure 1B). In contrast to mild patients, all four severe patients experienced symptoms such as fever, cough, headache, chills, diarrhea, myalgia, and dyspnea (Table 1). These patients received mechanical ventilation on sampling D0. In general, symptoms from severe and mild patients diminished gradually up to D28 after recruitment, with the exception of one mild and another severe patient who still experienced mild symptoms.
Mild and severely ill patients displayed different transcriptional programs at the beginning of disease onset. To determine the gene expression profiles of each patient over the time of disease progression, we developed an RNA-seq approach that takes advantage of the longitudinal sampling scheme. By using all expressed genes, we performed a principal component analysis to unbiasedly compare the transcriptional signatures of each patient and two healthy donors. All peripheral blood samples from severe patients on D0 (represented by red circles) were widely dispersed over the left of principal component 1 (PC1) (Figure 1C), in contrast with mild patients on D0 (represented by blue circles), suggesting that both groups of patients displayed different transcriptional programs at the beginning of the disease.
The transcriptional profiles of severely ill patients changed during the recovery phase to be consistent with that observed in mildly ill patients. Gradually, along with disease progression and medical treatments, samples from severely ill patients shifted to the right of PC1 (D7 and D28). Interestingly, on D28, when the majority of patients had recovered, samples from severely ill patients were pooled together with those mild patients who had already recovered. These observations indicated that despite the transcriptional profiles being closer to that of mild patients at D28 as compared to D0, severely ill patients still exhibited higher variability between themselves and controls (Figure 1C). In contrast, every mild COVID-19 patient was separated from the severe group on D0 and D7. Notably, only one mild COVID-19 patient (M1) clustered with severe patients at D0. This donor showed a broader set of symptoms over time when compared with the rest of mild patients (Figure 1B). This evidence indicated a dissimilar transcriptional response in that specific patient at the onset of disease. Over time, and after medical treatments, the transcriptional program of this patient shifted to be consistent with the other mild patients (Figure 1C).
The timing of COVID-19-related gene expression differed between mild and severely ill patients. We focused on the temporal variation of gene expression to identify differentially expressed genes (DEGs) associated with COVID-19 progression. We found statistically significant differences in the timing of differential gene expression between mild and severely ill individuals (Figure 1D and figure supplement 1). We observed that severe patients displayed a transcriptional response completely different from that of mild patients at the sequential time points of D0 and D7 (Figure 1D). Previous longitudinal studies identified molecular markers associated with severe COVID-19 (Bernardes et al., 2020; Notarbartolo et al., 2021; Zheng et al., 2020). We detected these same molecular markers in our severely ill cohort (Figure 1D). The expression profiles of those genes varied significantly between mild and severe patients. For instance, the expression of MMP9 metalloproteinase (Zheng et al., 2020), S100A8/A9 alarmins (Bernardes et al., 2020), PADI2 (Notarbartolo et al., 2021), and IL18Rap peptidyl-arginine deiminases (Masood et al., 2021; Schultze and Aschenbrenner, 2021) were higher in severe patients on D0 than mild or control patients (Figure 1D). In addition, we found that IFNG, CCL2, and CXCL10 cytokines, which were previously described as molecular markers in severely ill patients (Sette and Crotty, 2021; Vabret et al., 2020), displayed low expression in our severe COVID-19 patients in comparison with mildly ill patients during the progression of disease (Figure 1D).
The immune response of mild and severe patients is activated differentially during the acute phase of the COVID-19 infection
Most of the variations observed in the gene expression profiles of mild and severely ill patients occurred during the acute phase of disease. We performed pairwise gene expression comparisons between mild and severe patients and found DEGs mainly on D0 and D7. On D0, we found a total of 812 DEGs including 298 upregulated and 514 downregulated genes (Figure 1—figure supplement 2). On D7, the number of DEGs was similar to D0, with 319 genes showing higher expression and 563 genes with lower expression. We found no differential gene expression between mild and severe patients at D28, supporting the interpretation that most imbalances in the gene expression profiles in the PBMCs of severely ill patients leveled out by D28 (Figure 1—figure supplement 2).
Functional pathways involved with humoral immunity were enriched in severely ill patients during the acute phase compared to pathways involved with cell-mediated immunity in mild patients. The above results provided only a course overview of the transcriptional responses during COVID-19 progression. We expanded our focus to detect molecular mechanisms and pathways involved in the immune responses of all patients by linking functional pathways to deferentially expressed genes (DEGs) detected between severely ill, mildly ill and control patients. We used a twofold change in gene expression level as a threshold to identify DEGs between mild and severe patients on D0 and D7. We found upregulated expression for genes involved in biological processes that included the T receptor signaling pathway, positive regulation of T cell activation, and regulation of leukocyte cell adhesion in mild COVID-19 patients at D0 (Figure 2A). We observed genes involved with immunoglobulin production, antigen receptor-mediated signaling pathway, and adaptive immune response were upregulated at D7 (Figure 2B). In contrast, we observed enrichment of gene expression in pathways involved with complement activation, humoral immune response mediated by circulated immunoglobulin, and defense response to bacterium on D0 in severe COVID-19 patients. Furthermore, DEGs in functional pathways mediating hydrogen peroxide metabolic processes, cellular oxidant detoxification, and reactive oxygen species were enriched on D7 of infection in this group (Figure 2A, B). Biological pathways consistent with a robust lymphocyte cellular immune response were enriched on D0 in mild patients. This functional profile is distinctly different to the antibody/complement-dependent humoral immune responses observed in severely ill individuals at the same time point (Figure 2A). Nonetheless, differential expression of genes associated with immunoglobulin function were mainly enriched in mild patients at D7 (Figure 2B), while severe patients showed enrichment for genes related to inflammation, reactive oxygen species, and responses against bacteria at that time of infection (Figure 2B).
Figure 2.
Gene set enrichment analysis (GSEA) shows biological processes and enriched pathways associated to innate and adaptive immune response after severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection.
(A, B) Bubble plots show the biological processes (BP) of enriched genes for differentially expressed genes (DEGs) between mild and severe for D0 and D7 after recruitment, respectively. (C, D) Bubble plots show the Kyoto encyclopedia of gene and genomes (KEGG) enriched pathways of enriched genes for DEGs between mild and severe for D0 and D7, respectively. The
In addition to enriched biological processes, we also focused on Kyoto encyclopedia of gene and genomes (KEGG) pathway enrichment among DEGs at D0 and D7 after COVID-19 infection. On D0, mild and severe patients showed considerable differences in terms of the innate response, with the NK-mediated cytotoxicity pathway enriched in mild-infected patients, while neutrophil extracellular trap formation was enriched in severe ones (Figure 2C). Furthermore, DEGs associated with the antigen processing and presentation pathways are enriched in mild COVID-19 patients, in contrast with the enrichment of complement and coagulation cascade pathways in severely ill patients (Figure 2C). On the other hand, on D7 of the COVID-19 infections, NK cell-mediated cytotoxicity is one of the main enriched pathways in the mild-infected group, whereas IL-17 signaling is the most significant pathway in the severe-infected group (Figure 2D). This finding is remarkable because besides COVID19, IL-17 is affiliated with other clinical pathophysiologies in which a dysregulation between innate and adaptive immune responses such as myocarditis and lupus (Lee et al., 2019; Rangachari et al., 2006; Sadeghi et al., 2021).
Taken together, we show that there are distinct transcriptional responses along the COVID-19 progression, which suggest that immune responses to SARS-CoV-2 infection occur differently in individuals; thus, there might exist a differential imprinting associated with the severity of the COVID-19 infection.
Higher expression of NK cell hub-genes is a core event of acute phase that distinguishes mild from severe symptoms in COVID-19 individuals
Given that our findings pointed out changes in the immune response after SARS-CoV-2 infection of the patients cataloged as mild and severely ill, we decided to uncover molecular pathways that might be responsible of the differences observed between patient groups during COVID-19 progression. To do so, we first identified genes that were differentially expressed between severity groups, and second, we chose only those that also showed changes in their trajectories across sampling times. In doing so, we found 828 genes that exhibited temporal differences in expression level during disease progression. Then using the Enrichr platform, we discovered additional biological processes and KEGG pathways that were differentially enriched during the COVID-19 progression in mild and severe patients (Figure 3). For instance, mild-infected patients exhibited expression of genes involved in kinase activity, enzyme-linked receptor activity, and apoptotic process not only at D0 (acute phase) but also at D7 (middle phase) (Figure 3A, C). In contrast, severely ill patients exhibited high level expression of genes involved in neutrophil activity. This observation was the most notorious outcome elicited by SARS-CoV-2 during acute COVID-19 in this group (Figure 3B, D, F). We observed that NK cell cytotoxicity was the most enriched pathway among the temporal and differential expressed genes in mildly ill patients during the acute phase (Figure 3B, C). Among these enriched genes, we found abundant membrane receptor genes that included
Figure 3.
Biological processes and Kyoto encyclopedia of gene and genomes (KEGG) pathway for genes with differential expression levels over time in mild versus severe patients.
Bar plots show gene ontology analysis for biological processes in light-blue/dark-blue and KEGG pathway in light-green/dark-green, using upregulated genes (A, C, E) and downregulated genes (B, D, F). The size of each bar is according to its −log10 p-value, and name pathways with asterisk indicates a
To confirm the importance of the differentially enriched pathways between mild and severe COVID-19 patients, we focused on analyzing the context of gene–gene interactions (Figure 4A and Figure 4—figure supplement 1) and changes in their quantitative expression levels overtime graphed as a heatmap (Figure 4B). The genes displayed in this KEGG pathway graph represent the upregulated genes (red boxes) in mild patients and their interactions involved in NK cell-mediated cytotoxicity (Figure 4A). Interestingly, all these genes showed overtime trajectories with high levels on days 0 and 7 in mild patients. These gene expression levels became roughly equivalent by D28 in both the mild and severe groups (Figure 4B). Complementing these observations, we constructed a protein–protein interaction (PPI) network using only upregulated genes during the early phase (days 0 and 7), followed by a clustering process that detected proteins with more significant interactions among the selected genes (Figure 4C). Notably, we detected KLRD1, CD247, and IFNG as central nodes of PPI networks. This finding makes sense because these proteins exhibit numerous interactions with other proteins involved in activating or inhibiting NK cell cytotoxicity (e.g., KLRC1, KLRC3, and KIR3DL2), as well as Th1/Th2 cell differentiation (CD4) and cytokine-cytokine receptor interaction (IL5RA and IL2RB). In Figure 4D, we show the comparative trajectories of these node genes between both groups of severity. Interestingly, we found a convergence of KLRD1 and CD247 genes on D28, while IFNG remained differentially expressed between patient groups.
Figure 4.
Gene function network for natural killer (NK) cell hub-genes with differential expression levels between mild and severe patients during acute phase of COVID-19.
(A) Kyoto encyclopedia of gene and genomes (KEGG) pathway of NK cell-mediated cytotoxicity represents the set NK cell hub-genes upregulated (red-boxes) in mild versus severe patients. The green boxes correspond to genes without differential expression. (B) Heatmap shows the differential expression levels of NK cell hub-genes over time (D0, D7, and D28 after recruitment) separated by mild and severe groups. The expression levels are represented by the
Figure 4—figure supplement 1.
Kyoto encyclopedia of gene and genomes (KEGG) graphs show genes differentially expressed over time of the Th1 and Th2 cell differentiation pathway (A) and the cytokine–cytokine receptor interaction pathway (B).
Red boxes depict upregulated genes in mild COVID-19 patients during the acute phase of the disease, whereas green boxes depict genes without significant differential gene expression within the KEGG pathways.
Figure 4—figure supplement 2.
Gene ontology (GO) and Kyoto encyclopedia of gene and genomes (KEGG) and REACTOME pathway analyses of differentially expressed genes (DEGs) found in the pairwise comparison between D0 and D7 of COVID-19 infection.
Bar graphs showing the enrichment of GO biological processes (in blue color), GO molecular functions (in purple color), KEGG pathways (in dark green color), and Reactome pathways (in light green color) between mild and severe COVID-19 patients. Enrichment results are sorted by −log10(p
Figure 4—figure supplement 3.
Protein–protein interaction (PPI) network graphs show the upregulated genes found in the pairwise comparison (D0 and D7) in mild versus severe COVID-19 patients.
(A) Topological representation of the PPI network of upregulated genes in mild patients on D0. (B) Topological representation of the PPI network of upregulated genes in mild patients on D7. Some nodes are color-coded to highlight proteins involved in the following pathways: Cytokine–cytokine receptor interaction (blue), natural killer (NK) cell-mediated cytotoxicity (red); and Th1 and Th2 cell differentiation (yellow).
Figure 4—figure supplement 4.
Gene ontology (GO) and networks of enriched pathway analyses of differentially expressed genes (DEGs) found in the pairwise comparison between D7 of mild patients and D0 of severe patients with COVID-19.
Bar graphs show the enrichment of GO terms for biological processes while the networks reveal Kyoto encyclopedia of gene and genomes (KEGG) enriched pathways for upregulated genes (A, B) and downregulated genes (C, D) in mild patients, respectively. The size of each bar is according to its −log10 p-value, and name GO terms with asterisk indicates a
Once we identified the trajectories of NK cell hub-genes participating in COVID-19 progression, we asked whether there were any DEGs (adj. p ≤ 0.05 and log2-fold change ≥2.0) obtained from a pairwise comparison of mildly and severely ill patients at days 0 and 7 that would have been left out from the longitudinal analysis. Given that the number of DEGs at each time point is higher when compared to the list of genes exhibiting differential trajectories, we performed a gene ontology (GO) and pathways analysis with the new set list of genes (Figure 4—figure supplement 2). The main result showed that NK cell-mediated cytotoxicity was predominant on D7. This finding reinforced the interpretation that there is a dysregulation of innate immunity, as previously suggested in severe patients (Paludan and Mogensen, 2022), with an over-representation of neutrophil activation. The results shown in Figure 4—figure supplement 3 and Figure 4—source data 1 summarize the pathway and PPI network analysis for these genes on D0 and D7, respectively, and show the predominant enrichment of NK genes. Taken together, these data are consistent with an active and regulated innate NK cytotoxic immune response mounted during the acute phase of infection in mild COVID-19 patients. This observation contrasts with the humoral- and neutrophil-biased response observed in severely ill individuals.
Previous comparisons were done with the assumption that both cohorts were at the peak of their symptoms on D0. However, taking into account the delta at the days of symptoms onset, we also analyzed the pairwise comparison for D7 in mild patients, but this time comparing it to D0 in severe patients. Figure 4—figure supplement 4 highlights GO terms related to NK cell response and enriched pathways for NK cell cytotoxicity in mild patients. This result supports the idea that the transcriptional program differs between mild and severe individuals with the outstanding contribution of NK cells in mild patients. Otherwise, severe patients exhibit an inflammatory response reflected by the complement and coagulation pathways, which is according to our previous analysis.
Gene co-expression identifies NK hub-genes linked to the innate and adaptive immune response of mild COVID-19 patients
We identified genes that were coordinately expressed during COVID-19. We developed a weighted gene correlation network to simultaneously analyze all peripheral blood samples collected from patients during the longitudinal protocol and those from heathy donors to identify genes with coordinated expression. By using a differential co-expression approach, we identified 10 modules of co-expressed genes (Figure 5A). We then used these networks to correlate each module with available clinical information of the patients by calculating the module significance (MS) for each module–trait correlation. Not surprisingly, we found that most module eigen genes grouped according to the degree of COVID-19 infection (i.e., mild or severe patients) (Figure 5B). Among co-expressed gene modules, we focused on three modules that contained the largest number of genes. These modules correspond to blue (704 genes), brown (508 genes), and turquoise (712 genes). The blue and brown modules, which are correlated positively with mild patients (Figure 5B), were enriched with genes related to T cell activation and platelet function, respectively (Figure 5C). In contrast, the turquoise module, which was correlated positively with severe COVID-19 patients (Figure 5B), was enriched with genes related to neutrophil activation and inflammatory responses (Figure 5C).
Figure 5.
Gene co-expression network analysis among the longitudinal transcriptomic profiling.
(A) Gene hierarchical clustering dendrogram of 10 detected modules based on Topological Overlap Matrices (TOM) measure. The branches and color bands represent the assigned module. (B) Module–trait relationships (MTRs) between detected modules and disease severity of COVID-19. MTRs are obtained by calculating Pearson correlation between the traits and the module eigengenes. The red and blue colors indicate strong positive or negative correlations, respectively. Rows represent module eigengene (ME) and columns indicate the disease severity of COVID-19, where correlations from each patient are shown from left to right according to their sampling time (D0, D7, and D28). (C) Gene ontology (GO) enrichment analysis of genes in the blue, brown, and turquoise modules. The color in the bar graphs refers to the module eigengene (ME). Enrichment results are sorted by −log10(p-value) (higher on top) of each biological process GO term. (D–F) Network visualization of enriched pathways based on co-expression genes from blue, brown, and turquoise modules, respectively.
Figure 5—figure supplement 1.
Gene ontology (GO) analysis of the seven smallest modules of co-expression.
The genes from the modules red, black, magenta, green, pink, yellow, and gray were used to study GO biological process terms. GO terms with statistically significant
Figure 5—figure supplement 2.
Network for genes of enriched pathways from the seven smallest modules of co-expression.
Genes from modules red (A), black (B), pink (C), yellow (D), green (E), magenta (F), and gray (G) were used for analysis and the networks show the most enriched pathways.
As shown thus far, the Th1/Th2 cell differentiation pathway was relevant in the immune response of mild COVID-19 patients, and because the blue module is enriched in lymphocyte-based immune response genes, we performed a gene–gene network analysis to determine how the genes from this module might have worked in the context of an adaptive immune response. In this analysis, we found genes belonging to the NK cell-mediated cytotoxicity pathway grouped together with the cytokine–cytokine receptor interaction and Th1/Th2 cell differentiation pathways (Figure 5D). Furthermore, these genes were previously identified as differentially expressed in the NK cytolytic pathway, like
The remaining seven modules were analyzed for GO (Figure 5—figure supplement 1) which reveals different aspects related to the immunological response to viral infection. In addition, we searched for enriched pathways and displayed the results in the form of a network graph (Figure 5—figure supplement 2) that can be further investigated in future studies.
Discussion
We systematically analyzed transcriptomic features of PBMCs from COVID-19 patients with mild and severe symptoms at three sequential time points (D0, D7, and D28) during the peak of the symptoms. Our longitudinal analysis revealed key temporal features of immune responses that distinguished mild from severe patients during acute disease. We observed a prominent role of NK cell-mediated cytotoxicity function pathways during COVID-19 progression. These pathways include genes such as
In our NK cell gene hub, we recognized activating (
Other genes located in the NK cell hub included membrane proteins such as
Until now, we have discussed relevant genes involved in immune pathways enriched in mild or severe COVID-19 progression. However, we also decided to look for genes exhibiting coordinated gene expression patterns across all our samples. We found that one module (blue module), which has a strong positive correlation with mild patients, included genes involved with metabolic pathways regulating T cell activation, kinase activity, and antigen presentation. In contrast, the turquoise module, which exhibited a strong positive correlation with severely ill individuals, contained genes associated with neutrophil-related biological processes. This finding was indicative of an opposed early fate of innate immune responses between mild and severe COVID-19 cases. Neutrophil long-term differential enrichment seen across severe cases could be related to other repercussions of SARS-CoV-2 infection, like neutrophil-induced platelet aggregation (Jevtic and Nazy, 2022). Consistent with this interpretation, dysfunction of platelets has been associated with abnormal clot formation in severe COVID-19 cases (Litvinov et al., 2021). In this sense, our results show that the brown module, which is negatively correlated with severe patients, displays biological processes linked to platelet degranulation activity and negative regulation of aggregation. Hence, these results are consistent with a platelet dysfunction pathology linked to severely ill patients.
We performed a pathway enrichment analysis to understand how the positive correlation of genes in the blue co-expression module related to immune response functions in mildly ill patients. Not surprisingly, the main pathways enriched included Th1/Th2 cell differentiation, cytokine–cytokine receptor interaction, antigen processing, and NK cell-mediated cytotoxicity pathways. Albeit the co-expression analysis included all samples, regardless of the severity of the disease or the longitudinal sampling. This effort revealed transcriptional programs of immune response that were consistent with the profiles detected in mild and severe patients reported in our previous investigations. This evidence supports the idea that the transcriptional regulation of cell-mediated immunity in mildly ill patients is more robust than that observed in patients with severe clinical progression.
In order to identify differences in transcriptional programs associated with mild or severe outcomes, we carefully compared changes in gene expression during the acute phase. This analysis detected a broader list of genes than those found using the longitudinal analysis alone. This accomplishment was resulted from only considering the differences in gene expression between mild and severe groups, independently of quantitative changes in gene expression overtime. These results consistently showed a NK cell hub of genes being differentially expressed in mild patients. Importantly, we found novel DEGs including
Complementary to this scenario of NK cell activating receptors being important in mitigating symptom severity as previously reported (Gardiner, 2008), we found that the
Another NK cell activating receptor (the NKG2C protein) encoded by the
Another important observation detected in our comparative analysis included enrichment of the IL-17 pathway in severe patients on D7. Our analysis identified a group of 10 genes (
Even though the primary criterion for comparing mild with severe was the peak of symptoms in earlier analyses, we also investigate according to the time of the symptoms onset. As a result, we also performed a complement analysis comparing mild-D7 to severe-D0 to identify the enriched pathways that represent this comparison. Notably, the most important GO terms and enriched pathways in mild patients correspond to NK cell cytotoxicity, represented by the same set of genes reported in previous comparisons. This lends significant support to the primary findings of our study, which show that a transcriptional program is differentially regulated in mild patients and is focused on innate immune response associated with NK cell cytotoxicity.
In conclusion, the longitudinal trajectories of gene expression, the differential GO/pathways, and PPI analyses, together with the co-expressed gene–gene correlation network is consistent with the existence of a regulatory transcriptional program linked to an early activation of NK cell cytotoxicity in mild COVID-19 patients. This work establishes the notion that innate immune responses are crucial for the progression of COVID-19 severity, and reinforces the importance of the NK cell cytotoxicity pathway in distinguishing between mild and severe COVID-19 progression. Taken together, these differential responses are complemented by cytokine activities and Th1/Th2 cell differentiation programs indicating a well-regulated crosstalk between innate and adaptive immune responses in the mild COVID-19 progression.
Materials and methods
Patient cohort and PBMCs sampling
We recruited a total of eight patients diagnosed to be suffering from COVID-19 who were separated into two groups, one composed of four mild outpatients and another, conformed of four severe hospitalized individuals. Peripheral venous blood samples were obtained by using the venipuncture technique in Vacutainer K2 ethylenediamine-tetraacetic acid (EDTA) tubes (BD, USA) from each patient three times, including two clinical stages (acute phase and convalescence). PBMC was isolated from each fresh heparinized peripheral blood sample, through density gradient centrifugation on Ficoll-Paque Plus (GE Healthcare Life Sciences, USA) by centrifuging at 1600 rpm for 30 min (using minimum acceleration and no deceleration configurations). PBMC-containing fraction was then washed two times with 2 mM EDTA in phosphate-buffered saline (PBS) and stored in RNAlater solution (Sigma, USA) at −20°C until RNA extraction. The detailed clinical features of all patients and the detailed sampling time are shown in Figure 1 and Table 1. All samples were processed in a qualified BSL-2 laboratory and, according to protocols and approval from Institutional Review Boards, CEC-SSLR Ord N°226 and Ord N°399. Written informed consent was received before the participation of each patient.
RNA extraction, library preparation, and PBMC transcriptome sequencing
Total RNA extraction was performed from PBMC by Diagenode (Belgium). RNA samples were quantified using QubitTM RNA BR Assay Kit (Thermo Fisher Scientific, USA) and secondly checked for integrity using HS RNA Kit (Agilent, USA) on a Fragment analyzer system (Agilent, USA). The library preparation was performed using NEBnext ultraII Directional Kit and sequencing of the samples was performed on an Illumina NovaSeq 6000 instrument producing 150 bp paired-end reads running Control Software 1.7.0.
Identification of DEGs along COVID-19 progression and between disease severities
Sequencing-quality check was performed using FastQC (Andrews, 2010), and low-quality reads were trimmed using Trim_Galore! (Krueger, 2019) with
Gene set enrichment analysis
In order to perform gene set enrichment analysis (GSEA) (Schultze and Aschenbrenner, 2021), after DEGs analysis only values of log2 fold change equal to or greater than 1 were considered. Then the GSEA was performed through ClusterProfiler package (v.3.16) (Yu et al., 2012) in R, we evaluated the significantly enriched biological process (GO) using gseGO function and pathways from KEGG using gseKEGG function.
Enrichment analysis of DEGs along COVID-19 progression and between mild and severe patients
GO and pathways enrichment analyses were performed with both upregulated and downregulated genes using Enrichr platform (Xiong et al., 2020). Significant GO terms (Biological processes and Molecular functions) and pathways (KEGG and Reactome) were calculated from the adjusted p-value (
Weighted correlation network analysis
Based on the assumption that DEGs may explain transcriptional differences observed between mild and severe COVID-19 patients. Read counts from DEGs among all samples were selected as a reference set for construction of a weighted gene co-expression networks and modules detection. Co-expressed gene modules were constructed using WGCNA R package v1.71 (Langfelder and Horvath, 2008) under a signed networks approach because it provides a better understanding of molecular regulatory mechanisms at the systemic level, facilitating better separation of modules in terms of biological performances. To do so, we removed outliers using the adjacency function and a standardized connectivity score of <−2.0; then we used the pickSoftThreshold function to identify the soft thresholding power
After the modules were identified, the module eigengene (ME) was summarized by the first principal component of the module expression levels. Module–trait relationships were estimated using Pearson correlation between MEs and disease severity. To evaluate the correlation strength, we calculated the MS that is defined as the average absolute gene significance of all genes involved in a module.
PPI network
All genes with differential trajectories over time (#827 genes) were included to construct a PPI network by STRING V11.5 (Szklarczyk et al., 2021). After clusterization, we focused our analysis on the principal cluster (one of three) with the highest connectivity between genes and highlighted the genes involved in the most significant pathways according to KEGG.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2024, Medina et al. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Patients present a wide range of clinical severities in response severe acute respiratory syndrome coronavirus 2 infection, but the underlying molecular and cellular reasons why clinical outcomes vary so greatly within the population remains unknown. Here, we report that negative clinical outcomes in severely ill patients were associated with divergent RNA transcriptome profiles in peripheral immune cells compared with mild cases during the first weeks after disease onset. Protein–protein interaction analysis indicated that early-responding cytotoxic natural killer cells were associated with an effective clearance of the virus and a less severe outcome. This innate immune response was associated with the activation of select cytokine–cytokine receptor pathways and robust Th1/Th2 cell differentiation profiles. In contrast, severely ill patients exhibited a dysregulation between innate and adaptive responses affiliated with divergent Th1/Th2 profiles and negative outcomes. This knowledge forms the basis of clinical triage that may be used to preemptively detect high-risk patients before life-threatening outcomes ensue.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer