This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
Liver cancer is the fifth most common cancer and the fourth leading cause of cancer-related death worldwide [1]. HCC is the most common type of liver cancer, accounting for about 75%–85%, and has the characteristics of a high mortality and high metastasis and recurrence rate [2]. Studies have shown that genetic mutations, chromosomal aberrations, molecular signaling pathways, and epigenetic dysregulation are all associated with the development of HCC [3]. At present, in addition to traditional surgical resection, radiofrequency ablation, transarterial chemotherapy, and other methods have been developed for the treatment of liver cancer [4]. Undeniably, surgical resection is still the most effective treatment for HCC. However, due to the insidious onset of HCC, many patients have already lost the opportunity for surgery when they come to the clinic. Even with surgical resection, the 5-year recurrence rate is as high as 70% [5], and the 5-year overall survival (OS) rate is only 15%–19% [6].
It is well known that the tumor microenvironment plays a crucial role in the occurrence and development of tumors. The interaction between various signaling molecules in the microenvironment is also a hot topic in tumor-related research. The occurrence of HCC is closely related to the inflammatory response of the environment, and 90% of HCC is associated with inflammation [7]. In the state of liver inflammation, the dysregulation of the interaction between cytokines, chemokines, and growth factors is an important cause of liver cancer [8, 9]. The original research on the IL-6, IL-1, and TGF-beta inflammatory molecules based on recent years for immune checkpoint research to further explore the development mechanism of HCC provided new insights. Studies have found that immune checkpoint molecules, such as programmed death-1(PD-1), cytotoxic T-lymphocyte antigen 4 (CTLA4), lymphocyte activating gene 3 protein (LAG-3), and mucin domain molecule 3 (TIM-3), are upregulated on liver cancer cells and tumor-specific T cells, which can lead to CD8+ T cell apoptosis and poor prognosis in patients [10]. At the same time, we cannot ignore that there are a large number of proangiogenic factors produced by cancer cells or tumor-infiltrating lymphocytes or macrophages in the tumor microenvironment, such as vascular endothelial growth factor (VEGF), which can promote tumor angiogenesis [11]. Angiogenesis is indispensable for tumor development, invasion, and metastasis [12]. On this basis, targeted drugs such as sorafenib, lenvatinib, VEGF inhibitors, and immune checkpoint inhibitors (ICIs) have significantly improved the prognosis of patients in recent years, but the overall treatment effect is still poor due to the changes in HCC heterogeneity and the continuous emergence of phenotypic drug resistance [13–19]. Therefore, it is particularly urgent to find a way to evaluate the disease early and take personalized treatment measures to improve the prognosis of patients.
The development of liver cancer is a multistep process caused by changes in signaling pathways triggered by multiple genes, and it shows high heterogeneity within tumors, between tumors, and between patients [20–24]. DEGs play an important role in this process. Therefore, considering the high heterogeneity of HCC, the limited treatment methods, and the poor prognosis of patients, it is more urgent to further explore the development mechanism of HCC and new survival and prognostic models. Nault et al. first identified a genetic marker associated with the development of HCC in 2013 [25]. Subsequent studies have also shown that gene mutations occurring in HCC can be used as biological markers for targeted therapies [26]. Although it has been demonstrated that programmed death ligand-1 (PD-L1) inhibitors combined with antiangiogenesis therapy can significantly improve the prognosis of patients with HCC [27], intervention-related toxicity and difficulty in determining the optimal dosing phase have hindered further benefit for patients [28, 29]. Therefore, the search for HCC-related dysfunctional genes is particularly important to elucidate the mechanisms underlying the development of the disease and to improve the prognosis of patients. Thanks to the rapid development of sequencing technology, many disease-related marker genes have been identified one after another, which has laid a solid foundation for the screening of HCC-related genes and the establishment of prognostic models. Public databases such as The Cancer Genome Atlas (TCGA) are useful tools to screen microarray data for DEGs associated with the development of HCC [30, 31].
In this study, we used a random forest algorithm to identify key genes expressed in HCC in the TCGA database and screened DEGs between HCC and normal samples. On this basis, 7 important DEGs were finally screened. Subsequently, we performed enrichment analysis on these 7 important DEGs and analyzed the expression levels of these genes in different clinical states. Furthermore, we performed survival analysis and COX regression analysis, constructed a prognostic risk score model, and plotted the receiver operating characteristic (ROC) curve. Finally, 20 coexpressed genes were screened by GeneMANIA, and GO and KEGG enrichment analyses were performed to further explore their biological functions and molecular pathways. The DEGs of HCC discovered in this study, as well as the constructed survival prognosis prediction model, are expected to provide new insights into the clinical treatment and biological mechanisms of HCC.
2. Materials and Methods
2.1. Data Source
The data for our study were extracted from The Cancer Genome Atlas (TCGA; https://portal.gdc.cancer.gov, up to July 31, 2022) database, which contains transcriptomic data of 374 HCC tumor samples and 50 normal samples, as well as 370 clinical samples and related data. We used data obtained from the TCGA database using Illumina HiSeq Systems, and the sequencing data format was a Counts file.
2.2. Random Forest Screening for Important Genes
Build a random forest model using the random forest package [32]. First, calculate the average model false positive rate based on out-of-band data for all genes and select 400 as the optimal number of trees to include in the random forest. Next, build a random forest model and use the Gini coefficient method to obtain dimension importance values for the random forest model. The genes with the top 30 importance values were selected for subsequent analysis.
2.3. Identification of DEGs
Expression data downloaded from TCGA were analyzed using the Limma package of R version 4.2.0 [33], and fold differential expression was calculated after removing or averaging probe sets without corresponding gene symbols or genes with multiple probe sets, respectively. The criteria for setting the DEG were as follows: genes with adjusted
2.4. Expression of DEGs in Different Clinical States
We further investigated the expression of DEGs and their association with different clinical states of HCC: event, age, gender, and stage. Violin plots were drawn using the ggplot2 package of the R software. Differences in gene expression among different groups were analyzed using SPSS 27.0. Definitions:
2.5. Construction and Validation of an HCC Prognostic Risk Scoring Model
Kaplan‒Meier survival analysis was performed on the important DEGs using the R software survival and survminer packages, and the related survival curves were drawn.
Based on univariate and multivariate Cox regression analyses, a prognostic risk score model was constructed. According to the risk score grouping, prognostic analysis and Cox regression analysis were performed to verify whether the risk score could be used as an independent risk factor for evaluating the survival and prognosis of HCC patients. The specificity and sensitivity of the risk scoring model were verified using the R software pROC package, and the ROC was drawn using the ggplot2 package.
2.6. Enrichment Analyses of Important DEGs
Analysis of Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment plays a very important role in the annotation of gene products and the study of molecular mechanisms [34, 35]. We used enrichGO and enrichKEGG packages in the R language for enrichment, and adjusted
2.7. Analysis of the Relationship between DEGs Genes
We constructed a coexpression network of these genes using GeneMANIA (https://www.genemania.org/) [36] and identified associations within them. Subsequently, we further carried out enrichment analysis on the coexpressed genes of important DEGs, intending to explore their biological functions and molecular mechanisms.
3. Results
3.1. Identification of Important DEGs
The research flowchart of this study is shown in Figure 1. First, we downloaded the expression profiling data of LIHC from the TCGA database along with clinical data. To find the genes with the greatest influence on the phenotype, we used the random forest method to screen. The relationship between the error of the reference model and the number of decision trees is shown in Figure 2(a). We selected 400 trees as the parameters of the final model, and the model error was stable at this time. We evaluated the final results using the Gini coefficient method and selected the top 30 genes as candidate genes (Table 1 and Figure 2(b)).
[figure(s) omitted; refer to PDF]
Table 1
30 important genes screened by random forest.
Ensemble | Good | Poor | Mean decrease accuracy | Mean decrease Gini |
ENSG00000261461.1 | 3.5035258 | 3.4857582 | 3.7528651 | 0.508084 |
ENSG00000171819.5 | 3.5092202 | 2.720329 | 3.4323096 | 0.5052108 |
ENSG00000276984.1 | 2.2262274 | 3.0303442 | 3.0226219 | 0.4598939 |
ENSG00000280160.1 | 0.2249288 | 2.1851122 | 1.5534425 | 0.4370629 |
ENSG00000272732.1 | 2.7193541 | 2.1123561 | 2.9543775 | 0.428348 |
ENSG00000205955.4 | 1.6407665 | 1.4453403 | 1.9828496 | 0.421363 |
ENSG00000269930.1 | 1.678516 | 3.1569168 | 2.6375549 | 0.4010261 |
ENSG00000265688.2 | 0.1760152 | 1.291468 | 1.4563289 | 0.3938647 |
ENSG00000184811.4 | 3.1610986 | 3.5654376 | 3.5533163 | 0.3897576 |
ENSG00000135912.11 | 2.5015435 | 0.0774343 | 1.4520492 | 0.3676444 |
ENSG00000223656.1 | 1.872504 | 1.3207259 | 1.7311298 | 0.3409327 |
ENSG00000095383.20 | 2.9219665 | 2.4899488 | 2.8241699 | 0.3226696 |
ENSG00000100577.19 | −1.168253 | 1.3519506 | 0.5418066 | 0.3008067 |
ENSG00000147588.7 | 1.8170996 | 2.4736802 | 2.6879344 | 0.2993155 |
ENSG00000128165.9 | 3.1080429 | 3.1663719 | 3.3007222 | 0.2966643 |
ENSG00000169258.7 | 1.5831498 | 2.5090588 | 2.4754708 | 0.2964592 |
ENSG00000101057.16 | 2.2969796 | 2.7405219 | 2.6271281 | 0.2817687 |
ENSG00000215386.13 | 1.3803304 | 0.0963518 | 0.6424683 | 0.2776084 |
ENSG00000231982.1 | 2.3729127 | 2.6866166 | 2.5394458 | 0.2699614 |
ENSG00000267586.7 | 2.1017253 | −0.0713384 | 1.2067781 | 0.2620233 |
ENSG00000168490.14 | 0.3380425 | 1.1292257 | 1.0463427 | 0.2565421 |
ENSG00000142871.18 | −0.4686499 | 0.9268551 | 0.3333044 | 0.2423563 |
ENSG00000275494.1 | 0.2773768 | 1.0829379 | 0.7525331 | 0.2374018 |
ENSG00000227959.1 | 1.5318984 | 1.7262925 | 1.6557142 | 0.2360688 |
ENSG00000213694.6 | 2.2435296 | 2.2237149 | 2.2319256 | 0.2287172 |
ENSG00000275265.1 | 0.3430476 | 0.4610777 | 0.6555662 | 0.2262146 |
ENSG00000253105.6 | 2.2140516 | 2.2472489 | 2.5262515 | 0.2159858 |
ENSG00000176485.12 | 0.3947449 | 1.7110925 | 1.6244757 | 0.2140441 |
ENSG00000245067.7 | 2.3810183 | 0.1541471 | 1.4993123 | 0.2139093 |
ENSG00000245322.7 | 2.4175229 | −0.7897455 | 0.6036063 | 0.2138007 |
Next, we screened 1,564 DEGs using the Limma method and plotted the volcano (Figure 2(c)). After interacting with 30 candidate genes, 7 important DEGs were finally screened (Table 2). Analysis of the expression profiles of these seven DEGs revealed that MAFG-DT, GPRIN1, and MYBL2 were highly expressed in the tumor group, and LINC00907, GSTZ1, CCN1, and GSTM5 were lowly expressed in the tumor group (Figure 2(d)).
Table 2
7 important DEGs related to HCC.
Gene symbol | LogFC | AveExpr | t | Adj. | B | Change | |
MAFG-DT | 2.295816916 | 0.180864209 | 8.733112195 | 5.68E − 17 | 9.15E − 16 | 27.82424792 | Up |
GPRIN1 | 2.444281099 | 0.984424466 | 8.88620807 | 1.79E − 17 | 3.09E − 16 | 28.94197713 | Up |
MYBL2 | 3.86104155 | 2.858320008 | 8.757496695 | 4.73E − 17 | 7.71E − 16 | 27.95856037 | Up |
LINC00907 | −2.60570328 | −3.393211756 | −9.088003866 | 3.82E − 18 | 7.25E − 17 | 30.46891163 | Down |
GSTZ1 | −2.341972944 | 4.83864018 | −13.29705978 | 5.36E − 34 | 5.72E − 32 | 66.16025292 | Down |
CCN1 | −2.059250199 | 5.163337155 | −11.41610643 | 1.60E − 26 | 7.98E − 25 | 49.05138576 | Down |
GSTM5 | −2.895398904 | −1.505519136 | −9.292124572 | 7.84E − 19 | 1.63E − 17 | 31.95202187 | Down |
3.2. Expression and Clinical Parameters of Important DEGs in Patients
To further analyze the relationship between these seven important DEGs and clinical status, we used SPSS to compare the expression differences of each gene under different groups, and used the ggplot2 package of R software to draw a violin plot (Figures 3–6).
[figure(s) omitted; refer to PDF]
According to the outcome of patience, we divided the patients into the normal group, live group, and dead group. The results showed that compared with the normal group, the live and death groups showed significant differences in each gene (
Stratified analysis by age revealed that GSTZ1 expression was significantly lower in the 41–60 years group than that in the 61–80 years group (
By analyzing the expression levels of each gene in patients of different genders, we found that the expression level of GSTZ1 (
The expression level of MAFG-DT was significantly lower in stage I than in stages II (
3.3. Impact of Important DEGs on Patients’ OS
We performed survival analysis using the survival and survminer packages of R software and used survival curves combined with log-rank tests to assess the impact of important DEGs on patient OS. As shown in Figure 7, high expression of GPRIN1 and MYBL2 indicated better prognosis of patients (
[figure(s) omitted; refer to PDF]
3.4. Construction and Validation of a Prognostic Risk Model for HCC Patients
First, the selected seven important DEGs were combined with survival time and survival status and then included in a multivariate Cox regression analysis (Table 3 and Figure 8(a)), and a prognostic risk score model was constructed based on the results. The risk score calculation method is 7 important as the product of DEGs expression level and risk coefficient. The specific risk score model is risk score = “MAFG-DT”
Table 3
Multivariate Cox regression analysis of important DEGs.
Important genes | Coeff | HR | 95% CI | ||
Lower | Upper | ||||
MAFG-DT | 0.069645 | 1.0721 | 0.8671 | 1.326 | 0.52014 |
GSTZ1 | −0.070909 | 0.9315 | 0.4820 | 1.800 | 0.83293 |
GPRIN1 | 0.009885 | 1.0099 | 0.7659 | 1.332 | 0.94415 |
MYBL2 | 0.461418 | 1.5863 | 1.1213 | 2.244 | 0.00915 |
LINC00907 | 0.010721 | 1.0108 | 0.8732 | 1.170 | 0.88581 |
CCN1 | 0.227363 | 1.2553 | 0.6951 | 2.267 | 0.45090 |
GSTM5 | −0.116514 | 0.8900 | 0.7918 | 1.000 | 0.05078 |
[figure(s) omitted; refer to PDF]
We divided 346 HCC samples into a high-risk group and a low-risk group, with 173 cases in each group, using the median risk value (2.4211) as the cutoff value. Survival analysis showed that there was a significant difference between the two groups (
The results of univariate Cox regression analysis showed that the risk score model was significantly correlated with survival time and survival status (hazard ratio = 0.5066, 95% Confidence interval = 0.353–0.727,
Table 4
Univariate and multivariate Cox regression analyses on the survival of HCC patients.
Variable | Univariate analysis | Multivariate analysis | ||
HR (95% CI) | HR (95% CI) | |||
Age | 1.009 (0.9946–1.023) | 0.23 | 1.0078 (0.9928–1.0229) | 0.31007 |
Gender | 0.7857 (0.5469–1.129) | 0.192 | 0.9014 (0.6043–1.3447) | 0.61101 |
Stage | ||||
Stage II | 1.452 (0.8705–2.423) | 0.15300 | 1.3944 (0.8344–2.3303) | 0.20446 |
Stage III | 3.062 (1.9841–4.726) | 4.31e − 07 | 2.8765 (1.8589–4.4513) | 2.11e − 06 |
Stage IV | 6.415 (1.9691–20.897) | 0.00204 | 5.5058 (1.6529–18.3403) | 0.00546 |
Risk group | 0.5066 (0.353–0.727) | 0.000224 | 0.5549 (0.3767–0.8173) | 0.00287 |
HR, hazard ratio;
Finally, using ROC to evaluate the relationship between clinical characteristics and the prognosis of HCC patients, the results showed that the risk scoring model (AUC = 0.582) was the second most important factor for prognosis after stage (AUC = 0.659) (Figure 9).
[figure(s) omitted; refer to PDF]
3.5. Enrichment Analysis and Interaction Analysis
To further explore the mechanism of action and signaling pathway of important DEGs, we performed GO and KEGG enrichment analyses on them. GO enrichment analysis found that important DEGs were mainly enriched in the glutathione metabolic process (
Table 5
GO enrichment results of important DEGs.
Category | ID | Description | Bg ratio | qvalue | geneID | Count | ||
BP | GO:0006749 | Glutathione metabolic process | 65/18800 | 0.00011692 | 0.016368738 | 0.005907515 | GSTZ1/GSTM5 | 2 |
MF | GO:0004364 | Glutathione transferase activity | 26/18410 | 1.91E − 05 | 0.000363455 | 8.05E − 05 | GSTZ1/GSTM5 | 2 |
MF | GO:0016765 | Transferase activity, transferring alkyl or aryl (other than methyl) groups | 59/18410 | 0.000100347 | 0.000953296 | 0.000211257 | GSTZ1/GSTM5 | 2 |
MF | GO:0004602 | Glutathione peroxidase activity | 22/18410 | 0.005961396 | 0.035906466 | 0.007957112 | GSTZ1 | 1 |
MF | GO:0005520 | Insulin-like growth factor binding | 29/18410 | 0.00785223 | 0.035906466 | 0.007957112 | CCN1 | 1 |
MF | GO:0016859 | Cis-trans isomerase activity | 41/18410 | 0.011086965 | 0.035906466 | 0.007957112 | GSTZ1 | 1 |
MF | GO:0004601 | Peroxidase activity | 52/18410 | 0.014044721 | 0.035906466 | 0.007957112 | GSTZ1 | 1 |
MF | GO:0050840 | Extracellular matrix binding | 55/18410 | 0.014850152 | 0.035906466 | 0.007957112 | CCN1 | 1 |
MF | GO:0016684 | Oxidoreductase activity, acting on peroxide as acceptor | 56/18410 | 0.015118512 | 0.035906466 | 0.007957112 | GSTZ1 | 1 |
MF | GO:0016209 | Antioxidant activity | 85/18410 | 0.022875552 | 0.044982892 | 0.009968508 | GSTZ1 | 1 |
MF | GO:0051219 | Phosphoprotein binding | 88/18410 | 0.023675206 | 0.044982892 | 0.009968508 | GPRIN1 | 1 |
[figure(s) omitted; refer to PDF]
Table 6
KEGG enrichment results of important DEGs.
ID | Description | Bg ratio | q value | geneID | Count | ||
hsa00350 | Tyrosine metabolism | 36/8159 | 0.013180205 | 0.049939588 | 0.021903328 | 2954 | 1 |
hsa00480 | Glutathione metabolism | 57/8159 | 0.020814906 | 0.049939588 | 0.021903328 | 2949 | 1 |
hsa05204 | Chemical carcinogenesis–DNA adducts | 69/8159 | 0.02515986 | 0.049939588 | 0.021903328 | 2949 | 1 |
hsa00982 | Drug metabolism–cytochrome P450 | 72/8159 | 0.026244087 | 0.049939588 | 0.021903328 | 2949 | 1 |
hsa01524 | Platinum drug resistance | 73/8159 | 0.026605317 | 0.049939588 | 0.021903328 | 2949 | 1 |
hsa00980 | Metabolism of xenobiotics by cytochrome P450 | 78/8159 | 0.028410127 | 0.049939588 | 0.021903328 | 2949 | 1 |
hsa00983 | Drug metabolism–other enzymes | 80/8159 | 0.029131427 | 0.049939588 | 0.021903328 | 2949 | 1 |
DEGs, differentially expressed genes.
To understand the interaction network of these genes, we used the GeneMANIA database for analysis. The results showed that these genes were in a complex PPI network, with physical Interactions of 77.64%, coexpression of 8.01%, predicted of 5.37%, colocalization of 3.63%, genetic Interactions of 2.87%, pathway of 1.88% and shared protein domains of 0.60% (Figure 10(c)). GO enrichment analysis of these 25 coexpressed genes found that, for biological processes, they were mainly enriched in the glutathione metabolic process (
Table 7
GO enrichment results of interacting genes.
Category | ID | Description | Bg ratio | qvalue | geneID | Count | ||
BP | GO:0006749 | Glutathione metabolic process | 65/18800 | 2.27E − 10 | 1.34E − 07 | 1.06E − 07 | GSTZ1/GSTM5/GSS/GSTM4/GSTM2/GSTM1 | 6 |
BP | GO:0042537 | Benzene-containing compound metabolic process | 27/18800 | 4.18E − 08 | 1.23E − 05 | 9.79E − 06 | FAH/GSTM4/GSTM2/GSTM1 | 4 |
BP | GO:0006575 | Cellular-modified amino acid metabolic process | 188/18800 | 1.40E − 07 | 2.74E − 05 | 2.18E − 05 | GSTZ1/GSTM5/GSS/GSTM4/GSTM2/GSTM1 | 6 |
BP | GO:0006570 | Tyrosine metabolic process | 15/18800 | 9.35E − 07 | 0.000135472 | 0.000107739 | GSTZ1/FAH/HGD | 3 |
BP | GO:0009074 | Aromatic amino acid family catabolic process | 16/18800 | 1.15E − 06 | 0.000135472 | 0.000107739 | GSTZ1/FAH/HGD | 3 |
BP | GO:0006790 | Sulfur compound metabolic process | 339/18800 | 4.36E − 06 | 0.000394304 | 0.000313583 | GSTZ1/GSTM5/GSS/GSTM4/GSTM2/GSTM1 | 6 |
BP | GO:0042178 | Xenobiotic catabolic process | 25/18800 | 4.69E − 06 | 0.000394304 | 0.000313583 | GSTM4/GSTM2/GSTM1 | 3 |
BP | GO:0042759 | Long-chain fatty acid biosynthetic process | 29/18800 | 7.42E − 06 | 0.000546204 | 0.000434386 | GSTM4/GSTM2/GSTM1 | 3 |
BP | GO:0009072 | Aromatic amino acid family metabolic process | 31/18800 | 9.11E − 06 | 0.000596212 | 0.000474157 | GSTZ1/FAH/HGD | 3 |
BP | GO:1901606 | Alpha-amino acid catabolic process | 94/18800 | 0.000257064 | 0.015141082 | 0.012041429 | GSTZ1/FAH/HGD | 3 |
BP | GO:1900221 | Regulation of amyloid-beta clearance | 20/18800 | 0.00031786 | 0.017019984 | 0.013535686 | CLU/HDAC1 | 2 |
BP | GO:0006805 | Xenobiotic metabolic process | 108/18800 | 0.000386746 | 0.018101332 | 0.014395662 | GSTM4/GSTM2/GSTM1 | 3 |
BP | GO:0001676 | Long-chain fatty acid metabolic process | 111/18800 | 0.000419095 | 0.018101332 | 0.014395662 | GSTM4/GSTM2/GSTM1 | 3 |
BP | GO:0009063 | Cellular amino acid catabolic process | 112/18800 | 0.000430252 | 0.018101332 | 0.014395662 | GSTZ1/FAH/HGD | 3 |
BP | GO:1990748 | Cellular detoxification | 115/18800 | 0.000464867 | 0.018253777 | 0.014516899 | GSTZ1/GSTM2/GSTM1 | 3 |
BP | GO:0006520 | Cellular amino acid metabolic process | 285/18800 | 0.000508838 | 0.018731601 | 0.014896904 | GSTZ1/FAH/HGD/GSS | 4 |
BP | GO:0097237 | Cellular response to toxic substance | 123/18800 | 0.000565787 | 0.019602849 | 0.015589791 | GSTZ1/GSTM2/GSTM1 | 3 |
BP | GO:0098754 | Detoxification | 150/18800 | 0.001006626 | 0.032939028 | 0.026195814 | GSTZ1/GSTM2/GSTM1 | 3 |
BP | GO:0097242 | Amyloid-beta clearance | 39/18800 | 0.001220616 | 0.037009268 | 0.029432802 | CLU/HDAC1 | 2 |
BP | GO:0006633 | Fatty acid biosynthetic process | 162/18800 | 0.001256681 | 0.037009268 | 0.029432802 | GSTM4/GSTM2/GSTM1 | 3 |
BP | GO:0071466 | Cellular response to xenobiotic stimulus | 168/18800 | 0.001395135 | 0.039130226 | 0.031119561 | GSTM4/GSTM2/GSTM1 | 3 |
CC | GO:0045171 | Intercellular bridge | 75/19594 | 2.36E − 06 | 0.00014373 | 0.000109131 | GSTM5/GSTM4/GSTM2/GSTM1 | 4 |
CC | GO:0005667 | Transcription regulator complex | 483/19594 | 2.59E − 05 | 0.000788585 | 0.000598753 | LIN9/TFDP2/E2F4/LIN37/RBL1/HDAC1 | 6 |
CC | GO:0017053 | Transcription repressor complex | 76/19594 | 0.000121293 | 0.002466282 | 0.001872587 | LIN9/LIN37/HDAC1 | 3 |
MF | GO:0004364 | Glutathione transferase activity | 26/18410 | 1.56E − 10 | 1.25E − 08 | 7.22E − 09 | GSTZ1/GSTM5/GSTM4/GSTM2/GSTM1 | 5 |
MF | GO:0043295 | Glutathione binding | 10/18410 | 4.64E − 10 | 1.86E − 08 | 1.07E − 08 | GSS/GSTM4/GSTM2/GSTM1 | 4 |
MF | GO:1900750 | Oligopeptide binding | 11/18410 | 7.28E − 10 | 1.94E − 08 | 1.12E − 08 | GSS/GSTM4/GSTM2/GSTM1 | 4 |
MF | GO:0016765 | Transferase activity, transferring alkyl or aryl (other than methyl) groups | 59/18410 | 1.15E − 08 | 2.31E − 07 | 1.34E − 07 | GSTZ1/GSTM5/GSTM4/GSTM2/GSTM1 | 5 |
MF | GO:0072341 | Modified amino acid binding | 93/18410 | 6.00E − 06 | 9.60E − 05 | 5.56E − 05 | GSS/GSTM4/GSTM2/GSTM1 | 4 |
MF | GO:1901681 | Sulfur compound binding | 267/18410 | 2.10E − 05 | 0.000279468 | 0.000161797 | CCN1/GSS/GSTM4/GSTM2/GSTM1 | 5 |
MF | GO:0042277 | Peptide binding | 322/18410 | 5.13E − 05 | 0.000586451 | 0.000339524 | GSS/GSTM4/GSTM2/CLU/GSTM1 | 5 |
MF | GO:1990841 | Promoter-specific chromatin binding | 62/18410 | 7.00E − 05 | 0.000699951 | 0.000405235 | E2F4/RBL1/HDAC1 | 3 |
MF | GO:0033218 | Amide binding | 402/18410 | 0.000146108 | 0.001298736 | 0.0007519 | GSS/GSTM4/GSTM2/CLU/GSTM1 | 5 |
MF | GO:0004602 | Glutathione peroxidase activity | 22/18410 | 0.000370295 | 0.002962359 | 0.00171505 | GSTZ1/GSTM2 | 2 |
MF | GO:0005178 | Integrin binding | 156/18410 | 0.001059937 | 0.007708636 | 0.004462895 | CCN1/CIB2/ITGB5 | 3 |
MF | GO:0004601 | Peroxidase activity | 52/18410 | 0.002075466 | 0.01383644 | 0.008010571 | GSTZ1/GSTM2 | 2 |
MF | GO:0016684 | Oxidoreductase activity, acting on peroxide as acceptor | 56/18410 | 0.002402771 | 0.014786285 | 0.008560481 | GSTZ1/GSTM2 | 2 |
MF | GO:0016209 | Antioxidant activity | 85/18410 | 0.005443299 | 0.031104563 | 0.018007905 | GSTZ1/GSTM2 | 2 |
BP, biological process; CC, cell component; MF, molecular function.
Table 8
KEGG enrichment results of interacting genes.
ID | Description | Bg ratio | q value | geneID | Count | ||
hsa04218 | Cellular senescence | 156/8164 | 3.38E − 08 | 1.35E − 06 | 8.89E − 07 | 4605/132660/286826/2305/1874/55957/5933 | 7 |
hsa00480 | Glutathione metabolism | 57/8164 | 1.50E − 07 | 2.99E − 06 | 1.97E − 06 | 2949/2937/2948/2946/2944 | 5 |
hsa05204 | Chemical carcinogenesis–DNA adducts | 69/8164 | 1.65E − 05 | 0.000164828 | 0.00010844 | 2949/2948/2946/2944 | 4 |
hsa00982 | Drug metabolism–cytochrome P450 | 72/8164 | 1.95E − 05 | 0.000164828 | 0.00010844 | 2949/2948/2946/2944 | 4 |
hsa01524 | Platinum drug resistance | 73/8164 | 2.06E − 05 | 0.000164828 | 0.00010844 | 2949/2948/2946/2944 | 4 |
hsa00980 | Metabolism of xenobiotics by cytochrome P450 | 78/8164 | 2.68E − 05 | 0.00016932 | 0.000111395 | 2949/2948/2946/2944 | 4 |
hsa00983 | Drug metabolism–other enzymes | 80/8164 | 2.96E − 05 | 0.00016932 | 0.000111395 | 2949/2948/2946/2944 | 4 |
hsa00350 | Tyrosine metabolism | 36/8164 | 7.27E − 05 | 0.000363501 | 0.000239145 | 2954/2184/3081 | 3 |
hsa04110 | Cell cycle | 126/8164 | 0.000175201 | 0.000778672 | 0.000512284 | 7029/1874/5933/3065 | 4 |
hsa05418 | Fluid shear stress and atherosclerosis | 139/8164 | 0.000255701 | 0.001022805 | 0.000672898 | 2949/2948/2946/2944 | 4 |
hsa05225 | Hepatocellular carcinoma | 168/8164 | 0.000526692 | 0.001915242 | 0.001260028 | 2949/2948/2946/2944 | 4 |
hsa05207 | Chemical carcinogenesis–receptor activation | 212/8164 | 0.001260675 | 0.004202252 | 0.002764639 | 2949/2948/2946/2944 | 4 |
hsa05208 | Chemical carcinogenesis–reactive oxygen species | 223/8164 | 0.001520568 | 0.00467867 | 0.003078073 | 2949/2948/2946/2944 | 4 |
4. Discussion
At present, many studies have found many genes that affect HCC, but the mechanism of HCC occurrence and development is still not very clear, and there is an urgent need to further explore the factors affecting its development and prognosis. Although several previous studies have analyzed gene signatures related to HCC prognosis [30, 37–39], these studies did not further screen genes that are more closely related to patient survival after screening DEGs. Therefore, in this study, we used the random forest and limma algorithms to screen out 30 important genes and 1,564 DEGs, respectively, and then took the intersection of the two to further screen out 7 important DEGs: MAFG-DT, GSTZ1, GPRIN1, MYBL2, LINC00907, CCN1, and GSTM5. Subsequent enrichment analysis, expression profiling analysis, survival analysis, and the constructed prognostic prediction model indicated that they are closely related to the occurrence and prognosis of HCC.
Among the seven important DEGs, MAFG-DT (logFC = 2.295817), GPRIN1 (logFC = 2.444281), and MYBL2 (logFC = 3.861042) were all significantly elevated in liver cancer samples (Figure 2(d)). MAFG-DT is an oncogenic long noncoding RNA (lncRNA), and many previous studies have shown that overexpression of MAFG-DT can promote the proliferation and metastasis of cancer cells [40–43]. High expression of MAFG-DT is significantly associated with poor prognosis in bladder and breast cancer patients [44, 45]. In this study, MAFG-DT was highly expressed in liver cancer patients, and the normal group was significantly different from the liver cancer group after grouping by age, gender, and stage (Figures 3–6). In addition, after grouping according to the high and low expression of MAFG-DT, the survival time of the low expression group was higher than that of the high expression group, although there was no significant difference (
In addition, LINC00907 (logFC = −2.6057), CCN1 (logFC = −2.05925), GSTZ1 (logFC = −2.34197), and GSTM5 (logFC = −2.8954) were lowly expressed in liver cancer tissues. Through survival analysis, only GSTM5 was found to be associated with patient prognosis (
Next, we constructed a risk-scoring model based on the multivariate Cox regression analysis of 7 important DEGs. Kaplan‒Meier survival analysis showed that the high-risk group had a significantly lower survival time (
To further study the molecular signaling pathways of important DEGs, we performed enrichment analysis and coexpression analysis. As we discussed before, both GSTZ1 and GSTM5 belong to the GST family, so the screened important DEGs were mainly enriched in the glutathione metabolic process and glutathione transferase activity (
However, this study still has some limitations. Firstly, the data for model construction and validation were obtained from the TCGA database. This public database contains incomplete information, and the data are all retrospective. Therefore, prospective real-world studies are necessary to verify the reliability of the model. It should be noted that in the process of research, it is necessary to comprehensively collect data at all stages of disease progression, such as blood samples and imaging data, etc., to eliminate information distortion caused by incomplete data collection as far as possible. To improve the representativeness of the results, multicenter studies are also necessary. Secondly, the data used in this study were only from the TCGA database, which may make the results lack representative. Therefore, data from other databases, such as GEO and Oncomine databases, can be selected for subsequent analysis and cross-validation. Thirdly, the diagnostic efficacy of the risk score model constructed in this study was not superior to staging, although the results of its KM analysis were beneficial. One of the reasons for this may be due to the bias caused by the data analysis using only the TCGA database. However, compared with the stage, the risk scoring model has more important significance in the evaluation of patients in the early stage of the disease to improve the prognosis. The performance of this model will be tested in clinical cohorts in the future. Finally, the seven important DEGs screened in this study are currently only data demonstrations. We will carry out in vitro and in vivo experiments to further explore the specific molecular pathways of these genes in HCC.
5. Conclusion
In conclusion, we innovatively used a combination of random forest and Limma to screen out the important DEGs for HCC development. Expression analysis and survival analysis were performed, indicating that these genes are closely related to the survival of HCC patients. The subsequently constructed prognostic risk scoring model has good predictive value for the prognosis of HCC patients, and combining it with other clinical indicators can provide an effective reference for clinical treatment. Subsequent enrichment analysis and coexpression analysis showed that seven important DEGs were closely related to cellular senescence and glutathione metabolism, which also provided new ideas for further research on the molecular mechanism of HCC occurrence and development. In brief, the early risk score model provided in this study can be used as a reference for subsequent personalized treatment of patients and ultimately help to improve prognosis.
Authors’ Contributions
Yikai Wang and Xiongtao Liu conceived and designed the study. Yikai Wang analyzed data, created graphs, and wrote the manuscript. Le Ma, Pengjun Xue, and Bianni Qin checked the analysis results. Ting Wang, Bo Li, and Lina Wu helped collect data and references. Liyan Zhao checked the article’s format and language. Xiongtao Liu is responsible for the overall content as a guarantor.
Acknowledgments
This article was supported by the Key Research and Development Program of Shaanxi (grant number 2021SF-227).
Glossary
Abbreviations
CTLA4:Cytotoxic T-lymphocyte antigen 4
DEGs:Differentially expressed genes
GO:Gene ontology
HCC:Hepatocellular carcinoma
ICIs:Immune checkpoint inhibitors
KEGG:Kyoto Encyclopedia of Genes and Genomes
LAG-3:Lymphocyte activating gene 3 protein
OS:Overall survival
PD-1:Programmed death-1
PD-L1:Programmed death ligand-1
ROC:Receiver operating characteristic
TCGA:The Cancer Genome Atlas
TIM-3:Mucin domain molecule 3
VEGF:Vascular endothelial growth factor.
[1] S. Chidambaranathan-Reghupaty, P. B. Fisher, D. Sarkar, "Hepatocellular carcinoma (HCC): epidemiology, etiology and molecular classification," Advances in Cancer Research, vol. 149,DOI: 10.1016/bs.acr.2020.10.001, 2021.
[2] S. Rodriguez, S. Kristy, T. Mehmetoglu-Gurbuz, M. Goldfarb, S. Karri, J. Rocha, M. Shahinian, A. Yazadi, S. Poudel, R. Subramani, "Phytochemicals as an alternative or integrative option," Conjunction with Conventional Treatments for Hepatocellular Carcinoma, vol. 13, 2021.
[3] T. Couri, A. Pillai, "Goals and targets for personalized therapy for HCC," Hepatology International, vol. 13 no. 2, pp. 125-137, DOI: 10.1007/s12072-018-9919-1, 2019.
[4] K. Oura, A. Morishita, J. Tani, T. Masaki, "Tumor immune microenvironment and immunosuppressive therapy in hepatocellular carcinoma: a review," International Journal of Molecular Sciences, vol. 22 no. 11,DOI: 10.3390/ijms22115801, 2021.
[5] E. Lai, G. Astara, P. Ziranu, A. Pretta, M. Migliari, M. Dubois, C. Donisi, S. Mariani, N. Liscia, V. Impera, M. Persano, S. Tolu, F. Balconi, G. Pinna, D. Spanu, A. Pireddu, G. Saba, S. Camera, F. Musio, M. Puzzoni, V. Pusceddu, C. Madeddu, A. Casadei Gardini, M. Scartozzi, "Introducing immunotherapy for advanced hepatocellular carcinoma patients: too early or too fast?," Critical Reviews in Oncology, vol. 157,DOI: 10.1016/j.critrevonc.2020.103167, 2021.
[6] C. Allemani, H. K. Weir, H. Carreira, R. Harewood, D. Spika, X. S. Wang, F. Bannon, J. V. Ahn, C. J. Johnson, A. Bonaventure, R. Marcos-Gragera, C. Stiller, G. Azevedo e Silva, W. Q. Chen, O. J. Ogunbiyi, B. Rachet, M. J. Soeberg, H. You, T. Matsuda, M. Bielska-Lasota, H. Storm, T. C. Tucker, M. P. Coleman, "CONCORD Working Group," Lancet, vol. 385 no. 9972, pp. 977-1010, DOI: 10.1016/S0140-6736(14)62038-9, 2015.
[7] H. Nakagawa, S. Maeda, "Inflammation- and stress-related signaling pathways in hepatocarcinogenesis," World Journal of Gastroenterology, vol. 18 no. 31, pp. 4071-4081, DOI: 10.3748/wjg.v18.i31.4071, 2012.
[8] C. Berasain, J. Castillo, J. Prieto, M. A. Avila, "New molecular targets for hepatocellular carcinoma: the ErbB1 signaling system," Liver International, vol. 27 no. 2, pp. 174-185, DOI: 10.1111/j.1478-3231.2006.01424.x, 2007.
[9] A. Bishayee, Advances in Experimental Medicine & Biology, vol. 816, pp. 401-435, DOI: 10.1007/978-3-0348-0837-8_16, 2014.
[10] P. Leone, A. G. Solimando, R. Fasano, A. Argentiero, E. Malerba, A. Buonavoglia, L. G. Lupo, V. De Re, N. Silvestris, V. Racanelli, "The evolving role of immune checkpoint inhibitors in hepatocellular carcinoma treatment," Vaccines (Basel), vol. 9 no. 5,DOI: 10.3390/vaccines9050532, 2021.
[11] T. Voron, E. Marcheteau, S. Pernot, O. Colussi, E. Tartour, J. Taieb, M. Terme, Frontiers in Oncology, vol. 4,DOI: 10.3389/fonc.2014.00070, 2014.
[12] R. S. Kerbel, "Tumor angiogenesis," New England Journal of Medicine, vol. 358 no. 19, pp. 2039-2049, DOI: 10.1056/nejmra0706596, 2008.
[13] M. Kudo, R. S. Finn, S. Qin, K. H. Han, K. Ikeda, F. Piscaglia, A. Baron, J. W. Park, G. Han, J. Jassem, J. F. Blanc, A. Vogel, D. Komov, T. R. J. Evans, C. Lopez, C. Dutcus, M. Guo, K. Saito, S. Kraljevic, T. Tamai, M. Ren, A. L Cheng, "Lenvatinib versus sorafenib in first-line treatment of patients with unresectable hepatocellular carcinoma: a randomised phase 3 non-inferiority trial," The Lancet, vol. 391 no. 10126, pp. 1163-1173, DOI: 10.1016/s0140-6736(18)30207-1, 2018.
[14] J. Bruix, S. Qin, P. Merle, A. Granito, Y. H. Huang, G. Bodoky, M. Pracht, O. Yokosuka, O. Rosmorduc, V. Breder, R. Gerolami, G. Masi, P. J. Ross, T. Song, J. P. Bronowicki, I. Ollivier-Hourmand, M. Kudo, A. L. Cheng, J. M. Llovet, R. S. Finn, M. A. LeBerre, A. Baumhauer, G. Meinhardt, G. Han, "RESORCE Investigators," Lancet, vol. 389 no. 10064, pp. 56-66, DOI: 10.1016/S0140-6736(16)32453-9, 2017.
[15] G. K. Abou-Alfa, T. Meyer, A. L. Cheng, A. B. El-Khoueiry, L. Rimassa, B. Y. Ryoo, I. Cicin, P. Merle, Y. Chen, J. W. Park, J. F. Blanc, L. Bolondi, H. J. Klumpen, S. L. Chan, V. Zagonel, T. Pressiani, M. H. Ryu, A. P. Venook, C. Hessel, A. E. Borgman-Hagey, G. Schwab, R. K. Kelley, "Cabozantinib in patients with advanced and progressing hepatocellular carcinoma," New England Journal of Medicine, vol. 379 no. 1, pp. 54-63, DOI: 10.1056/nejmoa1717002, 2018.
[16] A. X. Zhu, Y. K. Kang, C. J. Yen, R. S. Finn, P. R. Galle, J. M. Llovet, E. Assenat, G. Brandi, M. Pracht, H. Y. Lim, K. M. Rau, K. Motomura, I. Ohno, P. Merle, B. Daniele, D. B. Shin, G. Gerken, C. Borg, J. B. Hiriart, T. Okusaka, M. Morimoto, Y. Hsu, P. B. Abada, M. Kudo, "REACH-2 study investigators," The Lancet Oncology, vol. 20 no. 2, pp. 282-296, DOI: 10.1016/S1470-2045(18)30937-9, 2019.
[17] R. S. Finn, S. Qin, M. Ikeda, P. R. Galle, M. Ducreux, T. Y. Kim, M. Kudo, V. Breder, P. Merle, A. O. Kaseb, D. Li, W. Verret, D. Z. Xu, S. Hernandez, J. Liu, C. Huang, S. Mulla, Y. Wang, H. Y. Lim, A. X. Zhu, A. L. Cheng, "Atezolizumab plus bevacizumab in unresectable hepatocellular carcinoma," New England Journal of Medicine, vol. 382 no. 20, pp. 1894-1905, DOI: 10.1056/nejmoa1915745, 2020.
[18] Y. Myojin, T. Kodama, K. Maesaka, D. Motooka, Y. Sato, S. Tanaka, Y. Abe, K. Ohkawa, E. Mita, Y. Hayashi, H. Hikita, R. Sakamori, T. Tatsumi, A. Taguchi, H. Eguchi, T. Takehara, "ST6GAL1 is a novel serum biomarker for lenvatinib-susceptibleFGF19-driven hepatocellular carcinoma," Clinical Cancer Research, vol. 27 no. 4, pp. 1150-1161, DOI: 10.1158/1078-0432.CCR-20-3382, 2021.
[19] M. Arechederra, S. K. Bazai, A. Abdouni, C. Sequera, T. J. Mead, S. Richelme, F. Daian, S. Audebert, R. Dono, A. Lozano, D. Gregoire, U. Hibner, D. S. Allende, S. S. Apte, F. Maina, "ADAMTSL5 is an epigenetically activated gene underlying tumorigenesis and drug resistance in hepatocellular carcinoma," Journal of Hepatology, vol. 74 no. 4, pp. 893-906, DOI: 10.1016/j.jhep.2020.11.008, 2021.
[20] M. Fantini, M. Benvenuto, L. Masuelli, G. V. Frajese, I. Tresoldi, A. Modesti, R. Bei, International Journal of Molecular Sciences, vol. 16 no. 5, pp. 9236-9282, DOI: 10.3390/ijms16059236, 2015.
[21] H. Hai, A. Tamori, N. Kawada, World Journal of Gastroenterology, vol. 20, pp. 6236-6243, DOI: 10.3748/wjg.v20.i20.6236, 2014.
[22] Y. Hoshida, S. M. Nijman, M. Kobayashi, J. A. Chan, J. P. Brunet, D. Y. Chiang, A. Villanueva, P. Newell, K. Ikeda, M. Hashimoto, G. Watanabe, S. Gabriel, S. L. Friedman, H. Kumada, J. M. Llovet, T. R. Golub, "Integrative transcriptome analysis reveals common molecular subclasses of human hepatocellular carcinoma," Cancer Research, vol. 69 no. 18, pp. 7385-7392, DOI: 10.1158/0008-5472.can-09-1089, 2009.
[23] J. C. Nault, A. Villanueva, "Intratumor molecular and phenotypic diversity in hepatocellular carcinoma," Clinical Cancer Research, vol. 21, pp. 1786-1788, DOI: 10.1158/1078-0432.ccr-14-2602, 2015.
[24] S. Torrecilla, D. Sia, A. N. Harrington, Z. Zhang, L. Cabellos, H. Cornella, A. Moeini, G. Camprecios, W. Q. Leow, M. I. Fiel, K. Hao, L. Bassaganyas, M. Mahajan, S. N. Thung, A. Villanueva, S. Florman, M. E. Schwartz, J. M. Llovet, "Trunk mutational events present minimal intra- and inter-tumoral heterogeneity in hepatocellular carcinoma," Journal of Hepatology, vol. 67 no. 6, pp. 1222-1231, DOI: 10.1016/j.jhep.2017.08.013, 2017.
[25] J. Nault, A. De Reyniès, A. Villanueva, J. Calderaro, S. Rebouissou, G. Couchy, T. Decaens, D. Franco, S. Imbeaud, F. Rousseau, D. Azoulay, J. Saric, J. Blanc, C. Balabaud, P. Bioulac–Sage, A. Laurent, P. Laurent–Puig, J. M. Llovet, J. Zucman–Rossi, "A hepatocellular carcinoma 5-gene score associated with survival of patients after liver resection," Gastroenterology, vol. 145 no. 1, pp. 176-187, DOI: 10.1053/j.gastro.2013.03.051, 2013.
[26] G. Khemlina, S. Ikeda, R. Kurzrock, "The biology of Hepatocellular carcinoma: implications for genomic and immune therapies," Molecular Cancer, vol. 16 no. 1,DOI: 10.1186/s12943-017-0712-x, 2017.
[27] M. Kudo, K. Motomura, Y. Wada, Y. Inaba, Y. Sakamoto, M. Kurosaki, Y. Umeyama, Y. Kamei, J. Yoshimitsu, Y. Fujii, M. Aizawa, P. B. Robbins, J. Furuse, "Avelumab in combination with axitinib as first-line treatment in patients with advanced hepatocellular carcinoma: results from the phase 1b VEGF liver 100 trial," Liver Cancer, vol. 10 no. 3, pp. 249-259, DOI: 10.1159/000514420, 2021.
[28] R. Domenico, P. Francesco, "The anti-VEGF(R)," Drug Discovery Legacy: Improving Attrition Rates by Breaking the Vicious Cycle of Angiogenesis in Cancer, vol. 13, 2021.
[29] M. J. Ansari, D. Bokov, A. Markov, A. T. Jalil, M. N. Shalaby, W. Suksatan, S. Chupradit, H. S. Al, N. Shomali, A. Zamani, A. Mohammadi, M. Dadashpour, "Cancer combination therapies by angiogenesis inhibitors; a comprehensive review," Cell Communication and Signaling, vol. 20 no. 1,DOI: 10.1186/s12964-022-00838-y, 2022.
[30] X. Gao, X. Wang, S. Zhang, "Bioinformatics identification of crucial genes and pathways associated with hepatocellular carcinoma," Bioscience Reports, vol. 38 no. 6,DOI: 10.1042/bsr20181441, 2018.
[31] Q. Zhu, Y. Sun, Q. Zhou, Q. He, H. Qian, "Identification of key genes and pathways by bioinformatics analysis with TCGA RNA sequencing data in hepatocellular carcinoma," Molecular and Clinical Oncology, vol. 9 no. 6, pp. 597-606, DOI: 10.3892/mco.2018.1728, 2018.
[32] A. Liaw, M. Wiener, "Classification and regression by randomForest," R News, vol. 23, 2002.
[33] R. Core Team, R: A Language and Environment for Statistical Computing, 2022.
[34] Gene Ontology Consortium, "The Gene Ontology resource: enriching a GOld mine," Nucleic Acids Research, vol. 49 no. 1, pp. D325-D334, DOI: 10.1093/nar/gkaa1113, 2021.
[35] M. Kanehisa, M. Furumichi, M. Tanabe, Y. Sato, K. Morishima, "KEGG: new perspectives on genomes, pathways, diseases and drugs," Nucleic Acids Research, vol. 45 no. 1, pp. D353-D361, DOI: 10.1093/nar/gkw1092, 2017.
[36] D. Warde-Farley, S. L. Donaldson, O. Comes, K. Zuberi, R. Badrawi, P. Chao, M. Franz, C. Grouios, F. Kazi, C. T. Lopes, A. Maitland, S. Mostafavi, J. Montojo, Q. Shao, G. Wright, G. D. Bader, Q. Morris, "The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function," Nucleic Acids Research, vol. 38 no. 2, pp. W214-W220, DOI: 10.1093/nar/gkq537, 2010.
[37] Z. Zhou, Y. Li, H. Hao, Y. Wang, Z. Zhou, Z. Wang, X. Chu, "Screening hub genes as prognostic biomarkers of hepatocellular carcinoma by bioinformatics analysis," Cell Transplantation, vol. 28 no. 1, pp. 76S-86S, DOI: 10.1177/0963689719893950, 2019.
[38] X. Song, R. Du, H. Gui, M. Zhou, W. Zhong, C. Mao, J. Ma, "Identification of potential hub genes related to the progression and prognosis of hepatocellular carcinoma through integrated bioinformatics analysis," Oncology Reports, vol. 43 no. 1, pp. 133-146, DOI: 10.3892/or.2019.7400, 2020.
[39] N. Li, L. Li, Y. Chen, "The identification of core gene expression signature in hepatocellular carcinoma," Oxidative Medicine and Cellular Longevity, vol. 2018,DOI: 10.1155/2018/3478305, 2018.
[40] Y. Sui, G. Lin, Y. Zheng, W. Huang, European Journal of Pharmacology, vol. 859,DOI: 10.1016/j.ejphar.2019.172465, 2019.
[41] H. Ouyang, L. Zhang, Z. Xie, S. Ma, Experimental and Therapeutic Medicine, vol. 18 no. 4, pp. 2547-2553, DOI: 10.3892/etm.2019.7850, 2019.
[42] H. Li, G.-Y. Zhang, C.-H. Pan, X. Y. Zhang, X. Y. Su, European Review for Medical and Pharmacological Sciences, vol. 23 no. 7, pp. 2838-2846, DOI: 10.26355/eurrev_201904_17561, 2019.
[43] Y. C. Jia, J. Yi Wang, Yu Y. Liu, B. Li, H. Guo, A. M. Zang, "LncRNA MAFG-AS1 facilitates the migration and invasion of NSCLC cell via sponging miR-339-5p from MMP15," Cell Biology International, vol. 43 no. 4, pp. 384-393, DOI: 10.1002/cbin.11092, 2019.
[44] Y. Cui, Z. Zhou, Y. Chai, X. Che, Y. Zhang, "Identification of a nomogram from ferroptosis-related long noncoding RNAs signature to analyze overall survival in patients with bladder cancer," Journal of Oncology, vol. 2021,DOI: 10.1155/2021/8533464, 2021.
[45] X. Su, Z. Yu, Y. Zhang, J. Chen, L. Wei, L. Sun, "Construction and analysis of the dysregulated ceRNA network and identification of risk long noncoding RNAs in breast cancer," Frontiers in Genetics, vol. 12,DOI: 10.3389/fgene.2021.664393, 2021.
[46] N. Iida, T. Kozasa, "Identification and biochemical analysis of GRIN1 and GRIN2," Methods in Enzymology, vol. 390, pp. 475-483, DOI: 10.1016/S0076-6879(04)90029-8, 2004.
[47] A. Tameni, E. Sauta, V. Mularoni, F. Torricelli, G. Manzotti, G. Inghirami, R. Bellazzi, V. Fragliasso, A. Ciarrocchi, "The DNA-helicase HELLS drives ALK ALCL proliferation by the transcriptional control of a cytokinesis-related program," Cell Death & Disease, vol. 12 no. 1,DOI: 10.1038/s41419-021-03425-0, 2021.
[48] W. Zhou, P. Li, P. Jin, "miR-654-5p promotes gastric cancer progression via the GPRIN1/NF- κ B pathway," Open Medicine, vol. 16 no. 1, pp. 1683-1695, DOI: 10.1515/med-2021-0369, 2021.
[49] G. Qi, C. Zhang, H. Ma, Y. Li, J. Peng, J. Chen, B. Kong, Am Journal Cancer Res, vol. 11 no. 2, pp. 389-415, 2021.
[50] J. Chen, X. Chen, "MYBL2 is targeted by miR-143-3p and regulates breast cancer cell proliferation and apoptosis," Oncology Research Featuring Preclinical and Clinical Cancer Therapeutics, vol. 26 no. 6, pp. 913-922, DOI: 10.3727/096504017x15135941182107, 2018.
[51] Y. Wang, L. Lei, Y.-G. Chi, L. B. Liu, B. P. Yang, European Review for Medical and Pharmacological Sciences, vol. 23 no. 19, pp. 8257-8264, DOI: 10.26355/eurrev_201910_19136, 2019.
[52] V. S. Pankratz, Z. Sun, J. Aakre, Y. Li, C. Johnson, Y. I. Garces, M. C. Aubry, J. R. Molina, D. A. Wigle, P. Yang, Journal of Thoracic Oncology, vol. 6 no. 9, pp. 1488-1495, DOI: 10.1097/JTO.0b013e318223bf05, 2011.
[53] H. Walston, A. N. Iness, L. Litovchick, "DREAM on: cell cycle control in development and disease," Annual Review of Genetics, vol. 55 no. 1, pp. 309-329, DOI: 10.1146/annurev-genet-071819-103836, 2021.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2023 Yikai Wang et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/
Abstract
Introduction and Aims. Hepatocellular carcinoma (HCC) is one of the most lethal tumors of the digestive system, but its mechanisms remain unclear. The purpose of this study was to study HCC-related genes, build a survival prognosis prediction model, and provide references for treatment and mechanism research. Methods. Transcriptome data and clinical data of HCC were downloaded from the TCGA database. Screen important genes based on the random forest method, combined with differential expression genes (DEGs) to screen out important DEGs. The Kaplan‒Meier curve was used to evaluate its prognostic significance. Cox regression analysis was used to construct a survival prognosis prediction model, and the ROC curve was used to verify it. Finally, the mechanism of action was explored through GO and KEGG pathway enrichment and GeneMANIA coexpression analyses. Results. Seven important DEGs were identified, three were highly expressed and four were lowly expressed. Among them, GPRIN1, MYBL2, and GSTM5 were closely related to prognosis (
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details









1 Department of Infectious Diseases, The Second Affiliated Hospital of Xi’an Jiaotong University, Xi’an, Shaanxi 710004, China
2 Department of Operating Room, The Second Affiliated Hospital of Xi’an Jiaotong University, Xi’an 710004, Shaanxi Province, China